擅长:python、mysql、java
<p>使用正则表达式,可以指定多个分隔符:</p>
<pre><code>import re
def clean_list(lst):
lst = [re.split('\n\n\n|\n\xa0\n',i) for i in lst]
return [[' '.join(i.split()) for i in sublist] for sublist in lst]
</code></pre>
<p><code>print(clean_list(text1), clean_list(text2))</code>:</p>
<pre><code>[['Pie Type', 'Main Ingrediënt', 'Country of Origin'], ['Applie Pie', 'Apples', 'United Kingdom']]
[['Pie Type', 'Main Ingrediënt', 'Country of Origin'], ['Apple Pie', 'Apples', 'United Kingdom']]
</code></pre>