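<p>Here is an answer that assumes the substring you are looking for in each string either lies between two newline characters or sits at the start of the string, followed by a newline.</p>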
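<pre><code>import ast
import re

def find_substring(string):
    # Each item is assumed to be the repr of a bytes literal; parse it safely, then decode.
    string = ast.literal_eval(string).decode('UTF-8')
    # Collect every newline-terminated line.
    pattern = r"\n?.*\.?\n"
    lst = re.findall(pattern, string)
    if len(lst) == 1:
        # Only one such line: the substring sits at the start of the string.
        substring = lst[0].strip(".\n")
    else:
        # Otherwise take the first line enclosed by two newlines.
        pattern2 = r"\n.*\n"
        lst2 = re.findall(pattern2, "".join(lst))
        substring = lst2[0].strip("\n")
    return substring
</code></pre>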
<p>Then map it over the list like this:</p>
<p><code>list(map(find_substring, extracted))</code></p>
<p>This produces:</p>
<pre><code>['SYRUP',
'mi',
'100',
'SYRUP',
'tamol, Ambroxol k',
'Guaiphenesin',
'Syrup',
'ol HCl &',
'quantity']
</code></pre>
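<p>To see how the two branches of <code>find_substring</code> behave, here is a minimal sketch on made-up inputs. The name <code>find_substring_safe</code> and the three sample strings are hypothetical, not the asker's real data. The sketch assumes each item is the repr of a bytes literal, parses it with <code>ast.literal_eval</code>, and returns <code>None</code> when no line matches rather than raising an <code>IndexError</code>.</p>

```python
import ast
import re


def find_substring_safe(string):
    """Hypothetical variant of find_substring that returns None on no match."""
    # Assumption: each item is the repr of a bytes literal, e.g. "b'SYRUP\\n...'".
    text = ast.literal_eval(string).decode("UTF-8")
    # Collect every newline-terminated line.
    lines = re.findall(r"\n?.*\.?\n", text)
    if len(lines) == 1:
        # Only one such line: the substring sits at the start of the string.
        return lines[0].strip(".\n")
    # Otherwise look for the first line enclosed by two newlines.
    enclosed = re.findall(r"\n.*\n", "".join(lines))
    return enclosed[0].strip("\n") if enclosed else None


# Hypothetical sample inputs covering: leading line, enclosed line, no newline.
samples = ["b'SYRUP\\nrest'", "b'abc\\nmi\\ndef'", "b'no newline here'"]
results = [find_substring_safe(s) for s in samples]
print(results)
```

<p>The first sample takes the single-line branch, the second takes the enclosed-line branch, and the third has no newline at all, so it falls through to <code>None</code>.</p>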