擅长:python、mysql、java
<p>应使用正则表达式提取所需的数据:</p>
<pre><code>import re
import os, os.path
PATH = 'path/to/your/files/'
conclusions = []
for file in os.listdir(path):
with open(os.path.join(PATH, file)) as f:
data = f.read()
conclusion = re.search('CONCLUSION: (.*?)([A-Z]{2,})', data).group(1)
conclusions.append(conclusion)
</code></pre>
<p>这将查找<code>'CONCLUSION: '</code>头,然后扫描之后的数据,在下一个标题之后停止,该标题将始终是您指定的大写单词。在</p>