擅长:python、mysql、java
<p>你们需要把你们的内容分成几行,然后每一行在短划线上分开,并保留第一部分</p>
<pre><code>def partiesExtractor(doc):
executives = []
for line in doc.splitlines()[1:]:
executives.append(line.split("-")[0].strip())
return executives
# return [line.split("-")[0].strip() for line in doc.splitlines()[1:]] # list-comprenhension
text = """Executives
Garet Hayes - Director, Public Relations
Ronald Allen - Chief Executive Officer
Gilbert Danielson - Executive Vice President and Chief Financial Officer
Steven Michaels - President
John Robinson - Executive Vice President and President and Chief Executive
Officer, Progressive Finance Holdings LLC"""
print(partiesExtractor(text)) # ['Garet Hayes', 'Ronald Allen', 'Gilbert Danielson', 'Steven Michaels', 'John Robinson']
</code></pre>
<p>您也可以使用<code>regex</code></p>
<pre><code>def partiesExtractor(doc):
return re.findall("^[A-Z][a-z]+ [A-Z][a-z]+", doc, flags=re.MULTILINE)
</code></pre>