擅长:python、mysql、java
<p>下面是一个使用<code>re.findall</code>的选项:</p>
<pre><code>text1 = "sedentary. Allan Takocok. That's the conclusion of two studies published in this week's issue of The New England Journal of Medicine."
matches = re.findall(r'(?:(?<=^)|(?<=[^.]))\s+([A-Z][a-z]+)', text1)
print(matches)
</code></pre>
<p>这张照片:</p>
<pre><code>['Takocok', 'The', 'New', 'England', 'Journal', 'Medicine']
</code></pre>
<p>下面是正则表达式模式的解释:</p>
<pre><code>(?:(?<=^)|(?<=[^.])) assert that what precedes is either the start of the string,
or a non full stop character
\s+ then match (but do not capture) one or more spaces
([A-Z][a-z]+) then match AND capture a word starting with a capital letter
</code></pre>