<p>我想确定牢房里是否有“麦当劳”这个词。然而,我希望忽略“McDonald”之前的单词有首个大写字母的情况,如“Kevin McDonald”。有没有建议如何在一个数据帧中通过正则表达式来实现它</p>
<pre><code>data = {'text':["Kevin McDonald has bought a burger.",
"The best burger in McDonald is cheeze buger."]}
df = pd.DataFrame(data)
long_list = ['McDonald', 'Five Guys']
# matching any of the words
pattern = r'\b{}\b'.format('|'.join(long_list))
df['count'] = df.text.str.count(pattern)
</code></pre>
<pre><code> text
0 Kevin McDonald has bought a burger.
1 The best burger in McDonald is cheeze buger.
</code></pre>
<p>预期产出:</p>
<pre><code> text count
0 Kevin McDonald has bought a burger. 0
1 The best burger in McDonald is cheeze buger. 1
</code></pre>