擅长:python、mysql、java
<p>您可以使用<a href="https://pandas.pydata.org/pandas-docs/stable/generated/pandas.Series.isin.html#pandas-series-isin" rel="nofollow noreferrer">^{<cd1>}</a></p>
<pre><code>providers = {'gmail', 'aol', 'yahoo','hotmail', 'outlook'}
data['emailDomain'].isin(providers)
</code></pre>
<h2>正在搜索提供程序</h2>
<p>与对每行中的每个电子邮件应用<code>re</code>不同,您可以使用<a href="http://pandas.pydata.org/pandas-docs/stable/api.html#string-handling" rel="nofollow noreferrer">^{<cd3>}</a>方法一次对一个列执行该操作</p>
^{pr2}$
<p>所以就变成这样:</p>
<pre><code>pattern2 = '(?<=@)([^.]+)(?=\.)'
providers = {'gmail', 'aol', 'yahoo','hotmail', 'outlook'}
df = pd.DataFrame(data={'email': ['test.1@gmail.com', 'test.2@aol.com', 'test3@something.eu']})
provider_serie = df['email'].str.extract(pattern2, expand=False)
</code></pre>
<blockquote>
<pre><code>0 gmail
1 aol
2 something
Name: email, dtype: object
</code></pre>
</blockquote>
^{4}$
<blockquote>
<pre><code>0 True
1 True
2 False
Name: email, dtype: bool
</code></pre>
</blockquote>
<p>如果您真的想要<code>0</code>s和<code>1</code>s,可以添加<code>.astype(int)</code></p>