擅长:python、mysql、java
<p>如果您将您的tweets加载到pandas数据框中,则可以非常轻松快速地对其进行过滤:</p>
<pre><code>In [11]:
df = pd.DataFrame({'tweet':['@Koningsbruggen tweeted: @CGCommunicatie are you guys in "KEYWORD"?', '@"KEYWORD"_lady tweeted: @rvanbommel yes thats okay']})
df
Out[11]:
tweet
0 @Koningsbruggen tweeted: @CGCommunicatie are y...
1 @"KEYWORD"_lady tweeted: @rvanbommel yes thats...
</code></pre>
<p>我们可以调用向量化的<a href="http://pandas.pydata.org/pandas-docs/stable/api.html#string-handling" rel="nofollow">^{<cd1>}</a>方法来<code>split</code>该tweet,并使用<code>contains</code>过滤它们:</p>
^{pr2}$
<p>有很多方法可以将数据加载到panda中:<a href="http://pandas.pydata.org/pandas-docs/stable/io.html" rel="nofollow">http://pandas.pydata.org/pandas-docs/stable/io.html</a></p>