擅长:python、mysql、java
<p>您可以使用<a href="https://docs.python.org/3/library/collections.html#collections.Counter" rel="nofollow">^{<cd1>}</a>数据类型计算每个标签的频率,如下所示:</p>
<pre><code>from collections import Counter
freq = Counter()
with open("twitter_data.txt") as data:
for line in data:
for part in line.split():
if "#" in part:
freq[part] += 1
print(freq.most_common())
</code></pre>
<p>根据问题和现有代码的结构,<code>twitter_data.txt</code>看起来像这样(每条tweet用newline分隔):</p>
^{pr2}$
<p>在此示例文件上运行上述代码将生成以下输出:</p>
^{3}$