擅长:python、mysql、java
<pre><code>In [1]: import pandas as pd
In [2]: a = [(('@UniverCurious:', 'The'), 39),
...: (('The', 'underside'), 38),
...: (('underside', 'of'), 38),
...: (('of', 'Jupiter.'), 38),
...: (('Jupiter.', 'Credit:'), 38),
...: (('Credit:', 'NASA/JPL/JUNO'), 38),
...: (('to', 'the'), 25),
...: (('just', '100'), 15),
...: (('20', 'years'), 14)]
In [3]: ngram_list = [" ".join(p[0]) for p in a]
In [4]: cnt_list = [p[1] for p in a]
In [5]: df = pd.DataFrame(list(zip(ngram_list, cnt_list)), columns=['bigram', 'cnt'])
In [6]: df
Out[6]:
bigram cnt
0 @UniverCurious: The 39
1 The underside 38
2 underside of 38
3 of Jupiter. 38
4 Jupiter. Credit: 38
5 Credit: NASA/JPL/JUNO 38
6 to the 25
7 just 100 15
8 20 years 14
</code></pre>
<p>这个怎么样?对于wordcloud,您可能需要使用其他模块,如<code>wordcloud</code>。有关示例,请参见<a href="https://www.datacamp.com/community/tutorials/wordcloud-python" rel="nofollow noreferrer">this link</a></p>