擅长:python、mysql、java
<p>我认为最简单的方法是在将数组传递给函数之前组合数组。在</p>
<pre><code>allwords = [inner for outer in stem_list for inner in outer]
fdist = nltk.FreqDist(allwords)
for word, frequency in fdist.most_common(50):
print(y'{};{}'.format(word, frequency))
</code></pre>
<p>或更短:</p>
^{pr2}$
<p>我认为你的意见是:</p>
<pre><code>stem_list = [[u'anyon', u'think', u'forgotten', u'day', u'parti', u'friend', u'friend', u'paymast', u'us', u'longer', u'memori'],
[u'valu', u'friend', u'bought', u'properti', u'actual', u'relev', u'repres', u'actual', u'valu', u'properti'],
[u'monster', u'wreck', u'reef', u'cargo', u'vessel', u'week', u'passeng', u'ship', u'least', u'24', u'hour', u'upload', u'com'],
[.....], etc for the other sentences ]
</code></pre>
<p>所以你有两个数组-第一个是句子,第二个是sentenc中的单词。在allwords=[inner for outer in stem_list for inner in outer]的情况下,你可以遍历句子并将它们组合成一个单词数组。在</p>