擅长:python、mysql、java
<p>下面的代码为给定的句子生成一个<code>bigram</code>列表</p>
<pre><code>>>> import nltk
>>> from nltk.tokenize import word_tokenize
>>> text = "to be or not to be"
>>> tokens = nltk.word_tokenize(text)
>>> bigrm = nltk.bigrams(tokens)
>>> print(*map(' '.join, bigrm), sep=', ')
to be, be or, or not, not to, to be
</code></pre>