无法使Counter（）在python中工作

from nltk import trigrams from nltk.tokenize import wordpunct_tokenize from nltk import bigrams from collections import Counter import nltk text= ["This is an example sentence."] trigram_top= ['PRP', 'MD', 'VB'] for words in text: tokens = wordpunct_tokenize (words) tags = nltk.pos_tag (tokens) trigram_list=trigrams(tags) list_tri=Counter (t for t in trigram_list if t in trigram_top) print list_tri

1条回答

网友

1楼 · 发布于 2024-10-02 12:37:40

让我们放一些print来调试：

from nltk import trigrams
from nltk.tokenize import wordpunct_tokenize
from nltk import bigrams
from collections import Counter
import nltk
text= ["This is an example sentence."]
trigram_top= ['PRP', 'MD', 'VB']

for words in text:
    tokens = wordpunct_tokenize (words)
    print tokens
    tags = nltk.pos_tag (tokens)
    print tags
    list_tri=Counter (t[0] for t in tags if t[1] in trigram_top)
    print list_tri

#['This', 'is', 'an', 'example', 'sentence', '.']
#[('This', 'DT'), ('is', 'VBZ'), ('an', 'DT'), ('example', 'NN'), ('sentence', 'NN'), ('.', '.')]
#Counter()

注意，list=部分是多余的，我已经将生成器更改为只接受单词而不是pos标记

我们可以看到，没有一个pos标记直接匹配您的trigram_top-您可能需要修改您的比较检查来迎合VB/VBZ。。。在

一种可能是改变路线：

^{pr2}$

相关问题更多 >

编程相关推荐

热门问题

热门文章