NLTK:TypeError:不可损坏类型：“list”

from nltk.translate.bleu_score import sentence_bleu reference = [['this', 'is', 'a', 'test'], ['this', 'is' 'test']] candidate = ['this', 'is', 'a', 'test'] score = sentence_bleu(reference, candidate) print(score)

Traceback (most recent call last): File "nltk-bleu-score.py", line 56, in <module> score = sentence_bleu(reference, candidate) File "C:\Users\Fachri\Anaconda3\lib\site-packages\nltk\translate\bleu_score.py", line 89, in sentence_bleu emulate_multibleu) File "C:\Users\Fachri\Anaconda3\lib\site-packages\nltk\translate\bleu_score.py", line 162, in corpus_bleu p_i = modified_precision(references, hypothesis, i) File "C:\Users\Fachri\Anaconda3\lib\site-packages\nltk\translate\bleu_score.py", line 292, in modified_precision counts = Counter(ngrams(hypothesis, n)) if len(hypothesis) >= n else Counter() File "C:\Users\Fachri\Anaconda3\lib\collections\__init__.py", line 535, in __init__ self.update(*args, **kwds) File "C:\Users\Fachri\Anaconda3\lib\collections\__init__.py", line 622, in update _count_elements(self, iterable) TypeError: unhashable type: 'list'

Opening references file... [['two', 'airplanes', 'are', 'waiting', 'on', 'the', 'tarmac'], ['Two', 'airplanes', 'parked', 'at', 'the', 'airport', '.']] Opening candidates file... [['An', 'airplane', 'sitting', 'on', 'the', 'tarmac', 'at', 'an', 'airport', 'with', 'another', 'plane', 'in', 'the', 'background', '.']]

1条回答

网友

1楼 · 发布于 2024-10-01 00:31:50

hypothesis的预期类型是list(str)，来自{a1}：

:type hypothesis: list(str)

candidate是一个list(list(str))，可以这样计算bleu分：

from nltk.translate.bleu_score import sentence_bleu

references = [['two', 'passenger', 'planes', 'on', 'a', 'grassy', 'plain'],
              ['An', 'airplane', 'sitting', 'on', 'the', 'tarmac', 'at', 'an', 'airport', 'with', 'another', 'plane',
               'in', 'the', 'background', '.'],
              ['A', 'white', 'an', 'blue', 'airplane', 'parked', 'at', 'the', 'airport', 'near', 'another', 'small',
               'plane', '.'], ['Blue', 'and', 'white', 'airplane', 'parked', '.'],
              ['two', 'airplanes', 'are', 'waiting', 'on', 'the', 'tarmac'],
              ['Two', 'airplanes', 'parked', 'at', 'the', 'airport', '.'],
              ['A', 'passenger', 'aircraft', 'with', 'landing', 'gear', 'down', '.'],
              ['A', 'passenger', 'jet', 'flies', 'through', 'the', 'air', '.'],
              ['A', 'passenger', 'plane', 'fly', 'through', 'the', 'sky', '.'],
              ['The', 'Austrian', 'plane', 'soars', 'in', 'the', 'sky', '.']]

candidates = [
    ['An', 'airplane', 'sitting', 'on', 'the', 'tarmac', 'at', 'an', 'airport', 'with', 'another', 'plane', 'in', 'the',
     'background', '.'], ['A', 'passenger', 'jet', 'flies', 'through', 'the', 'air', '.']]

for candidate in candidates:
    print(sentence_bleu(references, candidate))

输出

^{pr2}$

相关问题更多 >

编程相关推荐

热门问题

热门文章