How do I interpret the P numbers generated by fairseq?

Why is it rare to discover new marine mammal species?

S-0 Why is it rare to discover new marine mam@@ mal species ?

H-0 -0.0643349438905716 Pourquoi est-il rare de découvrir de nouvelles espèces de mammifères marins?

P-0 -0.0763 -0.1849 -0.0956 -0.0946 -0.0735 -0.1150 -0.1301 -0.0042 -0.0321 -0.0171 -0.0052 -0.0062 -0.0015

1 Answer

User

Reply #1 · Posted 2024-06-26 10:25:19

Q: I'm wondering if it is reasonable to say a low (absolute) number in the P row means higher confidence in that particular word?

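Yes. As the docs say, "P is the positional score per token position". The scores are actually log-probabilities, so the higher the score (that is, the lower its absolute value), the more "confident" the model is. The source code may not be that easy to follow, but the scores are produced during generation, and there you can see that they are normalized (which includes a log, whether you use a single model or an ensemble). Also, when printing the scores, they convert them from base e to base 2: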
```
import math

# sample_id and hypo come from the surrounding fairseq generation loop
print('P-{}\t{}'.format(
    sample_id,
    ' '.join(map(
        lambda x: '{:.4f}'.format(x),
        # convert from base e to base 2
        hypo['positional_scores'].div_(math.log(2)).tolist(),
    ))
))
```
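To make "confidence" more concrete, here is a minimal sketch that turns the P row from the question back into per-token probabilities. It assumes, as described above, that the printed scores are base-2 log-probabilities; the variable names are mine and are not part of fairseq.

```python
# P-0 scores copied from the example output in the question.
# Assumption: they are base-2 log-probabilities, as described in this answer.
p_scores = [-0.0763, -0.1849, -0.0956, -0.0946, -0.0735, -0.1150, -0.1301,
            -0.0042, -0.0321, -0.0171, -0.0052, -0.0062, -0.0015]

# Undo the base-2 log to recover the probability assigned to each token.
token_probs = [2 ** s for s in p_scores]

# Position of the token the model was least sure about (most negative score).
least_confident = min(range(len(p_scores)), key=lambda i: p_scores[i])

for i, (score, prob) in enumerate(zip(p_scores, token_probs)):
    print(f"position {i}: score={score:.4f}  prob={prob:.3f}")
print("least confident position:", least_confident)
```

A score close to 0 maps to a probability close to 1, so the last few positions in this example are the ones the model was most sure about.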

Q: What I'm trying to work out is if I can use either the H number, or somehow to use the individual P numbers, to get a confidence measure in its translation.

It turns out that the H value is simply the average of the P values, as you can see here:
```
score_i = avg_probs_i.sum() / tgt_len
```
It is also converted to base 2. You can check this on your example:
```
import numpy as np
print(np.mean([-0.0763,-0.1849 ,-0.0956 ,-0.0946 ,-0.0735 ,-0.1150 ,-0.1301 ,-0.0042 ,-0.0321 ,-0.0171 ,-0.0052 ,-0.0062 ,-0.0015]))
# >>> -0.06433076923076922
```
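Another measure often used to evaluate language model performance is perplexity. A nice thing is that perplexity can be computed easily from the P values, as shown in the Language Model example of the fairseq repository: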
```
# Compute perplexity for a sequence
en_lm.score('Barack Obama is coming to Sydney and New Zealand')['positional_scores'].mean().neg().exp()
# tensor(15.1474)
```
I'm not an NLP expert, so I can't really tell you which one you should use in your case.
