擅长:python、mysql、java
<p>除了@blhsing的回答,我建议使用一个冰冻套;同时进行替换(当然,除非出于其他目的需要保留原始列表):</p>
<pre><code>def UNKWords(words):
return frozenset(word for word, count in Counter(words).items() if count == 1)
wordsToBeReplaced = UNKWords(trainingWords)
for i, word in enumerate(trainingWords):
if word in wordsToBeReplaced:
trainingWords[i] = '<UNK>'
</code></pre>