擅长:python、mysql、java
<p>这是一个真正的文本分类器,
与sklearn和NLTK一起工作</p>
<pre><code>from collections import defaultdict
refsets = defaultdict(set)
testsets = defaultdict(set)
labels = []
tests = []
for i, (feats, label) in enumerate(testset):
refsets[label].add(i)
observed = classifier.classify(feats)
testsets[observed].add(i)
labels.append(label)
tests.append(observed)
print(metrics.confusion_matrix(labels, tests))
print(nltk.ConfusionMatrix(labels, tests))
</code></pre>