我一直在用sklearn做情绪分析。我有一个3000多个评论的csv文件,我正在训练我的分类器60%的评论。 当我尝试为分类器提供自定义检查以预测标签时,使用CountVectorizer.transform()它引发以下错误:
Anaconda\lib\site-packages\sklearn\feature_extraction\text.py", line 864, in transform
raise ValueError("Vocabulary wasn't fitted or is empty!")
ValueError: Vocabulary wasn't fitted or is empty!
请帮帮我,这是安装训练套件的代码:
^{pr2}$这是预测定制评论情绪的代码:
def customQuestionScorer(question, clf):
X_new_tfidf = vectorizer.transform([question]).toarray()
print (clf.predict(X_new_tfidf))
q = "I really like this movie"
customQuestionScorer(q,classifier)
这里是文本处理的好例子http://scikit-learn.org/stable/auto_examples/model_selection/grid_search_text_feature_extraction.html#example-model-selection-grid-search-text-feature-extraction-py
相关问题 更多 >
编程相关推荐