擅长:python、mysql、java
<p>是的。请参见已安装/已转换的TF-IDF矢量器上的<code>.vocabulary_</code>。</p>
<pre><code>In [1]: from sklearn.datasets import fetch_20newsgroups
In [2]: data = fetch_20newsgroups(categories=['rec.autos'])
In [3]: from sklearn.feature_extraction.text import TfidfVectorizer
In [4]: cv = TfidfVectorizer()
In [5]: X = cv.fit_transform(data.data)
In [6]: cv.vocabulary_
</code></pre>
<p>这是一本字典的形式:</p>
<p><code>{word : column index in array}</code></p>