擅长:python、mysql、java
<p>R解决方案可能如下所示:</p>
<pre><code>library(tm)
docs <- c(D1 = "The sky is blue.",
D2 = "The sun is bright.",
D3 = "The sun in the sky is bright.")
dict <- c("sky","land","sea","water","sun","moon")
mat <- TermDocumentMatrix(Corpus(VectorSource(docs)),
control=list(weighting = weightTfIdf,
dictionary = dict))
as.matrix(mat)[dict, ]
# Docs
# Terms D1 D2 D3
# sky 0.5849625 0.0000000 0.2924813
# land 0.0000000 0.0000000 0.0000000
# sea 0.0000000 0.0000000 0.0000000
# water 0.0000000 0.0000000 0.0000000
# sun 0.0000000 0.5849625 0.2924813
# moon 0.0000000 0.0000000 0.0000000
</code></pre>