1

尝试使用 tm_term_score 为行中的文本将术语短语或术语对评分为 1 或 0 会得到此错误:

c1 <- Corpus(VectorSource(r1))
inspect(c1)
inspect(c1[2])
cleanR1 <- tm_map(c1, removeWords, stopwords('english'))
cleanR1 <- tm_map(cleanR1, content_transformer(tolower))
cleanR1 <- tm_map(cleanR1, removeWords, c("re", "ve", "ahha"))   
cleanR1 <- tm_map(cleanR1, removePunctuation)
cleanR1 <- tm_map(cleanR1, removeSpecialChars)
cleanR1 <- tm_map(cleanR1, removeNumbers)
cleanR1 <- tm_map(cleanR1,removeWords,stopwords('en'))
cleanR1 <- tm_map(cleanR1,stripWhitespace)
cleanR1 <- tm_map(cleanR1,stemDocument)
cleanR1 <- tm_map(cleanR1, PlainTextDocument)

tm_term_score(cleanR1, terms, FUN = slam::row_sum) 

UseMethod(“tm_term_score”,x)中的错误:没有适用于“tm_term_score”的方法应用于类“c('VCorpus','Corpus')”的对象

str(terms) chr [1:61] “帮助学习”“帮助成长”。

4

1 回答 1

1

您需要将 aPlainTextDocument返回的词频termFreq或 a传递给 of而不是语料库TermDocumentMatrix的第一个参数。tm_term_score

使用术语文档矩阵的示例:

tdm <- TermDocumentMatrix(cleanR1)
tm_term_score(tdm, terms, FUN = slam::row_sum)

请参阅文档,?tm::tm_term_score

于 2015-08-11T10:46:56.190 回答