r - hclust and dist in R: how are the words clustered?

Asked 2017-01-18T19:49:24.760

Viewed 648 times

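I am working with some tweet data obtained through the Twitter API with OAuth. I am building a cluster dendrogram to cluster the words used in 500 tweets. I am not sure how the hclust function clusters the words, i.e. which rule it follows when grouping them. I need a logical explanation.

Here is the code: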

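 # Drop terms that are absent from more than 99.2% of the tweets
 tweetTDM.reduced <- removeSparseTerms(tweetTDM, 0.992)

 # Pairwise Euclidean distance between the rows (terms) of the matrix
 d <- dist(tweetTDM.reduced, method = "euclidean")
 # Agglomerative hierarchical clustering with the "ward.D" linkage
 fit <- hclust(d, method = "ward.D")
 plot(fit)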

I have included an image of what I am looking at in R.
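
To make the question concrete, here is a minimal sketch of what the two calls appear to do, run on a tiny made-up matrix instead of my tweets. The term names, tweet names, and counts are invented for illustration, and it assumes the rows are terms and the columns are tweets, as in a term-document matrix.

```r
# Toy term-document matrix: rows are terms, columns are tweets.
# All values are made up for illustration.
m <- matrix(c(2, 0, 1, 0,
              2, 0, 1, 0,
              0, 3, 0, 1,
              0, 2, 0, 1),
            nrow = 4, byrow = TRUE,
            dimnames = list(c("coffee", "latte", "rain", "storm"),
                            paste0("tweet", 1:4)))

# dist() compares rows: each term is a vector of counts across tweets,
# and the Euclidean distance between terms a and b is sqrt(sum((a - b)^2)).
d <- dist(m, method = "euclidean")
print(d)

# hclust() starts with every term in its own cluster and repeatedly merges
# the two clusters that are closest under the chosen linkage method.
fit <- hclust(d, method = "ward.D")

# fit$merge records which clusters were joined at each step,
# fit$height records the dissimilarity at which each merge happened.
print(fit$merge)
print(fit$height)

# Cut the tree into two groups of terms.
groups <- cutree(fit, k = 2)
print(groups)
```

In this sketch, terms that occur in the same tweets with similar counts end up close together and are merged early, which is what the low branches of the dendrogram would show. One thing worth checking against the `hclust` documentation: it notes that `ward.D` does not implement Ward's (1963) criterion unless the distances are squared first, while `ward.D2` does, so the choice between them may change the tree.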

0 answers