r - R中的树状图。如何按每个集群创建新表

Question

library(mlbench)
library(stats)


College <- read.csv("colleges.XL.csv", header=T) ## this data has 23 columns
na.college<- na.omit(College)

row.names(na.college) <- NULL

na.college[, c(4:23)] <- scale(as.matrix(na.college[,c(-1,-2,-3)]))
###before making dendrogram, standardization is needed.

plot(hc<-hclust(dist(na.college[,c(-1,-2,-3)]),method="complete"),hang=-1)
##now the dendrogram is drawn.

groups <- cutree(hc, k=10) # cut tree into 5 clusters
# draw dendogram with red borders around the 5 clusters 
rect.hclust(hc, k=10, border="red")
## identifying 10 clusters by red borders

我想从这个树状图中制作一些表格。我有 10 个集群，每个集群都有以观察编号表示的元素（我可以细化树状图底部的观察编号）。因为我可以将树状图打印为 pdf 文件，所以我可以通过拖放复制所有观察编号。

现在的问题是，如何通过观察数字制作数据表？我想制作列与树状图原始数据“na.college”相同的表。

我想制作第二个集群的数据表，但我不知道该怎么做。

如果您有任何答案，请告诉我。

score 1 · Accepted Answer

尝试这个：

# your matrix dimensions have to match with the clustering results
# remove any columns from na.college, as you did for clustering
mat <- na.college[,-c(1:3)]

# select the data based on the clustering results
cluster_2 <- mat[which(groups==2),]

如果您想保存所有集群，最好将其保存为list：

# each list entry will correspond to the cluster number
clust_list <- lapply(sort(unique(groups)), function(x) mat[which(groups==x),])

# to retrieve i.e cluster 2:
cluster_2 <- clust_list[[2]]

r - R中的树状图。如何按每个集群创建新表

1 回答 1

Related

Reference