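I have a large data frame of similarities between keywords, and I want to convert it into a format that contains all pairwise similarities. My current data frame looks like this: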
> df
kwd1 kwd2 sim
1 a b 1
2 b a 1
3 c a 2
4 a c 2
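I would like to convert it into a data.frame of the following form, where each unordered pair of keywords appears exactly once and pairs that are missing from the input get a similarity of 0: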
> df
kwd1 kwd2 sim
a b 1
a c 2
b c 0
Thanks for your help!
My code so far is:
df <- data.frame(c('a', 'b', 'c', 'a'), c('b', 'a', 'a', 'c'), c(1, 1, 2, 2))
colnames(df) <- c('kwd1', 'kwd2', 'sim')
> dput(df)
structure(list(kwd1 = structure(c(1L, 2L, 3L, 1L), .Label = c("a",
"b", "c"), class = "factor"), kwd2 = structure(c(2L, 1L, 1L,
3L), .Label = c("a", "b", "c"), class = "factor"), sim = c(1,
1, 2, 2)), .Names = c("kwd1", "kwd2", "sim"), row.names = c(NA,
-4L), class = "data.frame")
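One possible way to get the desired output, as a minimal sketch rather than a definitive solution: build every unordered keyword pair with `combn()`, then look up each pair's similarity through an order-independent key, filling in 0 where no match exists. The names `k1`, `k2`, `kw`, `pair_key`, and `res` are made up for illustration. The sketch assumes that similarities are symmetric (so `a b` and `b a` carry the same value) and that no keyword contains the `|` separator.

```r
# Sample data from the question, with keywords kept as character
df <- data.frame(kwd1 = c("a", "b", "c", "a"),
                 kwd2 = c("b", "a", "a", "c"),
                 sim  = c(1, 1, 2, 2),
                 stringsAsFactors = FALSE)

# Convert explicitly so the code also works if the columns are factors
k1 <- as.character(df$kwd1)
k2 <- as.character(df$kwd2)

# All distinct keywords, sorted
kw <- sort(unique(c(k1, k2)))

# Every unordered pair of keywords, one pair per row
pairs <- t(combn(kw, 2))
res <- data.frame(kwd1 = pairs[, 1], kwd2 = pairs[, 2],
                  stringsAsFactors = FALSE)

# Order-independent key: "a|b" for both (a, b) and (b, a)
# (assumes "|" never occurs inside a keyword)
pair_key <- function(x, y) paste(pmin(x, y), pmax(x, y), sep = "|")

# Find the first matching row in df for each pair; NA means no match
idx <- match(pair_key(res$kwd1, res$kwd2), pair_key(k1, k2))

# Use the matched similarity, or 0 when the pair is absent from df
res$sim <- ifelse(is.na(idx), 0, df$sim[idx])

print(res)
```

Because `match()` returns only the first hit, the sketch silently picks one value if the two directions of a pair disagree. For a very large data frame, a join (for example `merge()` on the key column) may scale better than this lookup, although the number of pairs still grows quadratically with the number of keywords.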