2

我正在尝试操作数据框。举个例子:假设我有一个包含客户和他们访问的商店的数据框:

df = data.frame(customers = c("a", "b", "b", "c", "c"),
                shop_visited = c("X", "X", "Y", "X", "Z"))
customers shop_visited
        a            X
        b            X
        b            Y
        c            X
        c            Z

总结这个数据框:

  • 一位顾客 ( ) 在和b购物;XY
  • 一位顾客 ( ) 在和b购物;YX
  • 一位顾客 ( ) 在和c购物;XZ
  • 一位顾客 ( c)ZX

或者,更简洁地说:

relations = data.frame(source = c("X","Y", "X", "Z"), 
                       target = c("Y","X","Z","X"))
 source target
      X      Y
      Y      X
      X      Z
      Z      X

我正在寻找一种能够进行转换的方法df -> relationsrelations这背后的动机是我可以edgeswrite.gexf. 为任何帮助而欢呼。

4

2 回答 2

3
df <- data.frame(customers = c("a", "b", "b", "c", "c"),
                 shop_visited = c("X", "X", "Y", "X", "Z"))

#create an identifier df
dfnames <- data.frame(i = as.numeric(df$shop_visited), 
                      shop_visited = df$shop_visited)

library(tnet)
tdf       <- as.tnet( cbind(df[,2],df[,1]),type =  "binary two-mode tnet" )
relations <- projecting_tm(tdf, method = "sum")

# match original names
relations[["i"]] <- dfnames[ match(relations[['i']], dfnames[['i']] ) , 'shop_visited']
relations[["j"]] <- dfnames[ match(relations[['j']], dfnames[['i']] ) , 'shop_visited']

# clean up names
names(relations) <- c("source" , "target", "weight")


#> relations
#  source target weight
#1      X      Y      1
#2      X      Z      1
#3      Y      X      1
#4      Z      X      1
于 2013-04-23T19:28:04.200 回答
3

请看一下(http://www.inside-r.org/packages/cran/rgexf/docs/edge.list)的功能edge.list。使用您的示例将是这样的rgexf

库(rgexf)

# Your data
df = data.frame(customers = c("a", "b", "b", "c", "c"),
                shop_visited = c("X", "X", "Y", "X", "Z"))

# Getting nodes and edges
df2 <- edge.list(df)

看起来像这样

> df2
$nodes
  id label
1  1     1
2  2     2
3  3     3

$edges
     [,1] [,2]
[1,]    1    1
[2,]    2    1
[3,]    2    2
[4,]    3    1
[5,]    3    3

最后,您可以使用它来编写 GEXF 图

# Building the graph
write.gexf(nodes=df2$nodes, edges=df2$edges)

<?xml version="1.0" encoding="UTF-8"?>
<gexf xmlns="http://www.gexf.net/1.2draft" xmlns:viz="http://www.gexf.net/1.1draft/viz" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.gexf.net/1.2draft http://www.gexf.net/1.2draft/gexf.xsd" version="1.2">
  <meta lastmodifieddate="2013-08-06">
    <creator>NodosChile</creator>
    <description>A graph file writing in R using "rgexf"</description>
    <keywords>gexf graph, NodosChile, R, rgexf</keywords>
  </meta>
  <graph mode="static">
    <nodes>
      <node id="1" label="1"/>
      <node id="2" label="2"/>
      <node id="3" label="3"/>
    </nodes>
    <edges>
      <edge id="0" source="1" target="1" weight="1.0"/>
      <edge id="1" source="2" target="1" weight="1.0"/>
      <edge id="2" source="2" target="2" weight="1.0"/>
      <edge id="3" source="3" target="1" weight="1.0"/>
      <edge id="4" source="3" target="3" weight="1.0"/>
    </edges>
  </graph>
</gexf>

如果您对 nodoschile.org 上的 George dot vega 有任何疑问,请告诉我

最好的!

乔治(创造者rgexf

于 2013-08-06T18:16:27.830 回答