8

I have a data frame which contains a customerid, and a list. I would like to merge those list pertaining to the same customer.

library(plyr)
subsets <- list(c("a", "d", "e"), c("a", "b", "c", "e"))
customerids <- c(1,1)
transactions <- data.frame(customerid = customerids,subset =I(subsets))
> transactions
  customerid     subset
1          1    a, d, e
2          1 a, b, c, e

If I want to merge the subsets with ddply, I get an expanded result

> ddply(transactions, .(customerid), summarise, subset=Reduce(union,subset))
  customerid subset
1          1   a
2          1   d
3          1   e
4          1   b
5          1   c

while I would have expected all the results in 1 row.

4

1 回答 1

4

你可以这样做:

ddply(transactions, .(customerid), function(x) 
            data.frame(subset=I(list(unlist(x$subset)))))

编辑:我不确定我是否遵循您的评论。但是,如果您只想要每个customeridfor 中的唯一值,subset那么:

ddply(transactions, .(customerid), function(x) 
            data.frame(subset=I(list(unique(unlist(x$subset))))))
于 2013-07-15T21:30:04.213 回答