I have a contingency table in CSV format, produced with Python, that looks like this:

            case  control
disease_A    20    30 
disease_B    35    45
disease_C    42    52
disease_D    52    62

Now I want to derive 2x2 contingency tables from it so that I can compute chi-squared values in R.

How can I obtain a 2x2 table like the following from the contingency table above?

            case  control
disease_A    20    30 
disease_D    52    62

This may be a beginner question, but I am new to R and could not find a solution anywhere else.

2 Answers

Here is one way to do it.

Data:

txt <-  "           case  control
disease_A    20    30 
disease_B    35    45
disease_C    42    52
disease_D    52    62"

Read the data:

dat <- read.table(textConnection(txt))
#           case control
# disease_A   20      30
# disease_B   35      45
# disease_C   42      52
# disease_D   52      62

Extract a subset of rows:

dat2 <- dat[rownames(dat) %in% c("disease_A", "disease_D"), ]
#           case control 
# disease_A   20      30
# disease_D   52      62
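
Since the stated goal is a chi-squared value, the extracted 2x2 table can then be passed to chisq.test(). The following is a minimal sketch rather than part of the original answer; it rebuilds dat2 by hand so it runs on its own, and it assumes the default settings of chisq.test() are acceptable for your analysis.

```r
# Rebuild the 2x2 subset shown above so this block is self-contained.
dat2 <- data.frame(
  case      = c(20, 52),
  control   = c(30, 62),
  row.names = c("disease_A", "disease_D")
)

# Convert the data frame to a numeric matrix of counts and run the test.
# For 2x2 tables, chisq.test() applies Yates' continuity correction by
# default; pass correct = FALSE to turn it off.
res <- chisq.test(as.matrix(dat2))

res$statistic  # chi-squared value
res$parameter  # degrees of freedom
res$p.value    # p-value
```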
answered 2014-08-07T06:58:13.177

If M is of class table:

M <- structure(c(20, 35, 42, 52, 30, 45, 52, 62), .Dim = c(4L, 2L), .Dimnames = list(
c("disease_A", "disease_B", "disease_C", "disease_D"), c("case", 
"control")), class = "table")



# Convert to long format, keep the two rows of interest, and re-tabulate
xtabs(Freq ~ Var1 + Var2,
      data = subset(as.data.frame(M, stringsAsFactors = FALSE),
                    Var1 %in% c("disease_A", "disease_D")))
#            Var2
# Var1        case control
#   disease_A   20      30
#   disease_D   52      62

If M is a data.frame:

 M <- structure(list(case = c(20L, 35L, 42L, 52L), control = c(30L, 
 45L, 52L, 62L)), .Names = c("case", "control"), class = "data.frame", row.names =   c("disease_A", 
 "disease_B", "disease_C", "disease_D"))

 as.table(as.matrix(M[grep("A|D", rownames(M)),]))
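
The question mentions deriving 2x2 tables in the plural, so here is a hedged sketch of building one table for every pair of rows. The names pairs, tabs, and pvals are illustrative and not from the original answers, and testing every pair is an assumption about what is wanted.

```r
# Same data as above, rebuilt so this block runs on its own.
M <- data.frame(
  case      = c(20L, 35L, 42L, 52L),
  control   = c(30L, 45L, 52L, 62L),
  row.names = c("disease_A", "disease_B", "disease_C", "disease_D")
)

# All unordered pairs of row names: choose(4, 2) = 6 pairs.
pairs <- combn(rownames(M), 2, simplify = FALSE)

# One 2x2 matrix per pair, named like "disease_A_vs_disease_D".
tabs <- lapply(pairs, function(p) as.matrix(M[p, ]))
names(tabs) <- vapply(pairs, paste, character(1), collapse = "_vs_")

# Chi-squared p-value for each 2x2 table.
pvals <- vapply(tabs, function(tb) chisq.test(tb)$p.value, numeric(1))

tabs[["disease_A_vs_disease_D"]]
```

If many pairs are tested this way, p.adjust() can be applied to pvals to account for multiple comparisons.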
answered 2014-08-07T07:06:29.320