r - Preparing trimmed means from multiple data files for a repeated-measures ANOVA

Question

I have multiple data files (tab-delimited txt files) in the following format:

I have made some sample files:

https://docs.google.com/file/d/0B20HmmYd0lsFVGhTQ0EzRFFmYXc/edit?usp=sharing

https://docs.google.com/file/d/0B20HmmYd0lsFbWxmQzV6X0o2Y1E/edit?usp=sharing

Condition  Block  Session  Stimuli  Score  Reqresponse  Act  RT    extra
X          3      3        asdfa    1      a            a    500   0
Y          1      2        qewrq    0      b            a    1100  0

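I want to exclude outlying RTs and then run an ANOVA on the mean RT and the mean Score of each file (with Condition as the factor). So far I have done this in an extremely ugly way, and the result has one row per subject (I would prefer the rows to be laid out as subject x condition).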

My current attempt uses a for loop:

all_data <- data.frame(rbind(1:27))  # 1 x 27 placeholder; each row is overwritten per file
all_data
for(i in 1:2)
{
n <- paste(i, ".txt", sep="")   # file name, e.g. "1.txt"
a <- sprintf("table%d", i)      # table name (not used further below)
data <- read.table(n, header = TRUE, sep = "\t")

I fill columns 1:9 with the mean scores 1-9:

Score<-as.vector(tapply(data$Score,list(data$Condition,data$Reqresponse),mean))

for(o in 1:9)
{
all_data [i, o] <- Score[o]
}

Then I trim the RT values the way I want and put the resulting means into all_data, starting at column 10:

data <- data[which(data$RT>200),]
data <- do.call(rbind,by(data,data$Condition,function(x) x[!abs(scale(x$RT)) > 3,] ))
RT<-as.vector(tapply(data$RT,list(data$Condition,data$Reqresponse, data$Score),mean))
for(j in 1:18)
{
all_data [i, j+9] <- RT[j]
}
}

Also, this code must aesthetically offend any decent R user, so if you feel like it, please tell me how to fix it.

score 1 · Accepted Answer

I would use ddply from the plyr package to do this. For example:

library(plyr)
res <- lapply(list.files(pattern = '^[1-2]\\.txt$'), function(ff){
  ## read the file
  data <- read.table(ff, header = TRUE, quote = "\"")
  ## remove the outliers: RTs of 200 or less, then RTs more than 3 SD
  ## away from the mean of their Condition
  data <- data[data$RT > 200, ]
  data <- ddply(data, .(Condition), function(x) x[!abs(scale(x$RT)) > 3, ])
  ## compute the mean RT per Condition x Reqresponse x Score cell
  ddply(data, .(Condition, Reqresponse, Score), summarise, RT = mean(RT))
})

[[1]]
   Condition Reqresponse Score   RT
1          X           a     0  500
2          X           a     1  750
3          X           b     0  500
4          X           b     1  500
5          Y           a     0  400
6          Y           a     1  640
7          Y           b     1 1000
8          Z           a     0 1000
9          Z           a     1 1675
10         Z           b     0  400

[[2]]
   Condition Reqresponse Score   RT
1          X           a     0  500
2          X           a     1  750
3          X           b     0  500
4          X           b     1  500
5          Y           a     0  400
6          Y           a     1  640
7          Y           b     1 1000
8          Z           a     0 1000
9          Z           a     1 1675
10         Z           b     0  400
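
To get from the list `res` to the subject x condition layout asked for in the question, one option is to stack the per-file results and tag each with a subject id. The following is only a sketch under assumptions: it uses a made-up stand-in for `res` with invented RT values, it assumes each file is one subject, and it assumes each element has already been reduced to one row per Condition (for example by grouping only on Condition in the last ddply call). The names `long` and `fit` are mine, not from the question.

```r
# Hypothetical stand-in for `res`: one summary data frame per subject file.
# The RT values below are invented purely for illustration.
res <- list(
  data.frame(Condition = c("X", "Y", "Z"), RT = c(500, 640, 1000)),
  data.frame(Condition = c("X", "Y", "Z"), RT = c(530, 700,  950)),
  data.frame(Condition = c("X", "Y", "Z"), RT = c(480, 610, 1100)),
  data.frame(Condition = c("X", "Y", "Z"), RT = c(510, 690, 1020))
)

# Stack the list into one long data frame, one row per subject x condition.
# Assumption: the position of a file in the list identifies the subject.
long <- do.call(rbind, lapply(seq_along(res), function(i) {
  cbind(subject = i, res[[i]])
}))
long$subject   <- factor(long$subject)
long$Condition <- factor(long$Condition)

# Repeated-measures ANOVA with Condition as the within-subject factor.
fit <- aov(RT ~ Condition + Error(subject / Condition), data = long)
summary(fit)
```

With the real data you would drop the stand-in list and use the `res` returned by lapply. If Reqresponse and Score should also enter the model, they would need to be kept as columns and added to the formula, which this sketch does not attempt.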
