Original df (clinical chemistry)
Subject  Code  Test  Value  Units   Flag
1        NA    NA    147    mmol/L
2        NA/K  NA/K  10.5   RATIO
3        K     K     4.7    mmol/L
4        CK    CK    235    UL
...
Desired df after cleaning
Subject  Code  Test              Value  Units   Flag
1        NA    Sodium            147    mmol/L  NA
2        NA/K  Sodium Potassium  10.5   RATIO   NA
3        K     Potassium         4.7    mmol/L  NA
4        CK    Creatine Kinase   235    UL      NA
...
What I tried
df <- read.csv(file="clinchemistry.csv", header = TRUE, sep=",", stringsAsFactors = FALSE)
df$df[df8$Test == "NA"] <- "Sodium"
df$df[df8$Code == "NA"] <- "Sodium"
and
df[is.na(lb$Code)]<-"Sodium"
lb[is.na(lb$Code)]<-"Sodium"
Result:
Either all the sodium values disappear, or I get this error:
Error in `[<-.data.frame`(`*tmp*`, is.na(lb$Test), value = "Sodium") :
  duplicate subscripts for columns
Could anyone point me in the right direction?
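
One possible direction, offered as a sketch rather than a confirmed fix: by default read.csv treats the literal text "NA" as a missing value, so the sodium code never survives as a string, and indexing a data frame with a single logical vector (no comma) selects columns rather than rows, which would explain the duplicate-subscripts error. The example below assumes the file looks like the sample rows shown earlier. The inline csv_text and the lookup table are hypothetical stand-ins, not taken from the real clinchemistry.csv.

```r
# Hypothetical inline copy of the sample rows (stand-in for clinchemistry.csv).
csv_text <- "Subject,Code,Test,Value,Units,Flag
1,NA,NA,147,mmol/L,
2,NA/K,NA/K,10.5,RATIO,
3,K,K,4.7,mmol/L,
4,CK,CK,235,UL,"

# na.strings = "" keeps the text "NA" as an ordinary string;
# only empty fields (here, the Flag column) are read as missing.
df <- read.csv(text = csv_text, header = TRUE, stringsAsFactors = FALSE,
               na.strings = "")

# Assign into one column, indexed by rows of that column.
df$Test[df$Code == "NA"] <- "Sodium"

# Assumed code-to-name mapping, built from the desired output table.
lookup <- data.frame(
  Code = c("NA", "NA/K", "K", "CK"),
  Test = c("Sodium", "Sodium Potassium", "Potassium", "Creatine Kinase"),
  stringsAsFactors = FALSE
)

# match() returns the row of lookup for each code; unknown codes give NA.
df$Test <- lookup$Test[match(df$Code, lookup$Code)]

print(df)
```

If genuinely missing values are also written as "NA" in the real file, this approach cannot tell them apart from sodium, so that would need checking against the data first.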