r - 如何将两列与偏移数据结合起来？

Question

我的数据集包含两列，其中的数据是偏移的 - 例如：

col1<-c("a", "b", "c", "d", "ND", "ND", "ND", "ND")
col2<-c("ND", "ND", "ND", "ND", "e", "f", "g", "h")
dataset<-data.frame(cbind(col1, col2))

我想将这两个偏移列组合成一个包含字母 a 到 h 的列，仅此而已。

我在想类似下面的内容，但 rbind 不是正确的命令：

dataset$combine<-rbind(dataset$col1[1:4], dataset$col2[5:8])

score 2 · Accepted Answer

关于什么：

sel2 <- col2!="ND"
col1[sel2] <- col2[sel2]
> col1
[1] "a" "b" "c" "d" "e" "f" "g" "h"

score 2 · Accepted Answer

使用sapply一个匿名函数：

dataset[sapply(dataset, function(x) x != "ND")]
# [1] "a" "b" "c" "d" "e" "f" "g" "h"
dataset$combine <- dataset[sapply(dataset, function(x) x != "ND")]
dataset
#   col1 col2 combine
# 1    a   ND       a
# 2    b   ND       b
# 3    c   ND       c
# 4    d   ND       d
# 5   ND    e       e
# 6   ND    f       f
# 7   ND    g       g
# 8   ND    h       h

score 2 · Accepted Answer

用于grep查找匹配的元素并选择它们：

c(col1[grep("^[a-h]$",col1)],col2[grep("^[a-h]$",col2)])

score 2 · Accepted Answer

还有另一种方式，使用mapplyand gsub：

 within(dataset, combine <- mapply(gsub, pattern='ND', replacement=col2, x=col1))
#   col1 col2 combine
# 1    a   ND       a
# 2    b   ND       b
# 3    c   ND       c
# 4    d   ND       d
# 5   ND    e       e
# 6   ND    f       f
# 7   ND    g       g
# 8   ND    h       h

根据您对@Andrie 的回答的评论，这也将保留NA行。

score 1 · Accepted Answer

另一个角度来看：

transform(dataset, 
          combine=dataset[apply(dataset, 2, function(x) x %in% letters[1:8])])
  col1 col2 combine
1    a   ND       a
2    b   ND       b
3    c   ND       c
4    d   ND       d
5   ND    e       e
6   ND    f       f
7   ND    g       g
8   ND    h       h

dataset$combine <- dataset[apply(dataset,2, function(x) nchar(x)==1)] #Also works

score 0 · Accepted Answer

有时问题是想得足够简单...... ;-)

dataset$combine<-c(dataset$col1[1:4], dataset$col2[5:8])

r - 如何将两列与偏移数据结合起来？

6 回答 6

Related

Reference