r - 熔化和铸造数据框添加新变量

Question

我在尝试reshape我的数据框时遇到了一些麻烦：sites_w_richness DATASET LINK HERE。

我一直在尝试将数据组织成四列

地点
经度
纬度
年
设想
丰富度值。

不幸的是，年份（2050 年和 2070 年）和情景（2.6 和 4.5）数据嵌入在数据集第 3 到第 6 列的标题中。有没有办法从标题中提取信息并将数据帧重新转换为我需要的六列？

我已经尝试过了，但我得到的只是na。

#melt and cast to format 
require (reshape)
sites_richness<-read.csv('sites_w_richness.csv', header=T)
rich.melt<-melt (sites_richness, id=1:2)
rich.cast<-cast(rich.melt, Longitude ~ Latitude ~ variable ~ value)

score 2 · Accepted Answer

reshape已被取代reshape2，后者更快且内存效率更高。它还具有colsplit按您的意愿执行的功能

library(reshape2)
# melt (using site, longitude and latitude as ids)
rich.melt <- melt(site_richness, id = 1:3)
# create a variable without `site_richness_` prefix
rich.melt$v2 <- rich.melt$variable
levels(rich.melt$v2) <- gsub('^site_richness_','',levels(rich.melt$v2))
# use colsplit to split on `_` and combine with the newest data
rich.melt2 <- cbind(rich.melt, colsplit(rich.melt$v2, pattern = '_', names = c('scenario','year')))
# drop unwanted columns and reorder
rich.melt.final <- rich.melt2[, c("Site", "Longitude", "Latitude", 
                                 "scenario", "year", "species_richness")]
head(rich.melt.final)
     Site Longitude Latitude scenario year species_richness
# 1  ABSF  -78.6492  37.4343      2.6 2050                4
# 2 ALLSP  -74.1487  40.1481      2.6 2050               31
# 3  ANSF  -71.9341  42.7743      2.6 2050               49
# 4  ARSP  -68.0148  46.6067      2.6 2050               19
# 5  BAXP  -68.8520  46.1645      2.6 2050               23
# 6  BBSP  -71.3822  43.1643      2.6 2050               35

r - 熔化和铸造数据框添加新变量

1 回答 1

Related

Reference