我有两个数据集。第一个包含城市列表及其与目的地的距离(以英里为单位)。第二个列表包含目的地。我想使用 purrr 将最近目的地的名称放入第一个数据集中的新列中。
这是第一个数据集(包含数据/距离):
library(tidyverse)
data1 <- tibble(city = c("Atlanta", "Tokyo", "Paris"),
dist_Rome = c(1000, 2000, 300),
dist_Miami = c(400, 3000, 1500),
dist_Singapore = c(3000, 600, 2000),
dist_Toronto = c(900, 3200, 1900))
这是包含目的地的第二个数据集:
library(tidyverse)
data2 <- tibble(destination = c("Rome Italy", "Miami United States", "Singapore Singapore", "Toronto Canada"))
这就是我想要的样子:
library(tidyverse)
solution <- tibble(city = c("Atlanta", "Tokyo", "Paris"),
dist_Rome = c(1000, 2000, 300),
dist_Miami = c(400, 3000, 1500),
dist_Singapore = c(3000, 600, 2000),
dist_Toronto = c(900, 3200, 1900),
nearest = c("Miami United States", "Singapore Singapore", "Rome Italy"))
理想情况下,我正在寻找一个整洁的解决方案,并且我试图用 purrr 来做到这一点,但无济于事。这是我失败的尝试:
library(tidyverse)
solution <- data1 %>%
mutate(nearest_hub = map(select(., contains("dist")), ~
case_when(which.min(c(...)) ~ data2$destination),
TRUE ~ "NA"))
Error in which.min(c(...)) :
(list) object cannot be coerced to type 'double'
谢谢!