我正在尝试获取以下数据集并将其转换为第二个。基本上,我正在尝试用该 ID 填写每个 ID 之间的 NA。
每个 ID 对应于两个时间戳,我已将其加入到较大的 date_time 列中。出于重现能力的目的,在连接之间执行 sql(date_time 列非常大)甚至获取原始数据集并在每个 id 之间创建时间戳然后将其加入(我有太多ID 来执行此操作)。我已经成功完成了这两种方法,但我拥有的数据量需要太多时间。我希望用这个数据集来操作数据。看似很简单的事情,却真的让我很为难。任何帮助,将不胜感激。
当前数据集:
date_time id
<dttm> <dbl>
1 2017-01-30 08:00:00 NA
2 2017-01-30 08:00:01 NA
3 2017-01-30 08:00:02 1
4 2017-01-30 08:00:03 NA
5 2017-01-30 08:00:04 NA
6 2017-01-30 08:00:05 NA
7 2017-01-30 08:00:06 NA
8 2017-01-30 08:00:07 1
9 2017-01-30 08:00:08 NA
10 2017-01-30 08:00:09 NA
11 2017-01-30 08:00:10 2
12 2017-01-30 08:00:11 NA
13 2017-01-30 08:00:12 NA
14 2017-01-30 08:00:13 NA
15 2017-01-30 08:00:14 2
16 2017-01-30 08:00:15 NA
17 2017-01-30 08:00:16 3
18 2017-01-30 08:00:17 NA
19 2017-01-30 08:00:18 3
20 2017-01-30 08:00:19 NA
所需数据集:
date_time id
<dttm> <dbl>
1 2017-01-30 08:00:00 NA
2 2017-01-30 08:00:01 NA
3 2017-01-30 08:00:02 1
4 2017-01-30 08:00:03 1
5 2017-01-30 08:00:04 1
6 2017-01-30 08:00:05 1
7 2017-01-30 08:00:06 1
8 2017-01-30 08:00:07 1
9 2017-01-30 08:00:08 NA
10 2017-01-30 08:00:09 NA
11 2017-01-30 08:00:10 2
12 2017-01-30 08:00:11 2
13 2017-01-30 08:00:12 2
14 2017-01-30 08:00:13 2
15 2017-01-30 08:00:14 2
16 2017-01-30 08:00:15 NA
17 2017-01-30 08:00:16 3
18 2017-01-30 08:00:17 3
19 2017-01-30 08:00:18 3
20 2017-01-30 08:00:19 NA
输入()日期:
structure(list(date_time = structure(c(1485781200, 1485781201,
1485781202, 1485781203, 1485781204, 1485781205, 1485781206, 1485781207,
1485781208, 1485781209, 1485781210, 1485781211, 1485781212, 1485781213,
1485781214, 1485781215, 1485781216, 1485781217, 1485781218, 1485781219
), class = c("POSIXct", "POSIXt"), tzone = ""), trx_id = c(NA_real_,
NA_real_, 1, NA_real_, NA_real_, NA_real_, NA_real_, 1,
NA_real_, NA_real_, 2, NA_real_, NA_real_, NA_real_, 2,
NA_real_, 3, NA_real_, 3, NA_real_)), .Names = c("date_time",
"trx_id"), row.names = c(NA, -20L), class = c("tbl_df", "tbl",
"data.frame"))