
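I want to aggregate my values by time interval, but my values are categorical, and most of the aggregation functions available are meant for numeric data. Here is my code: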

data.drop_duplicates(subset=['timestamp'], inplace=True)
data.timestamp = pd.to_datetime(data.timestamp)
data2 = data.set_index('timestamp').groupby('host').agg("event").resample('10s').ffill()
data2

The output is:

host        timestamp          
abc         2021-03-15 15:15:00         NaN
            2021-03-15 15:15:10    Accepted
            2021-03-15 15:15:20    Accepted
            2021-03-15 15:15:30         NaN
            2021-03-15 15:15:40         NaN
Name: event, dtype: object

But I want it to be:

host        timestamp          
abc         2021-03-15 15:15:00    Accepted, Refused, Accepted
            2021-03-15 15:15:10    Accepted, Accepted
            2021-03-15 15:15:20    Accepted
            2021-03-15 15:15:30         NaN
            2021-03-15 15:15:40    Refused, Refused, Accepted
Name: event, dtype: object

Can anyone help me?

Before grouping, the sample data is a DataFrame like this:

    timestamp            host  event
0   2021-03-15 15:15:01  x     Accepted
3   2021-03-15 15:15:01  x     Accepted
6   2021-03-15 15:15:01  x     Accepted
9   2021-03-15 15:15:01  x     Accepted
12  2021-03-15 15:15:01  x     Accepted
15  2021-03-15 15:15:01  f     Accepted
20  2021-03-15 15:15:31  x     Failed
23  2021-03-15 15:15:33  x     Failed
26  2021-03-15 15:15:35  x     Failed
29  2021-03-15 15:15:37  x     Failed
33  2021-03-15 15:15:39  x     Failed

Thanks in advance!!!
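
For reference, here is a minimal sketch of one way the desired result might be produced, assuming the goal is to join the event strings that fall into each 10-second interval per host. It rebuilds the sample data above by hand; the names `bucket`, `joined`, and `filled` are made up for illustration. Note that it skips the `drop_duplicates(subset=['timestamp'])` step, since that step would discard the rows sharing a timestamp before they could be joined.

```python
import pandas as pd

# Sample data copied from the question.
data = pd.DataFrame({
    "timestamp": ["2021-03-15 15:15:01"] * 6 + [
        "2021-03-15 15:15:31", "2021-03-15 15:15:33", "2021-03-15 15:15:35",
        "2021-03-15 15:15:37", "2021-03-15 15:15:39",
    ],
    "host": ["x"] * 5 + ["f"] + ["x"] * 5,
    "event": ["Accepted"] * 6 + ["Failed"] * 5,
})
data["timestamp"] = pd.to_datetime(data["timestamp"])

# Floor each timestamp to the start of its 10-second interval.
data["bucket"] = data["timestamp"].dt.floor("10s")

# Join the event strings within each (host, interval) pair.
# Only intervals that actually contain rows appear in this result.
joined = (
    data.sort_values("timestamp", kind="stable")
        .groupby(["host", "bucket"])["event"]
        .agg(", ".join)
)

# Optional: per host, insert the empty intervals between the first and
# last observed one, so that they show up as NaN.
filled = pd.concat(
    {
        host: s.droplevel("host").asfreq("10s")
        for host, s in joined.groupby(level="host")
    },
    names=["host", "bucket"],
)

print(filled)
```

`', '.join` works as an aggregation here because every value in `event` is a string; if the column may contain missing values, they would need to be dropped or converted first. Whether empty intervals should appear as NaN (as in `filled`) or be omitted (as in `joined`) depends on what the output is used for.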
