I have the following data frame:

Group     Deadline Time    Deadline Date    Task Completed Date   Task Completed Time
Group 1   20:00:00         17-07-2012       17-07-2012              20:34:00
Group 2   20:15:00         17-07-2012       17-07-2012              20:39:00
Group 3   22:00:00         17-07-2012       17-07-2012              22:21:00
Group 4   23:50:00         17-07-2012       18-07-2012              00:09:00
Group 5   20:00:00         18-07-2012       18-07-2012              20:37:00
Group 6   20:15:00         18-07-2012       18-07-2012              21:13:00
Group 7   22:00:00         18-07-2012       18-07-2012              22:56:00
Group 8   23:50:00         18-07-2012       19-07-2012              00:01:00
Group 9   20:15:00         19-07-2012       19-07-2012              20:34:00
Group 10  20:00:00         19-07-2012       19-07-2012              20:24:00

How can I calculate the time delay?

Time Delay (mins)
00:34:00
00:24:00
00:21:00
00:19:00
00:37:00
00:58:00
00:56:00
00:11:00
00:19:00
00:24:00

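I have tried, without success, to:

  1. Combine the "Deadline" date and time columns, and likewise the "Task Completed" date and time columns, and

  2. Find the difference between the "Task Completed" and "Deadline" times.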

1 Answer

Combine them as strings ("addition" works), convert them to datetime type, and then subtract to get a Series of timedelta type.

In [14]: deadline = pd.to_datetime(df['Deadline Date'] + ' ' + df['Deadline Time'])

In [15]: completed = pd.to_datetime(df['Task Completed Date'] + ' ' + df['Task Completed Time'])

In [16]: completed - deadline
Out[16]: 
0   00:34:00
1   00:24:00
2   00:21:00
3   00:19:00
4   00:37:00
5   00:58:00
6   00:56:00
7   00:11:00
8   00:19:00
9   00:24:00
dtype: timedelta64[ns]
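
If the delay is wanted as a number of minutes rather than a timedelta, the same steps can be sketched as below. This is a minimal example under some assumptions: the sample frame holds only three of the rows from the question, the explicit `format` string is my reading of the day-month-year layout shown above (it avoids any day/month guessing), and the `Time Delay (mins)` column name is simply taken from the question's desired output.

```python
import pandas as pd

# Three rows copied from the question's data (Groups 1, 4 and 8).
df = pd.DataFrame({
    "Deadline Date": ["17-07-2012", "17-07-2012", "18-07-2012"],
    "Deadline Time": ["20:00:00", "23:50:00", "23:50:00"],
    "Task Completed Date": ["17-07-2012", "18-07-2012", "19-07-2012"],
    "Task Completed Time": ["20:34:00", "00:09:00", "00:01:00"],
})

# Assumed layout of the combined strings: day-month-year, then 24-hour time.
fmt = "%d-%m-%Y %H:%M:%S"

# Join date and time as strings, then parse with the explicit format.
deadline = pd.to_datetime(df["Deadline Date"] + " " + df["Deadline Time"], format=fmt)
completed = pd.to_datetime(
    df["Task Completed Date"] + " " + df["Task Completed Time"], format=fmt
)

# Subtracting two datetime Series yields a timedelta Series.
delay = completed - deadline

# Convert the timedeltas to minutes as floats.
df["Time Delay (mins)"] = delay.dt.total_seconds() / 60
```

Because the date is part of each timestamp, tasks completed after midnight (Groups 4 and 8) still give a positive delay.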
answered 2013-07-26T19:43:12.900