python - Pandas for 循环索引

翻译自：https://stackoverflow.com/questions/19776546 2013-11-04T20:29:40.303

346 次

我有一个由一些非唯一非连续 id 编号索引的熊猫数据框。

x
class 'pandas.core.frame.DataFrame'
Int64Index：814061 个条目，8536896 到 8498857
数据列（共 1 列）：
收到 814061 个非空值
dtypes：datetime64ns

x['received'] 是长度不同的时间戳。

x.ix[i] might have len() == 20  
x.ix[j] might have len() == 32.

对于任何 x.ix[i] 我可以将时间戳放入 [0,1] 范围内

df['totalseconds'] = x.ix[i]['received']-x.ix[i]['received'].min()
y = x.ix[i]['received'].max()-x.ix[i]['received'].min()
z = timedelta.total_seconds(y)
df['seconds'] = df['totalseconds'].apply(lambda x: x / timedelta64(1, 's'))
df['norm'] = df['seconds']/z

我正在尝试为 x 中的每个索引标准化我的时间段。但是由于某种原因我遇到了麻烦。

tldr; 如何通过索引 id 将所有时间戳放入 [0, 1] 范围内？

python - Pandas for 循环索引

0 回答 0

Related

Reference