我认为原因是熊猫使用cython 优化代码,如果单独调用,concat为相同的输出添加测试:
np.random.seed(123)
N = 1000000
df = pd.DataFrame(np.random.randint(1000, size=(N, 4)), columns=list('ABCD'))
print (df)
In [176]: %%timeit
...: df.groupby('A')['B', 'C', 'D'].agg(['mean', 'std', 'count'])
...:
274 ms ± 7.7 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)
In [177]: %%timeit
...: grpd = df.groupby('A')['B', 'C', 'D']
...: a = grpd.agg('mean')
...: b = grpd.agg('std')
...: c = grpd.agg('count')
...: pd.concat([a,b,c], axis=1)
...:
...:
190 ms ± 980 µs per loop (mean ± std. dev. of 7 runs, 10 loops each)
In [178]: %%timeit
...: grpd = df.groupby('A')['B', 'C', 'D']
...: a = grpd.mean()
...: b = grpd.std()
...: c = grpd.count()
...: pd.concat([a,b,c], axis=1)
...:
...:
191 ms ± 4.33 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)