9

假设我有两个数组,a并且b

  a.shape = (5,2,3)
  b.shape = (2,3)

然后c = a * b会给我一个c形状数组(5,2,3)c[i,j,k] = a[i,j,k]*b[j,k]

现在的情况是,

  a.shape = (5,2,3)
  b.shape = (2,3,8)

我想有c一个形状。(5,2,3,8)c[i,j,k,l] = a[i,j,k]*b[j,k,l]

如何有效地做到这一点?我的ab实际上相当大。

4

2 回答 2

13

这应该有效:

a[..., numpy.newaxis] * b[numpy.newaxis, ...]

用法:

In : a = numpy.random.randn(5,2,3)

In : b = numpy.random.randn(2,3,8)

In : c = a[..., numpy.newaxis]*b[numpy.newaxis, ...]

In : c.shape
Out: (5, 2, 3, 8)

参考:numpy 中的数组广播

编辑:更新的参考 URL

于 2012-05-20T22:37:00.830 回答
7

我认为以下应该有效:

import numpy as np
a = np.random.normal(size=(5,2,3))
b = np.random.normal(size=(2,3,8))
c = np.einsum('ijk,jkl->ijkl',a,b)

和:

In [5]: c.shape
Out[5]: (5, 2, 3, 8)

In [6]: a[0,0,1]*b[0,1,2]
Out[6]: -0.041308376453821738

In [7]: c[0,0,1,2]
Out[7]: -0.041308376453821738

np.einsum使用起来可能有点棘手,但对于这些类型的索引问题非常强大:

http://docs.scipy.org/doc/numpy/reference/generated/numpy.einsum.html

另请注意,这需要 numpy >= v1.6.0

我不确定您的特定问题的效率,但如果它的性能不如需要,一定要考虑使用带有显式 for 循环的 Cython,并可能使用并行化它prange

更新

In [18]: %timeit np.einsum('ijk,jkl->ijkl',a,b)
100000 loops, best of 3: 4.78 us per loop

In [19]: %timeit a[..., np.newaxis]*b[np.newaxis, ...]
100000 loops, best of 3: 12.2 us per loop


In [20]: a = np.random.normal(size=(50,20,30))
In [21]: b = np.random.normal(size=(20,30,80))

In [22]: %timeit np.einsum('ijk,jkl->ijkl',a,b)
100 loops, best of 3: 16.6 ms per loop

In [23]: %timeit a[..., np.newaxis]*b[np.newaxis, ...]
100 loops, best of 3: 16.6 ms per loop

In [2]: a = np.random.normal(size=(500,20,30))
In [3]: b = np.random.normal(size=(20,30,800))

In [4]: %timeit np.einsum('ijk,jkl->ijkl',a,b)
1 loops, best of 3: 3.31 s per loop

In [5]: %timeit a[..., np.newaxis]*b[np.newaxis, ...]
1 loops, best of 3: 2.6 s per loop
于 2012-05-20T22:32:40.707 回答