python - 添加到常规 numpy ndarray 时调用 scipy.sparse add 方法？

Question

我正在计算 scipy.sparse 矩阵（CSC）和 numpy ndarray 向量之间的点积：

>>> print type(np_vector), np_vector.shape
<type 'numpy.ndarray'> (200,)
>>> print type(sp_matrix), sparse.isspmatrix(sp_matrix), sp_matrix.shape
<class 'scipy.sparse.csc.csc_matrix'> True (200, 200)
>>> dot_vector = dot(np_vector, sp_matrix)

结果似乎是一个新的 ndarray 向量，正如我所期待的：

>>> print type(dot_vector), dot_vector.shape
<type 'numpy.ndarray'> (200,)

但是，当我尝试向该向量添加标量时，我收到异常：

>>> scalar = 3.0
>>> print dot_vector + scalar 
C:\Python27\lib\site-packages\scipy\sparse\compressed.pyc in __add__(self, other)
    173                 return self.copy()
    174             else: # Now we would add this scalar to every element.
--> 175                 raise NotImplementedError('adding a nonzero scalar to a '
    176                                           'sparse matrix is not supported')
    177         elif isspmatrix(other):

NotImplementedError: adding a nonzero scalar to a sparse matrix is not supported

好像结果dot_vector又是一个稀疏矩阵。

具体来说，好像我有一个 ndarray 但__add__为运算符调用了稀疏矩阵+。

这是我希望被调用的方法：

>>> print dot_vector.__add__
<method-wrapper '__add__' of numpy.ndarray object at 0x05250690>

我在这里遗漏了什么或者这真的看起来很奇怪吗？
是什么决定了操作员调用哪种方法+？
我在 IPython Notebook ( ipython notebook --pylab inline) 中运行此代码。会不会是 IPython --pylab 或笔记本内核搞砸了？

谢谢你的帮助！

score 7 · Accepted Answer

如果您执行以下操作，您的呼叫np.dot所做的与您得到的并没有太大不同：

>>> np.dot([1, 2, 3], 4)
array([ 4,  8, 12])

因为np.dot不了解稀疏矩阵，所以在您的情况下，返回再次是向量的每个元素与原始稀疏矩阵的乘积。这可能是调用__rmul__稀疏矩阵的方法来执行的，所以你得到的是一个 200 项的数组，每个数组本身就是一个稀疏矩阵。当您尝试将标量添加到该向量时，它会被广播，并且当您尝试将标量添加到每个矩阵时，会弹出错误。

这样做的正确方法是调用.dot稀疏矩阵的方法。预乘一个行向量：

>>> aa = sps.csc_matrix(np.arange(9).reshape(3, 3))
>>> bb = np.arange(3)
>>> aa.T.dot(bb)
array([15, 18, 21])

并乘以列向量：

>>> aa.dot(bb)
array([ 5, 14, 23])

这当然完全等同于您对数组的操作：

>>> aaa = np.arange(9).reshape(3,3)
>>> aaa.dot(bb)
array([ 5, 14, 23])
>>> bb.dot(aaa)
array([15, 18, 21])

python - 添加到常规 numpy ndarray 时调用 scipy.sparse __add__ 方法？

1 回答 1

Related

Reference

python - 添加到常规 numpy ndarray 时调用 scipy.sparse add 方法？