cupy - 遍历数组时，Cupy 比 numpy 慢

Question

我有代码，我想与 cupy 并行化。我认为这很简单——只需写“import cupy as cp”，然后将我写 np. 的所有地方都替换为 cp.，这样就可以了。

而且，它确实有效，代码确实可以运行，但速度要慢得多。与 numpy 相比，我认为在迭代更大的数组时最终会更快，但似乎永远不会发生。

代码是：

q = np.zeros((5,5))
q[:,0] = 20

def foo(array):

    result = array
    shedding_row = array*0
    for i in range((array.shape[0])):
        for j in range((array.shape[1])-1):

            shedding_param = 2 * (result[i,j])**.5             
            shedding = (np.random.poisson( (shedding_param), 1))[0]

            if shedding >= result[i,j]:
                shedding = result[i,j] - 1

            result[i,j+1] = result[i,j] - shedding

            if result[i,j+1]<0:
                result[i,j+1] = 0

            shedding_row[i,j+1] = shedding  

    return(result,shedding_row)

x,y = foo(q)

这应该用cupy变得更快吗？我用错了吗？

score 2 · Accepted Answer

要获得numpyor的快速性能cupy，您应该使用并行操作而不是使用 for 循环。

举个例子，

for i in range((array.shape[0])):
    for j in range((array.shape[1])-1):

        shedding_param = 2 * (result[i,j])**.5

这可以计算为

xp = numpy  # change to cupy for GPU
shedding_param = 2 * xp.sqrt(result[:, :-1])

cupy - 遍历数组时，Cupy 比 numpy 慢

1 回答 1

Related

Reference