This should be simple enough:

start=cuda.Event()
func(args,block=blockdims)
cuda.memcpy_dtoh(d,h)
end=cuda.Event()

dur=start.time_till(end)
print dur

But I get this error:

File "gpu.py", line 161, in gpu_test
    dur=start.time_till(end)
pycuda._driver.LogicError: cuEventElapsedTime failed: invalid handle

As far as I know, this is the correct usage. Does anyone know what I'm doing wrong?

1 Answer

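Take a look at SimpleSpeedTest.py. Creating an event is not enough: each event has to be recorded with record() before it can be timed, and the end event should be synchronized before calling time_till(). In your code neither event is ever recorded, and end is only created after the work has already run, which is why cuEventElapsedTime reports an invalid handle.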

start=cuda.Event()
end=cuda.Event()

start.record() # start timing
func(args,block=blockdims)
cuda.memcpy_dtoh(d,h)
end.record() # end timing
# calculate the run length
end.synchronize()
millis = start.time_till(end)
print millis
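
If you time several calls, the same pattern can be wrapped in a small helper. This is only a sketch: time_gpu_call and its event_factory parameter are names I made up for illustration, not part of PyCUDA. It assumes pycuda.autoinit is an acceptable way to create the context in your program. The only PyCUDA calls used are the ones shown above (Event, record, synchronize, time_till).

```python
def time_gpu_call(work, event_factory=None):
    """Return the elapsed time of work() in milliseconds using CUDA events.

    `event_factory` is a hypothetical hook: by default it is
    pycuda.driver.Event, but any class with record(), synchronize()
    and time_till() can be passed in (handy for trying this without a GPU).
    """
    if event_factory is None:
        # Imported lazily so the helper can be defined on a machine
        # without CUDA; pycuda.autoinit creates a context on import.
        import pycuda.autoinit  # noqa: F401
        import pycuda.driver as cuda
        event_factory = cuda.Event

    # Create both events up front, before any work is queued.
    start = event_factory()
    end = event_factory()

    start.record()     # mark the start in the stream
    work()             # kernel launch and/or memcpy go here
    end.record()       # mark the end in the stream
    end.synchronize()  # wait until the end event has actually happened

    return start.time_till(end)
```

With the code from the question, usage would look roughly like millis = time_gpu_call(lambda: (func(args, block=blockdims), cuda.memcpy_dtoh(d, h))), keeping the arguments exactly as you already pass them.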
Answered 2011-04-18T16:57:56.417