我正在尝试运行http://docs.thrust.googlecode.com/hg/group__modifying.html中描述的推力 for_each 示例,但是在编译和运行时出现错误。
我使用以下文件:fe.cu:
#include <thrust/for_each.h>
#include <thrust/device_vector.h>
#include <stdio.h>
struct printf_functor{
__host__ __device__
void operator()(int x){
printf("%d\n");
}
};
int main(){
thrust::device_vector<int> d_vec(3);
d_vec[0] = 0; d_vec[1] = 1; d_vec[2] = 2;
thrust::for_each(d_vec.begin(), d_vec.end(), printf_functor());
}
我用nvcc -arch=sm_20 fe.cu
.
当我使用 ./a.out 运行时,我得到以下输出:
terminate called after throwing an instance of 'thrust::system::system_error'
what(): unspecified launch failure
Aborted
以下是有关用于运行代码的 GPU 的一些信息:
--- General Information for device 0 ---
Name: Tesla C2075
Compute capability: 2.0
Clock rate: 1147000
Device copy overlap: Enabled
Kernel execution timeout : Disabled
--- Memory Information for device 0 ---
Total global mem: 5636554752
Total constant Mem: 65536
Max mem pitch: 2147483647
Texture Alignment: 512
--- MP Information for device 0 ---
Multiprocessor count: 14
Shared mem per mp: 49152
Registers per mp: 32768
Threads in warp: 32
Max threads per block: 1024
Max thread dimensions: (1024, 1024, 64)
Max grid dimensions: (65535, 65535, 65535)