cuda - 最大调用cuda中的功能设备

Question

我正在尝试在全局功能上调用 1000000 或更长时间的设备功能。但是，我总是遇到以下错误：Microsoft C++ exception: cudaError_enum at memory location 0x0031fc24 但是代码很简单。从线程设备返回到线程主机的执行线程有可能异步锁定一些资源吗？正如我们所见，变量中没有溢出，那么发生了什么？

#include <stdio.h>
#include <stdlib.h>
#include <time.h>
#include <math.h>

#include "cuda.h"
#include "curand_kernel.h"

#define NDIM 30 
#define NPAR 3 

#define DIMPAR NDIM*NPAR //

__device__ float f(float *inputs){
    float t = 0.0;
    int i;
    for(i = 0 ; i < 15; i++)
        t+= inputs[i]*0.0001;
    return t;
}

__global__ void kernel(float *pos, float *pbest){

    int thread = threadIdx.x + blockDim.x * blockIdx.x;
    int i = 0;
    float tpbest = 0.0;

    if(thread < DIMPAR){
        do{
            tpbest = f(pbest);
            i++;
        }while(i <  1000000); //max length int 2147483648 > 1000000

    }
}


int main(int argc, char *argv[])
{

    float *d_pos,    *h_pos;
    float *d_pbest,  *h_pbest;


    h_pos   = ( float *) malloc(sizeof( float ) * DIMPAR);
    h_pbest = ( float *) malloc(sizeof( float ) * DIMPAR);

    cudaMalloc((void**)&d_pos, DIMPAR   * sizeof( float ));
    cudaMalloc((void**)&d_pbest, DIMPAR * sizeof( float ));

    int i, numthreadsperblock, numblocks;

    numthreadsperblock = 512;
    numblocks = (DIMPAR / numthreadsperblock) + ((DIMPAR % numthreadsperblock)?1:0);
    printf("numthreadsperblock: %i;; numblocks:%i\n", numthreadsperblock, numblocks);

    //fill in host code
    for(i = 0 ; i < DIMPAR ; i++){
        h_pos[i] = 1;
        h_pbest[i] = 1;
    }

    //transf. to device memory
    cudaMemcpy(d_pos, h_pos, DIMPAR * sizeof( float ), cudaMemcpyHostToDevice);
    cudaMemcpy(d_pbest, h_pbest, DIMPAR * sizeof( float ), cudaMemcpyHostToDevice);

    kernel<<<numblocks,numthreadsperblock>>>(d_pos, d_pbest);
    cudaMemcpy(h_pos, d_pos, DIMPAR * sizeof( float ), cudaMemcpyDeviceToHost); 



    return 0;
}

score 1 · Accepted Answer

我怀疑完整的错误消息是这样的：

First-chance exception at 0x7c812a5b in myapp.exe: Microsoft C++ exception: cudaError_enum at memory location 0x0031fc24...

您应该在您的 CUDA 代码中进行适当的cuda 错误检查（但我运行了您的代码并没有看到任何明显的 API 错误）。

如果您没有通过上述方法报告 CUDA 错误（正确的 CUDA 错误检查），那么您可以放心地忽略此错误。它是由链接到您的代码的 CUDA 库中捕获并正确处理的异常引起的。

您的应用程序仍将正常运行，如果您在 Visual Studio 之外运行可执行文件，我相信您不会看到此消息。

您可以尝试更新到 CUDA 5.5 以查看此特定消息是否消失。

作为另一个指标，您可以运行您的应用程序，cuda-memcheck它还会检查各种错误。

cuda - 最大调用cuda中的功能设备

1 回答 1

Related

Reference