cuda - CUDA and Monte Carlo with defined local behavior

Question

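I have a question about some strange behavior in CUDA. I am currently developing a Monte Carlo simulation of particle trajectories, and this is what I am doing.

The position p(n) of my particle at a given date t(n) depends on its position p(n-1) at the previous date t(n-1). More precisely, assume that a value v(n) is computed from the value p(n-1). Here is a simplified example of my code: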

__device__ inline double calculateStep(double drift, double vol, double dt, double randomWalk, double S_t) {
  return exp((drift - vol*vol*0.5)*dt + randomWalk*vol*sqrt(dt))*S_t;
}

__device__ double DoSomethingWith(double v_n, ...) {
  ...
  return v_n*exp(t)*S;
}

__global__ void myMCsimulation(double* matrice, double* randomWalk, int nbreSimulation, int nPaths, double drift, ...) {

  double dt = T/nPaths;
  unsigned int tid = threadIdx.x + blockDim.x * blockIdx.x;
  unsigned int stride = blockDim.x*gridDim.x;
  unsigned int index = tid;
  double mydt = (index - nbreSimulation)/nbreSimulation*dt + dt;

  for (index = tid; index < nbreSimulation*nPaths; index += stride) {
    if (index >= nbreSimulation)
    {
      double v_n = DoSomethingWith(drift, dt, matrice[index - nbreSimulation]);
      matrice[index] = matrice[index - nbreSimulation] * calculateStep(drift, v_n, dt, randomWalk[index]);
    }
  }
  ...
}

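The last line of code:

matrice[index] = matrice[index - nbreSimulation] * calculateStep(drift, v_n, dt, randomWalk[index]);

lets me fill only the second row of the matrix matrice. I don't know why.

When I change that line to the following:

matrice[index] = DoSomethingWith(drift, dt, matrice[index - nbreSimulation]);

my matrix is filled correctly, all of my values change, and I can then read back matrice[index - nbreSimulation]. I think this is a concurrent access problem, but I am not sure. I tried __syncthreads() but it did not help.

Can anyone help?

Thank you very much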

score 1 · Accepted Answer

I changed my code to the following, and now it works perfectly.

if (index < nbreSimulation) {
    matrice[index] = S0;
    for (workingCol = 1; workingCol < nPaths; workingCol++) {
        previousMove = index;
        index = index + nbreSimulation;
        ................
        matrice[index] = calculateStep(drift, vol_int[index], dt, randomWalk[index], matrice[previousMove]);
    }
}
}
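
A likely reason this version behaves well: each simulation is now owned by a single thread that walks its own path step by step, so the value it reads from the previous row is always one it wrote itself. In the grid-stride version from the question, a thread can read matrice[index - nbreSimulation] before the thread responsible for that element has written it, and __syncthreads() only synchronizes threads within one block. The sketch below is a host-side C++ reference for the same indexing, not the author's kernel. The names calculateStepHost and simulatePaths are hypothetical, the outer loop stands in for the thread index, and a single constant vol is assumed in place of vol_int[index].

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Host-side copy of calculateStep from the question: advances S_t by one
// time step of length dt using the supplied random draw.
inline double calculateStepHost(double drift, double vol, double dt,
                                double randomWalk, double S_t) {
    return std::exp((drift - vol * vol * 0.5) * dt
                    + randomWalk * vol * std::sqrt(dt)) * S_t;
}

// Hypothetical CPU reference for the layout used in the answer:
// element (step, sim) lives at matrice[step * nbreSimulation + sim].
// Each iteration of the outer loop corresponds to one GPU thread with
// index < nbreSimulation; that thread alone writes its own path.
// Assumption: one constant vol instead of the answer's vol_int[index].
std::vector<double> simulatePaths(std::size_t nbreSimulation, std::size_t nPaths,
                                  double S0, double drift, double vol, double dt,
                                  const std::vector<double>& randomWalk) {
    assert(nbreSimulation > 0 && nPaths > 0);
    assert(randomWalk.size() == nbreSimulation * nPaths);

    std::vector<double> matrice(nbreSimulation * nPaths);
    for (std::size_t sim = 0; sim < nbreSimulation; ++sim) {
        std::size_t index = sim;
        matrice[index] = S0;  // first row holds the starting value
        for (std::size_t workingCol = 1; workingCol < nPaths; ++workingCol) {
            const std::size_t previousMove = index;  // same path, previous step
            index += nbreSimulation;                 // same path, current step
            matrice[index] = calculateStepHost(drift, vol, dt,
                                               randomWalk[index],
                                               matrice[previousMove]);
        }
    }
    return matrice;
}
```

Comparing the output of such a reference against the GPU result for a small nbreSimulation and nPaths is one way to check that every row, and not just the second, is filled as intended.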

score 0 · Accepted Answer

I tried the following:

I declared a shared variable (an array of doubles) that holds the value computed at each iteration:

__shared__ double mat[];

......
for ( index = tid; index < nbreSimulation*nPaths; index += stride) {
   .....
  mat[index] = computedValue;
   ......
 }

It did not work. Does anyone see the problem?
