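In CUDA, is there a way to create a 2D array a[][] in which each row a[i] is itself forced to start on the alignment boundary of another data type (for example, double)?
I would like to do something like this: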
__shared__ unsigned char a[20][8];// where a[i] is aligned to 8-byte boundary;
double t=*((double *)(a[2]));
Or even this, where the row length is not a multiple of 8:
__shared__ unsigned char a[20][9];// where a[i] is aligned to 8-byte boundary;
double t=*((double *)(a[2]));