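I am using the FFTW advanced data layout API to compute batched 2D FFTs.
According to the FFTW Advanced Complex DFT documentation:
Passing NULL for an nembed parameter is equivalent to passing n.
However, I get different results with inembed = onembed = NULL than with inembed = onembed = n. What could cause the results not to match?
Let's look at an example...
Setup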
    int howMany = 2;
    int nRows = 4;
    int nCols = 4;
    int n[2] = {nRows, nCols};
    float* h_in = (float*)malloc(sizeof(float) * nRows*nCols*howMany);
    for(int i=0; i<(nRows*nCols*howMany); i++){ //initialize h_in to [0 1 2 3 4 ...]
        h_in[i] = (float)i;
        printf("h_in[%d] = %f \n", i, h_in[i]);
    }
FFTW plan using inembed == onembed == NULL
    fftwf_plan forwardPlan = fftwf_plan_many_dft_r2c(2,        //rank
                                                     n,        //dimensions = {nRows, nCols}
                                                     howMany,  //howmany
                                                     h_in,     //in
                                                     NULL,     //inembed
                                                     howMany,  //istride
                                                     1,        //idist
                                                     h_freq,   //out
                                                     NULL,     //onembed
                                                     howMany,  //ostride
                                                     1,        //odist
                                                     FFTW_PATIENT /*flags*/);
I also ran this with inembed = onembed = n = {nRows, nCols}.
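To make the layout parameters concrete, here is a minimal sketch of how I understand the advanced interface to address memory: transform k starts at k*dist, and within a transform the elements are laid out row-major over the physical shape nembed, scaled by stride. The helper name `embed_offset` is hypothetical and not part of FFTW.

```c
#include <assert.h>
#include <stddef.h>

/* Flat offset of element (row, col) of transform k in an array whose
   physical (embedding) shape is nembed[0] x nembed[1].
   Hypothetical helper, not an FFTW function. */
static size_t embed_offset(const int nembed[2], int stride, int dist,
                           int k, int row, int col) {
    assert(row >= 0 && row < nembed[0]);
    assert(col >= 0 && col < nembed[1]);
    /* row-major position inside the embedding, then stride and batch offset */
    return (size_t)k * dist + (size_t)(row * nembed[1] + col) * stride;
}
```

With stride = howMany = 2 and dist = 1 as in the plan above, the two transforms are interleaved: `embed_offset(n, 2, 1, 0, 1, 0)` is 8 and `embed_offset(n, 2, 1, 1, 1, 0)` is 9, so even indices of h_in belong to the first transform and odd indices to the second.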
Results
Note that using NULL or n gives the same numerical values, but in a different order in memory:
Version 1: inembed == onembed == NULL
result[0][0,1] = 240, 0
result[1][0,1] = 256, 0
result[2][0,1] = -16, 16
result[3][0,1] = -16, 16
result[4][0,1] = -16, 0
result[5][0,1] = -16, 0 //this line and above match the other version
result[6][0,1] = -64, 64 //this line and below don't match (data is in a different order)
result[7][0,1] = -64, 64
result[8][0,1] = 0, 0
result[9][0,1] = 0, 0
result[10][0,1] = 0, 0
result[11][0,1] = 0, 0
result[12][0,1] = -64, 0
result[13][0,1] = -64, 0
result[14][0,1] = 0, 0
result[15][0,1] = 0, 0
result[16][0,1] = 0, 0
result[17][0,1] = 0, 0
result[18][0,1] = -64, -64
result[19][0,1] = -64, -64
result[20][0,1] = 0, 0
result[21][0,1] = 0, 0
result[22][0,1] = 0, 0
result[23][0,1] = 0, 0
result[24][0,1] = 0, 0
result[25][0,1] = 0, 0
result[26][0,1] = 0, 0
result[27][0,1] = 0, 0
result[28][0,1] = 0, 0
result[29][0,1] = 0, 0
result[30][0,1] = 0, 0
result[31][0,1] = 0, 0
Version 2: inembed = onembed = n = {nRows, nCols}
result[0][0,1] = 240, 0
result[1][0,1] = 256, 0
result[2][0,1] = -16, 16
result[3][0,1] = -16, 16
result[4][0,1] = -16, 0
result[5][0,1] = -16, 0
result[6][0,1] = 0, 0
result[7][0,1] = 0, 0
result[8][0,1] = -64, 64
result[9][0,1] = -64, 64
result[10][0,1] = 0, 0
result[11][0,1] = 0, 0
result[12][0,1] = 0, 0
result[13][0,1] = 0, 0
result[14][0,1] = 0, 0
result[15][0,1] = 0, 0
result[16][0,1] = -64, 0
result[17][0,1] = -64, 0
result[18][0,1] = 0, 0
result[19][0,1] = 0, 0
result[20][0,1] = 0, 0
result[21][0,1] = 0, 0
result[22][0,1] = 0, 0
result[23][0,1] = 0, 0
result[24][0,1] = -64, -64
result[25][0,1] = -64, -64
result[26][0,1] = 0, 0
result[27][0,1] = 0, 0
result[28][0,1] = 0, 0
result[29][0,1] = 0, 0
result[30][0,1] = 0, 0
result[31][0,1] = 0, 0
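One possible way to read the two listings, sketched under an assumption: the FFTW manual's section on advanced real-data DFTs says a NULL nembed is interpreted as in the basic interface, which for the complex side of an r2c transform would be {nRows, nCols/2 + 1} rather than n. The sketch below records which coefficient lands in which output slot for both physical shapes and finds the first slot where they differ. `SLOTS`, `fill_slots`, and `first_mismatch` are hypothetical names, not FFTW API.

```c
#include <assert.h>

#define SLOTS 32  /* number of complex entries printed in each listing */

/* Record which logical coefficient (row r, col c, batch k) lands in each
   flat output slot for the physical shape nembed; unused slots stay -1. */
static void fill_slots(const int nembed[2], int rows, int cols,
                       int stride, int dist, int howMany, int slot[SLOTS]) {
    for (int i = 0; i < SLOTS; i++) slot[i] = -1;
    for (int k = 0; k < howMany; k++)
        for (int r = 0; r < rows; r++)
            for (int c = 0; c < cols; c++) {
                int idx = k * dist + (r * nembed[1] + c) * stride;
                assert(idx < SLOTS);
                slot[idx] = (r * cols + c) * howMany + k;  /* coefficient id */
            }
}

/* Smallest flat index where the two layouts hold different coefficients,
   or -1 if they agree everywhere. */
static int first_mismatch(const int a[SLOTS], const int b[SLOTS]) {
    for (int i = 0; i < SLOTS; i++)
        if (a[i] != b[i]) return i;
    return -1;
}
```

Calling `fill_slots` with rows = 4, cols = 3 (the nCols/2 + 1 stored output columns), stride = 2, dist = 1, once with shape {4, 3} and once with shape {4, 4}, gives a first mismatch at index 6. That is the same place the two listings start to diverge: with shape {4, 3} slot 6 holds coefficient (1, 0) of the first transform, while with shape {4, 4} that coefficient sits in slot 8 and slot 6 is padding.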
Here is a working implementation of this experiment.
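Independently of FFTW, the nonzero values in both listings can be cross-checked with a naive DFT. The sketch below assumes the input is the ramp h_in[i] = i from the setup and that the two 4x4 transforms are interleaved (istride = howMany, idist = 1). Because every axis has length 4, the twiddle factors are just powers of -i, so integer arithmetic is exact. `rotate4` and `dft4x4` are hypothetical helper names.

```c
#include <assert.h>

/* Multiply (re, im) by exp(-2*pi*i*p/4) = (-i)^p; exact for length-4 axes. */
static void rotate4(int p, int *re, int *im) {
    int r = *re, i = *im;
    switch (p & 3) {
        case 1: *re = i;  *im = -r; break;  /* times -i */
        case 2: *re = -r; *im = -i; break;  /* times -1 */
        case 3: *re = -i; *im = r;  break;  /* times +i */
        default: break;                     /* times +1 */
    }
}

/* Naive 4x4 DFT coefficient X[u][v] of batch k. Assumption: the input is the
   ramp from the setup, read as h_in[(r*4 + c)*howMany + k]. */
static void dft4x4(int howMany, int k, int u, int v, int *re, int *im) {
    assert(k >= 0 && k < howMany);
    *re = 0;
    *im = 0;
    for (int r = 0; r < 4; r++) {
        for (int c = 0; c < 4; c++) {
            int tr = (r * 4 + c) * howMany + k;  /* h_in value at that index */
            int ti = 0;
            rotate4(u * r + v * c, &tr, &ti);
            *re += tr;
            *im += ti;
        }
    }
}
```

For example, `dft4x4(2, 0, 0, 0, &re, &im)` gives 240, 0 and `dft4x4(2, 0, 1, 0, &re, &im)` gives -64, 64, which match the values that appear in both versions above; only their positions in memory differ.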