java - 从 FFT 结果中获取频率的 Java 代码

Question

我有一个双精度数组，其中包含直接来自麦克风的音频样本，采样率为 44100。我想获得基频（样本包含幅度）。在自相关页面的维基百科上，我找到了基于 Wiener-Khinchin 定理的解决方案的描述，我在互联网上进行了更多研究，完成了算法，最终我编写了以下代码，但我不确定它是否正确：

private double determineFrequency(double[] signal) {
 //Get a FastFourierTransformer instance (Apache library)
 FastFourierTransformer fft = new FastFourierTransformer(DftNormalization.STANDARD);

 //The size of the array used by the fft must be a power of two, wrapping 
 //the original array in a bigger one padded to zero
 //NOTE: Here I assume that the input array is smaller than 8192
 double[] paddedSignal = new double[8192];
 System.arraycopy(signal, 0, paddedSignal, 0, signal.length);

 //First fft (forward) to switch from amplitude domain to the frequency domain
 Complex[] transformed = fft.transform(paddedSignal, TransformType.FORWARD);

 // Calculate the conjugate of the complex array
 for (int i=0; i<transformed.length; i++)
  transformed[i] = transformed[i].conjugate();

 //Second fft (inverse) to complete the autocorrelation
 transformed = fft.transform(transformed, TransformType.INVERSE);

 //Calculate the array of corresponding real values to switch 
 // from the frequency domain to the amplitude domain
 double[] autocorrelationMatrix = new double[transformed.length];
 for (int i=0; i<transformed.length; i++) {
  if (Double.isNaN(transformed[i].abs()) || Double.isInfinite(transformed[i].abs()))
   autocorrelationMatrix[i] = 0;
  else
   autocorrelationMatrix[i] = transformed[i].abs();
 }

 //Get the index of the max amplitude
 Integer indexOfMax = Utils.indexOfMax(autocorrelationMatrix);

 return transformed[indexOfMax].getReal()*audioFormat.getSampleRate()/transformed.length; 
}

score 0 · Accepted Answer

您在自相关域中找到了最大值，然后用它来读取频域。这行不通，就像您可以使用时域中的索引来了解频域一样。

反而，

return autocorrelationMatrix[indexOfMax].getReal()*audioFormat.getSampleRate()/autocorrelationMatrix.length;

也就是说，您可能会发现避免额外的 IFFT 更容易。相反，只需从频域中提取最大绝对值。这将适用于采样率/变换长度的分辨率，并且可以在最大 FFT bin 的相位的帮助下进行细化。

java - 从 FFT 结果中获取频率的 Java 代码

1 回答 1

Related

Reference