ios - 带有浮点缓冲区 AudioUnit 的 FFT 输出

Question

我对 vDSP_zrip 和 AudioUnit 的使用和配置有疑问。事实上，我将 AudioUnit 配置为将打包数据保存为浮点数。我创建了一个循环缓冲区，当这个缓冲区已满时，我计算一个 fft。我有结果，但我不明白为什么 fft 输出不好（参见图）

音频单元配置：

// describe format
AudioStreamBasicDescription audioFormat;
audioFormat.mSampleRate       = 44100;
audioFormat.mFormatID         = kAudioFormatLinearPCM;
audioFormat.mFormatFlags      = kAudioFormatFlagsNativeEndian|kAudioFormatFlagIsPacked|kAudioFormatFlagIsFloat|kAudioFormatFlagIsNonInterleaved;
audioFormat.mFramesPerPacket  = 1;
audioFormat.mChannelsPerFrame = 1; // mono
audioFormat.mBitsPerChannel   = sizeof(float) * 8;
audioFormat.mBytesPerFrame    = audioFormat.mChannelsPerFrame * sizeof(float);
audioFormat.mBytesPerPacket   = audioFormat.mFramesPerPacket * audioFormat.mBytesPerFrame;

循环缓冲区：

_audioSample = new AudioSample(8192, 44100);
// in recording callback :
for(int i = 0; i < bufferList.mNumberBuffers; ++i)
{
    if(!status)
    {
        if(_sample->needData())
            _sample->put((float*)bufferList.mBuffers[i].mData,
                    bufferList.mBuffers[i].mDataByteSize);
        [...]
    }
}

vDSP 调用：

  // get a split complex vector (real signal divided into an even-odd config
  vDSP_ctoz((COMPLEX *)sample.get(), 2, &_complex, 1, _fftsize);
  vDSP_fft_zrip(_fftsetup, &_complex, 1, _log2n, kFFTDirection_Forward);
  // scale (from vDSP reference)
  float scale = 1.0 / (2.0 * _samples);
  vDSP_vsmul(_complex.realp, 1, &scale, _complex.realp, 1, _fftsize);
  vDSP_vsmul(_complex.imagp, 1, &scale, _complex.imagp, 1, _fftsize);
  _complex.imagp[0] = 0.0;

在哪里_fftsize = _audioSample.capacity()/2

score 9 · Accepted Answer

您的输出看起来很合理，因此我将您的问题解释为“如何清理这些结果？”

1）您可能正在使用矩形窗口

这意味着您没有进行任何窗口化，这会在您的结果中引入一些噪音。vDSP 带有一些用于执行窗口化的函数，您可以像这样使用它们：

// N = number of samples in your buffer
int N = _audioSample.capacity();

// allocate space for a hamming window
float * hammingWindow = (float *) malloc(sizeof(float) * N);

// generate the window values and store them in the hamming window buffer
vDSP_hamm_window(hammingWindow, N, 0);

然后，每当您要进行 FFT 时，首先对样本进行窗口化（如在调用 vDSP_ctoz 之前执行此操作）：

 vDSP_vmul(sample.get(), 1, hammingWindow, 1, sample.get(), 1, N);

2）您可能希望对结果运行幅度函数

这将为您提供类似于您在标准 FFT 条形图音乐可视化工具中看到的结果。在 FFT之后执行此操作：

vDSP_zvmags(&_complex, 1, &_complex.realp, 1, _fftsize);

之后，_complex.realp 将是一个浮点值数组，表示每个 FFT bin 的大小。

ios - 带有浮点缓冲区 AudioUnit 的 FFT 输出

1 回答 1

Related

Reference