python - 区分作为峰值一部分的局部最大值和峰值的绝对最大值

Question

我从 mp3 的 10 秒剪辑中获取了幅度数据。然后我对其进行了快速傅立叶变换，以获取频域中剪辑的数据（如第一张图所示）。我现在想确定峰值所在的频率。

幅度频率

我首先对数据进行了平滑处理，可以在下面的蓝色和红色图中看到。我创建了一个阈值，峰值必须超过才能被考虑。这是下面第三个图中的水平蓝线。可以看出，我的峰值检测代码在一定程度上有效。

平滑和峰值检测

我现在遇到的问题在下面显示的最终图中很明显。我的代码正在寻找作为整体峰值一部分的局部最大值的最大值。我需要一种方法来过滤掉这些局部最大值，以便对于每个峰值，我只能得到一个标记。即对于下面显示的峰值，我只想要绝对峰值处的标记，而不是沿途的每个次要峰值。

峰值检测的放大视图

我的峰值检测代码如下所示：

for i, item in enumerate(xavg): #xavg contains all the smoothed data points
    if xavg[i] > threshold: #points must be above the threshold
        #if not the first or last point (so index isn't out of range)            
        if (i > 0) and (i < (len(xavg)-1)): 
            #greater than points on either side                
            if (xavg[i] > xavg[i-1]) and (xavg[i] > xavg[i+1]):  
                max_locations.append(i)

编辑：我认为我没有足够清楚地说明我的问题。我想找到地块上 5 个左右最高尖峰的位置，而不仅仅是总体最高点。我基本上试图通过标记其主要频率来为剪辑提供音频指纹。

EDIT2：更多代码可帮助显示我在 FFT 和平滑方面所做的工作：

def movingaverage(interval, window_size):
    window = np.ones(int(window_size))/float(window_size)
    return np.convolve(interval, window, 'same')

fft = np.fft.rfft(song)
xavg = movingaverage(abs(fft), 21)

score 1 · Accepted Answer

您的值可以划分为交替的超过阈值和低于阈值的区域。当您找到局部最大值时，请跟踪哪个最大值，直到您的值再次低于阈值。将该“区域”最大值设置为真正的峰值，然后继续下一个超过阈值的区域。就像是：

# Store the true peaks
peaks = []

# If you consider the first value a possible local maxima.
# Otherwise, just initialize max_location to (None, 0)
if xavg[0] > xavg[1]:
    max_location = (0, xavg[0])
else:
    max_location = (None,0) # position and value

# Use a slice to skip the first and last items.
for i, item in enumerate(xavg[1:-1]):
    if xavg[i] > threshold:
        if ((xavg[i] > xavg[i-1]) and
            (xavg[i] > xavg[i+1]) and
            xavg[i] > max_location[1]):
            max_location = (i, xavg[i])
    else:
        # If we found a previous largest local maxima, save it as a true
        # peak, then reset the values until the next time we exceed the threshold
        if max_location[0] is not None:
            peaks.append(max_location[0])
        max_location = None
        max_location_value = 0

# Do you consider the last value a possible maximum?
if xavg[i+1] > xavg[i] and xavg[i+1] > max_location[1]:
    max_location = (i+1, xavg[i+1])

# Check one last time if the last point was over threshold.
if max_location[0] is not None:
    peaks.append(max_location[0])

score 0 · Accepted Answer

您当前正在制作前一点和下一点低于当前点的任何点的列表 (max_locations)。如果您只对绝对最大值感兴趣，您可以这样做：

xavg.index(max(xavg[startPosition:endPosition]))

或者，如果您想保持代码完整，可以在将自身置于 max_location 位置之前检查当前点是否大于任何其他点：

如果 xavg[i] > xavg[max_location]：max_location = i

score 0 · Accepted Answer

有很多关于这方面的文献。在我看来，您至少有四个选择。我特别推荐选项 2 和 3。

正如 chepner 所建议的那样，考虑到您的峰值是相同的，直到它低于您的阈值。
或者，如果您认为在再次上升之前可能没有完全低于阈值的峰值（双峰），您可以计算 FFT 的导数。因此，当峰值低于阈值或导数变为零（峰值之间的谷）时，您将其称为不同的峰值。
你总是可以将函数拟合到你的峰值，比如高斯或洛伦兹。这可能是我提到的方法中最稳健的方法。
您确定最高峰，保存它并删除它周围的 N 个点（N=3,4,5，...您选择），然后找到第二个峰值，并删除它周围相同数量的点...

score 0 · Accepted Answer

峰值查找非常棘手，如果可能的话，我会避免尝试实现自己的代码。尝试使用scipy.signal.find_peaks_cwt，您可以使用一些参数。使用此功能，我认为您不需要事先对数据进行平滑处理，因为其中一个参数基本上是用于平滑数据的长度列表。粗略地说，该算法在一个长度尺度上平滑数据，寻找峰值，在另一个长度尺度上平滑，寻找峰值等。然后它寻找出现在所有或大多数长度尺度上的峰值。

python - 区分作为峰值一部分的局部最大值和峰值的绝对最大值

4 回答 4

Related

Reference