python - Pydub 原始音频数据

Question

我在 Python 3.4 中使用 Pydub 来尝试检测一些音频文件的音高。

我有一个有效的音高检测算法（McLeod Pitch Method），它对于实时应用程序非常强大（我什至用它制作了一个 Android 音高检测应用程序：https ://github.com/sevagh/Pitcha ）。

我的问题是，当我将算法应用于 AudioSegment._data 时，我没有从算法中获得任何有意义的输出。

代码：

from pydub import AudioSegment

sound = AudioSegment.from_wav(file="./8700hz.wav")

#sampling rate = sound.frame_rate = 44100hz
mpm = Mpm(sound.frame_rate, len(sound._data))
print(mpm.get_pitch(sound._data))

输出：

Pitch: 150.000002396

如果我从扬声器播放相同的 wav 文件，从麦克风录制它并将算法应用于原始麦克风捕获（有符号的 16 位小端 PCM，44100Hz，单声道），我会得到正确的音高。

AudioSegment._data 不会返回我所期望的吗？

score 10 · Accepted Answer

sound._data是一个bytestring。我不确定输入Mpm期望什么，但您可能需要将其转换bytestring为array类似的：

import array
from pydub import AudioSegment
from pydub.utils import get_array_type

sound = AudioSegment.from_wav(file="./8700hz.wav")

bit_depth = sound.sample_width * 8
array_type = get_array_type(bit_depth)

numeric_array = array.array(array_type, sound._data)

python - Pydub 原始音频数据

1 回答 1

Related

Reference