python-3.x - 如何在 Python 3 中比较和可视化两个 .wav 文件？

Question

我在 Jupyter 上使用 Python 3，我试图找出音频文件（有一些噪音）与没有原始文件的准确性。请在下面找到我在网上找到的代码，

import librosa
import matplotlib.pyplot as plt
from dtw import dtw

#Loading audio files
y1, sr1 = librosa.load('data/dev1_female3_liverec_130ms_1m_sim_1.wav') 
y2, sr2 = librosa.load('data/dev1_female3_liverec_130ms_1m_sim_1o.wav') 

#Showing multiple plots using subplot
plt.subplot(1, 2, 1) 
mfcc1 = librosa.feature.mfcc(y1,sr1)   #Computing MFCC values
librosa.display.specshow(mfcc1)

plt.subplot(1, 2, 2)
mfcc2 = librosa.feature.mfcc(y2, sr2)
librosa.display.specshow(mfcc2)

dist, cost, path = dtw(mfcc1.T, mfcc2.T)
print("The normalized distance between the two : ",dist)   # 0 for similar audios 

plt.imshow(cost.T, origin='lower', cmap=plt.get_cmap('gray'), interpolation='nearest')
plt.plot(path[0], path[1], 'w')   #creating plot for DTW

plt.show()

我收到一个错误“找不到模块名称 librosa”

score 2 · Accepted Answer

首先你必须安装 librosa ，如果你使用的是 python 的 anaconda 发行版，那么在 anaconda 提示符下运行pip install librosa或者运行你的 CMD 。

其次import librosa在 jupyter 上然后开始工作。

score 0 · Accepted Answer

安装 librosa 库后。它在 librosa.display 上出现问题，因此请将librosa.display 导入为 dis并使用。它还会在 dtw(mfcc1.T, mfcc2.T) 线上给出问题，请参阅下面

https://github.com/pierre-rouanet/dtw/issues/18 2) https://github.com/pierre-rouanet/dtw/pull/19

python-3.x - 如何在 Python 3 中比较和可视化两个 .wav 文件？

2 回答 2

Related

Reference