python-3.x - How to split an audio file in Python

Question

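I am doing speech sentiment analysis on customer service data. I have an audio file in which a customer service officer asks a question and the customer gives his comments.

I need to split this audio and keep only the customer's comments, so that I can run sentiment analysis on them and tell whether the customer is happy, sad, or neutral.

Please let me know how to split the audio file so that I get only the customer's audio. The audio format is ".aac".

This is what I have done so far: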

from os import path
AUDIO_FILE = path.join(path.dirname(path.realpath('C:\\Users\\anagha\\Documents\\Python Scripts')),"Python Scripts\\audioa.aac")

# Note: AUDIO_FILE is a path string, so this is half the length of the path, not of the audio
halfway_point = len(AUDIO_FILE) / 2

score 6 · Accepted Answer

Since you used the pydub tag, here is how to do it with pydub:

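from pydub import AudioSegment

# AUDIO_FILE is the path defined in the question; decoding .aac requires ffmpeg
sound = AudioSegment.from_file(AUDIO_FILE)

# len() of an AudioSegment is its duration in milliseconds
halfway_point = len(sound) // 2
first_half = sound[:halfway_point]

# create a new file "first_half.mp3":
first_half.export("/path/to/first_half.mp3", format="mp3")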
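
Cutting at the halfway point only helps if the customer happens to speak in the second half. If you already know roughly when the customer talks, the same slicing works for any time range, because pydub indexes an AudioSegment in milliseconds. A minimal sketch, where the helper names `cut_seconds` and `export_clip` are made up for illustration and are not part of pydub:

```python
def cut_seconds(sound, start_s, end_s):
    """Return the part of `sound` between start_s and end_s (in seconds).

    `sound` can be any object that is sliced in milliseconds,
    such as a pydub AudioSegment.
    """
    start_ms = int(start_s * 1000)
    end_ms = int(end_s * 1000)
    return sound[start_ms:end_ms]


def export_clip(in_path, start_s, end_s, out_path):
    """Hypothetical convenience wrapper: load, cut, and write one clip."""
    # Imported here so cut_seconds() can be used without pydub installed.
    from pydub import AudioSegment

    sound = AudioSegment.from_file(in_path)
    clip = cut_seconds(sound, start_s, end_s)
    clip.export(out_path, format="mp3")
```

For example, `export_clip(AUDIO_FILE, 12.5, 40, "/path/to/customer.mp3")` would write the stretch from 12.5 s to 40 s, with the timestamps here being placeholders.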

score 1 · Accepted Answer

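I think it is too late to answer the original question, but people who stumble upon it may find this procedure useful:

-> Use a tool to diarize the data, that is, to label who speaks when. I have used LIUM (http://www-lium.univ-lemans.fr/diarization/doku.php)

-> Interpret the output based on this beautifully simple SO post (Parsing LIUM Speaker Diarization Output)

-> Then, finally, splice the audio file using the timings obtained above! Converting the speech to text, however, is a completely different challenge: it needs either a deep learning approach (with a lot of data) or an API provider (such as Google).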
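
The parsing and splicing steps above can be sketched roughly as follows. This assumes the usual LIUM `.seg` layout (show name, channel, start, length, gender, band, environment, speaker label, with lines starting with `;;` being comments) and assumes start and length are counted in 10 ms frames; check both against your own output. The function names are made up for illustration:

```python
from collections import defaultdict

FRAME_MS = 10  # assumption: LIUM start/length values are 10 ms frames


def parse_lium_seg(text):
    """Return {speaker_label: [(start_ms, end_ms), ...]} from .seg text."""
    by_speaker = defaultdict(list)
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith(";;"):
            continue  # skip blank lines and cluster comment lines
        fields = line.split()
        if len(fields) < 8:
            continue  # not a segment line in the assumed layout
        start, length = int(fields[2]), int(fields[3])
        by_speaker[fields[7]].append(
            (start * FRAME_MS, (start + length) * FRAME_MS)
        )
    for ranges in by_speaker.values():
        ranges.sort()  # chronological order
    return dict(by_speaker)


def cut_ranges(sound, ranges):
    """Slice `sound` (indexed in ms, e.g. a pydub AudioSegment) per range."""
    return [sound[start:end] for start, end in ranges]


def export_speaker(audio_path, seg_path, speaker, out_path):
    """Hypothetical wrapper: join all clips of one speaker into one file."""
    # Imported here so the parsing helpers work without pydub installed.
    from pydub import AudioSegment

    with open(seg_path) as f:
        ranges = parse_lium_seg(f.read())[speaker]
    sound = AudioSegment.from_file(audio_path)
    combined = AudioSegment.empty()
    for clip in cut_ranges(sound, ranges):
        combined += clip  # "+" concatenates AudioSegments
    combined.export(out_path, format="wav")
```

The diarizer only produces anonymous labels such as `S0` and `S1`, so you still have to decide which label is the customer, for example by listening to one clip per speaker.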
