我是一名 10 年级的高中计算机学生,正在寻求有关 python 音频的帮助。我没有要求任何人为我做任何工作,但我已经陷入僵局。
我正在尝试使用 python(而不是通过使用 GTTS 或 pyttsx)创建一个自定义语音程序。我正在尝试通过共振峰语音合成来创建音素。
我尝试过使用 pyaudio 和单频,并且研究过正弦波和波形,但我仍然在努力产生任何音素。
我所能做的就是制造不同频率的哔哔声。
import pyaudio
import numpy as np
p = pyaudio.PyAudio()
volume = 0.5 # range [0.0, 1.0]
fs = 84100 # sampling rate, Hz, must be integer
duration = 1.0 # in seconds, may be float
f = 800.0 # sine frequency, Hz, may be float
# generate samples, note conversion to float32 array
samples = (np.sin(2*np.pi*np.arange(fs*duration)*f/fs)).astype(np.float32)
# for paFloat32 sample values must be in range [-1.0, 1.0]
stream = p.open(format=pyaudio.paFloat32,
channels=1,
rate=fs,
output=True)
p2 = pyaudio.PyAudio()
volume2 = 0.5 # range [0.0, 1.0]
fs2 = 49100 # sampling rate, Hz, must be integer
duration2 = 1.0 # in seconds, may be float
f2 = 680.0 # sine frequency, Hz, may be float
# generate samples, note conversion to float32 array
samples2 = (np.sin(2*np.pi*np.arange(fs2*duration2)*f2/fs2)).astype(np.float32)
# for paFloat32 sample values must be in range [-1.0, 1.0]
stream2 = p2.open(format=pyaudio.paFloat32,
channels=1,
rate=fs2,
output=True)
# play. May repeat with different volume values (if done interactively)
stream2.write(volume2*samples2)
stream2.stop_stream()
stream2.close()
p2.terminate(
# play. May repeat with different volume values (if done interactively)
stream.write(volume*samples)
stream.stop_stream()
stream.close()
p.terminate()
如何仅通过在 python 中使用采样率和频率来产生音素?