google-cloud-platform - Better quality for Google Cloud Text-to-Speech

Question

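I am using the Google Cloud Text to Speech module in Integromat for my Adalo app. I want better audio quality, so I have already switched to WaveNet and raised the sample rate to 48000 Hz, but the quality is still poor. What can I do? I appreciate every idea, thanks in advance!

Best regards, Ben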

score 0 · Accepted Answer

You may need to specify effectsProfileId, as described in the documentation at https://cloud.google.com/text-to-speech/docs/audio-profiles:

const effectsProfileId = ['telephony-class-application'];

const request = {
  input: {text: text},
  voice: {languageCode: languageCode, ssmlGender: ssmlGender},
  audioConfig: {audioEncoding: 'MP3', effectsProfileId: effectsProfileId},
};

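The quality of the speech will vary depending on the type of device the resulting audio is played on, so pick the profile that matches that device.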
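As a rough sketch of how the pieces above might fit together, the helper below builds a request that names a WaveNet voice explicitly, asks for uncompressed LINEAR16 output instead of MP3, and sets an audio profile for the playback device. The function buildTtsRequest, its option names, the voice 'en-US-Wavenet-D', and the choice of 'headphone-class-device' are assumptions for illustration, not something the question or the answer specified. It leaves sampleRateHertz unset on the assumption that the voice's natural rate is preferable, since the API reference warns that resampling may lower quality. The sketch only builds the object and does not call the API.

```javascript
// Hypothetical helper (not part of any Google library): builds the request
// object that would be passed to client.synthesizeSpeech(request) from the
// @google-cloud/text-to-speech package.
function buildTtsRequest(text, options = {}) {
  const {
    languageCode = 'en-US',
    // Assumed WaveNet voice; check the voice list for your language.
    voiceName = 'en-US-Wavenet-D',
    // Assumed playback device; use the profile that matches yours.
    effectsProfileId = 'headphone-class-device',
    // LINEAR16 is uncompressed PCM, which avoids MP3 compression artifacts.
    audioEncoding = 'LINEAR16',
  } = options;

  return {
    input: {text},
    voice: {languageCode, name: voiceName},
    audioConfig: {
      audioEncoding,
      // The API expects an array of profile IDs.
      effectsProfileId: [effectsProfileId],
      // sampleRateHertz is intentionally omitted (see the note above).
    },
  };
}

const request = buildTtsRequest('Hello from Adalo');
console.log(JSON.stringify(request, null, 2));
```

With the Node.js client this object would be passed as `const [response] = await client.synthesizeSpeech(request);`. In Integromat the same fields would have to be set through the module's own options, if it exposes them.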

score 0 · Accepted Answer

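Convert your data to the recommended encoding that Google mentions. Use the same format as in the Google documentation, for example FLAC. This will give adequate accuracy. Use stereo recording, with a separate channel for each speaker.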
