python - 如何在 Python 程序中嵌入 Google Speech to Text API？

Question

我有一个项目，其中我在客户端和主机之间创建了一个聊天程序，我必须在其中嵌入 Speech to Text。有什么方法可以在我的程序中嵌入 Google Speech to Text API 吗？

score 5 · Accepted Answer

PyPI 中有一个名为Speech Recognition的包，看起来它可以做到这一点。现场（即通过麦克风）API 看起来非常简单。

# NOTE: this requires PyAudio because it uses the Microphone class
import speech_recognition as sr
r = sr.Recognizer()
with sr.Microphone() as source:                # use the default microphone as the audio source
    audio = r.listen(source)                   # listen for the first phrase and extract it into audio data

try:
    print("You said " + r.recognize(audio))    # recognize speech using Google Speech Recognition
except LookupError:                            # speech is unintelligible
    print("Could not understand audio")

它还具有转录 WAV 文件、作为后台进程运行、为转录提供置信度值等功能。

score 2 · Accepted Answer

这可能是您不想在您的情况下使用的，但对于其他可能需要它用于一次性项目的人，我不久前编写了一个简单的 python 客户端，它使用 Chrome 中内置的 API 进行语音搜索：

https://github.com/korylprince/python-google-transcribe

要让它工作，你必须有 16000Hz 编码的 FLAC，而且它们必须相当短。

此外，就像评论中提到的那样，API 是非官方的，所以谁知道它什么时候会停止工作。

score 2 · Accepted Answer

您可以尝试 Nexiwave 的免费语音到文本 api。这是 python 示例： http: //nexiwave.com/api_samples/nexiwave_py.txt。另请查看 API 指南： http: //nexiwave.com/index.php/119-integrate-in-5-minutes。很简单。

您必须先注册才能使用免费计划。

python - 如何在 Python 程序中嵌入 Google Speech to Text API？

3 回答 3

Related

Reference