c# - 为 Amazon Lex 编码语音输入？

Question

我遇到了 Amazon Lex 的语音输入编码问题。

如果我将 InputStream 分配为空，它会起作用，我会收到来自 Lex 的默认语音回答：“我该如何帮助你”

            var amazonLexClient = new AmazonLexClient("APPID", "APPSECRET", Amazon.RegionEndpoint.USEast1);
            var amazonPostRequest = new Amazon.Lex.Model.PostContentRequest();
            var amazonPostResponse = new Amazon.Lex.Model.PostContentResponse();
            amazonPostRequest.BotAlias = "BookTrip";
            amazonPostRequest.BotName = "BookTrip";
            amazonPostRequest.ContentType = "audio/l16; rate=16000; channels=1";
            amazonPostRequest.UserId = "user";
            amazonPostRequest.InputStream = null;

            amazonPostResponse = await amazonLexClient.PostContentAsync(amazonPostRequest);

如果我尝试使用下面的编码（Lex 要求：16KHz、8 位、1 通道）发送录制的语音“你好吗”

            var amazonLexClient = new AmazonLexClient("APPID", "APPSECRET", Amazon.RegionEndpoint.USEast1);
            var amazonPostRequest = new Amazon.Lex.Model.PostContentRequest();
            var amazonPostResponse = new Amazon.Lex.Model.PostContentResponse();
            amazonPostRequest.BotAlias = "BookTrip";
            amazonPostRequest.BotName = "BookTrip";
            amazonPostRequest.ContentType = "audio/l16; rate=16000; channels=1";
            amazonPostRequest.UserId = "user";
            amazonPostRequest.InputStream = new MemoryStream();

            WaveFormat target = new WaveFormat(16000, 8, 1);
            WaveStream streamIn = new WaveFileReader("F:\\Whatever.wav");
            WaveFormatConversionStream str = new WaveFormatConversionStream(target, streamIn);
            WaveFileWriter.WriteWavFileToStream(amazonPostRequest.InputStream, str);

            amazonPostResponse = await amazonLexClient.PostContentAsync(amazonPostRequest);

然后就不行了，大约 20~25s 后 Lex 服务器会返回 null。

Amazon.Runtime.AmazonUnmarshallingException: 'Error unmarshalling response back from AWS.'
NullReferenceException: Object reference not set to an instance of an object.

谁能告诉我如何对 wav 文件进行编码以使其与 Amazon Lex 一起使用？顺便说一句，我使用 Visual Studio 2017、C# 和 NAudio 库。

score 0 · Accepted Answer

c# 的 AWSSDk 内部似乎存在某种问题 - 发生的情况是 Lex 服务返回了纯文本错误消息，并且 SDK 正在尝试将其解析为 JSON。有时您可以深入研究异常细节并找到原始响应，或者只使用 Fiddler。

score 0 · Accepted Answer

Amazon Lex 要求音频为 PCM 或 Opus 格式（阅读此文档了解更多详细信息）。您可以参考 Amazon 的这篇AI 博客文章，以获取有关如何对您的 wav 音频进行 PCM 编码的更多信息。

c# - 为 Amazon Lex 编码语音输入？

2 回答 2

Related

Reference