c# - Microsoft Speech SDK 11 中的语法过多

Question

我编写了简单的语音识别应用程序，可以将语法加载到引擎中。

但我明白了，不能将许多语法加载到引擎中，而不是 1024 个语法。

Additional information: Too many grammars have been loaded. Number of grammars cannot exceed 1024.

当我加载 1024 语法时，它无法识别输入流.wav（和我的语音）文件：

 Thread.CurrentThread.CurrentCulture = new CultureInfo("ru-RU");
        Thread.CurrentThread.CurrentUICulture = new CultureInfo("ru-RU");

        // Create a new SpeechRecognitionEngine instance.
         _sre = new SpeechRecognitionEngine(new System.Globalization.CultureInfo("ru-RU"));

         _sre.SpeechHypothesized += _sre_SpeechHypothesized;
         _sre.SpeechDetected += _sre_SpeechDetected;
         _sre.SetInputToWaveFile(@"c:\Test\Wavs\Wavs-converted\file.wav");


 public void LoadGrammarIntoEngine(IEnumerable<String> textColl)
    {
        Choices choises = new Choices();
        GrammarBuilder gb = new GrammarBuilder();
        gb.Culture = new CultureInfo("ru-RU");


        if (choises != null && textColl != null)
        {
            choises.Add(textColl.ToArray());

            if (gb != null)
                gb.Append(choises);
            else
            {
                Console.WriteLine();
            }

            if (_sre.Grammars.Count < 1024)
            {
                Grammar g = new Grammar(gb);
                if (_sre != null && g != null)
                    _sre.LoadGrammar(g);
                else
                {
                    Console.WriteLine();
                }
            }
            else
            {
               Console.WriteLine("too many grammars");
            }
        }

    }

PS 当我加载 5-10 个语法（每个 100 个单词）时，效果很好。

也许我可以\应该一起使用多个识别引擎？

score 3 · Accepted Answer

从评论中，您从根本上采取了错误的方法。您需要使用System.Speech.Recognition.DictationGrammar之类的东西- 它使用 Microsoft 桌面 SR 引擎。

这将接受大多数英语单词。如果您需要将其限制为 1000 字，有几个选项。

如果您的单词列表包含不在默认单词列表中的单词（非常广泛），您可以使用Lexicon Interfaces，遗憾的是，它不会通过 System.Speech.Recognition 公开，因此您必须放到 SAPI 使用它们。

这还假设您可以拒绝词汇表外的识别。如果不是这种情况，您可以使用Dictation Resource Kit，它可以让您构建自定义语言模型；请注意，它是由语音科学家为语音科学家构建的，因此文档很难进行。

在实践中，用户会说词外的东西；最好检查并拒绝它们。小（是的，1000 个单词很小）词汇往往存在误报问题（用户说的词汇外的东西被识别为词汇内的东西）。命令和控制语法也会发生这种情况。

c# - Microsoft Speech SDK 11 中的语法过多

1 回答 1

Related

Reference