python - 将来自 Dragon NaturallySpeaking 的所有输入重定向到 Python？（使用 Natlink）

Question

我目前正在编写一个 AI 程序，它接收来自 Dragon NaturallySpeaking（使用 Natlink）的输入，对其进行处理，然后返回语音输出。我能够想出一个接收器语法库，它捕获来自 Dragon 的所有输入并将其发送到我的解析器。

    class Receiver(GrammarBase):

        gramSpec = """ <start> exported = {emptyList}; """

        def initialize(self):
            self.load(self.gramSpec, allResults = 1)
            self.activateAll()

        def gotResultsObject(self, recogType, resObj):
            if recogType == 'reject':
                inpt, self.best_guess = [], []
            else:
                inpt = extract_words(resObj)
                inpt = process_input(inpt) # Forms a list of possible interpretations
                self.best_guess = resObj.getWords(0)
            self.send_input(inpt)

        def send_input(self, inpt):
            send = send_to_parser(inpt) # Sends first possible interpretation to parser
            try:
                while True:
                    send.next() # Sends the next possible interpretation if the first is rejected
            except StopIteration: # If all interpretations are rejected, try sending the input to Dragon
                try:
                    recognitionMimic(parse(self.best_guess))
                except MimicFailed: # If that fails too, execute all_failed
                    all_failed()

此代码按预期工作，但有几个问题：

Dragon 在将输入发送到我的程序之前对其进行处理。例如，如果我说“打开 Google Chrome。”，它会打开 Google Chrome，然后将输入发送到 Python。有没有办法在不先处理输入的情况下将输入发送到 Python？
当我调用 waitForSpeech() 时，会弹出一个消息框，说明 Python 解释器正在等待输入。是否有可能（为了美观和方便）阻止消息框出现，而是在用户显着暂停后终止语音收集过程？

谢谢！

score 3 · Accepted Answer

关于您的第一个问题，事实证明 DNS 在内部使用“Open ...”话语作为其命令解析过程的一部分。这意味着 DNS 在 natlink 有机会之前解析语音并执行命令方式。解决此问题的唯一方法是在您的 natlink 语法中将话语从“Open ...”更改为“Trigger ...”（或更改为 DNS 除了“Trigger”之外未使用的其他话语）。

一些 natlink 开发人员在 Speechcomputing.com 上闲逛。你可能会在那里得到更好的回应。

祝你好运！

python - 将来自 Dragon NaturallySpeaking 的所有输入重定向到 Python？（使用 Natlink）

1 回答 1

Related

Reference