ios - SFSpeechRecognizer 可以识别几个命令词而不是整个短语？

Question

我从 Apple 的示例应用程序 https://developer.apple.com/library/content/samplecode/SpeakToMe/Introduction/Intro.html设置了 SFSpeechRecognizer

我想知道是否有可能让识别器识别与其他先前识别的单词无关的单个单词。

例如，识别器现在会在说出“滚动”时尝试形成一个句子，然后找到有意义的单词的最佳转录，因此当说出“停止”时，它会将其更改为类似于“向下”的内容在前一个单词的上下文中更有意义。

但这不是我想要的，因为我希望我的应用程序听听单个单词作为在听时调用函数的命令。

有没有办法以这样的方式实现框架，它会不断地听单词并且只捕获单个单词？

score 6 · Accepted Answer

是的。您可以通过设置扫描部分结果上的传入单词recognitionRequest.shouldReportPartialResults = YES，然后多次调用结果回调。

然后，您可以随时处理结果，在获得最终结果之前扫描关键字/关键短语（即忽略result.isFinal）。当您找到您正在寻找的关键字/关键短语时，然后取消识别。

我已经成功地在Speaking Email中使用这种方法实现了语音命令，作为修改后的Cordova 插件（来源在这里）。

例子：

- (void) recordAndRecognizeWithLang:(NSString *) lang
{
        NSLocale *locale = [[NSLocale alloc] initWithLocaleIdentifier:lang];
        self.sfSpeechRecognizer = [[SFSpeechRecognizer alloc] initWithLocale:locale];
        if (!self.sfSpeechRecognizer) {
                [self sendErrorWithMessage:@"The language is not supported" andCode:7];
        } else {

                // Cancel the previous task if it's running.
                if ( self.recognitionTask ) {
                        [self.recognitionTask cancel];
                        self.recognitionTask = nil;
                }

                [self initAudioSession];

                self.recognitionRequest = [[SFSpeechAudioBufferRecognitionRequest alloc] init];
                self.recognitionRequest.shouldReportPartialResults = [[self.command argumentAtIndex:1] boolValue];

                self.recognitionTask = [self.sfSpeechRecognizer recognitionTaskWithRequest:self.recognitionRequest resultHandler:^(SFSpeechRecognitionResult *result, NSError *error) {

                        if (error) {
                                NSLog(@"error");
                                [self stopAndRelease];
                                [self sendErrorWithMessage:error.localizedFailureReason andCode:error.code];
                        }

                        if (result) {
                                NSMutableArray * alternatives = [[NSMutableArray alloc] init];
                                int maxAlternatives = [[self.command argumentAtIndex:2] intValue];
                                for ( SFTranscription *transcription in result.transcriptions ) {
                                        if (alternatives.count < maxAlternatives) {
                                                float confMed = 0;
                                                for ( SFTranscriptionSegment *transcriptionSegment in transcription.segments ) {
                                                        NSLog(@"transcriptionSegment.confidence %f", transcriptionSegment.confidence);
                                                        confMed +=transcriptionSegment.confidence;
                                                }
                                                NSMutableDictionary * resultDict = [[NSMutableDictionary alloc]init];
                                                [resultDict setValue:transcription.formattedString forKey:@"transcript"];
                                                [resultDict setValue:[NSNumber numberWithBool:result.isFinal] forKey:@"final"];
                                                [resultDict setValue:[NSNumber numberWithFloat:confMed/transcription.segments.count]forKey:@"confidence"];
                                                [alternatives addObject:resultDict];
                                        }
                                }
                                [self sendResults:@[alternatives]];
                                if ( result.isFinal ) {
                                        [self stopAndRelease];
                                }
                        }
                }];

                AVAudioFormat *recordingFormat = [self.audioEngine.inputNode outputFormatForBus:0];

                [self.audioEngine.inputNode installTapOnBus:0 bufferSize:1024 format:recordingFormat block:^(AVAudioPCMBuffer * _Nonnull buffer, AVAudioTime * _Nonnull when) {
                        [self.recognitionRequest appendAudioPCMBuffer:buffer];
                }],

                [self.audioEngine prepare];
                [self.audioEngine startAndReturnError:nil];
        }
}

ios - SFSpeechRecognizer 可以识别几个命令词而不是整个短语？

1 回答 1

Related

Reference