我有一个应用程序,我们需要根据语音命令打开某些屏幕,比如如果用户说“打开设置”,那么它应该打开设置屏幕,到目前为止我已经使用了SpeechKit
框架,但我无法检测到言语沉默。就像Siri的做法一样。我想检测用户是否结束了他的句子/短语。
请在下面的代码中找到我SpeechKit
以两种方式集成框架的地方。
A) 通过闭包( recognitionTask(with request: SFSpeechRecognitionRequest, resultHandler: @escaping (SFSpeechRecognitionResult?, Error?) -> Swift.Void) -> SFSpeechRecognitionTask
)
let audioEngine = AVAudioEngine()
let speechRecognizer = SFSpeechRecognizer()
let request = SFSpeechAudioBufferRecognitionRequest()
var recognitionTask: SFSpeechRecognitionTask?
func startRecording() throws {
let node = audioEngine.inputNode
let recordingFormat = node.outputFormat(forBus: 0)
node.installTap(onBus: 0, bufferSize: 1024,
format: recordingFormat) { [unowned self]
(buffer, _) in
self.request.append(buffer)
}
audioEngine.prepare()
try audioEngine.start()
weak var weakSelf = self
recognitionTask = speechRecognizer?.recognitionTask(with: request) {
(result, error) in
if result != nil {
if let transcription = result?.bestTranscription {
weakSelf?.idenifyVoiceCommand(transcription)
}
}
}
}
但是当我说任何像“Open Setting”这样的单词/句子时,closure( recognitionTask(with:)
)会被调用多次,我已经将method( idenifyVoiceCommand
)放在了多次调用的闭包中,所以我怎么能限制只调用一次。
而且我还在谷歌搜索时查看了 Timer 逻辑(SFSpeechRecognizer - 检测话语结束),但在我的场景中它不起作用,因为我没有停止音频引擎,因为它像Siri一样不断地听用户的声音。
B) 通过委托( SFSpeechRecognitionTaskDelegate
)
SpeechRecognizer.recognitionTask(with: self.request, delegate: self)
func speechRecognitionTaskWasCancelled(_ task: SFSpeechRecognitionTask) {
}
func speechRecognitionTask(_ task: SFSpeechRecognitionTask, didFinishSuccessfully successfully: Bool) {
}
而且我发现在发生演讲结束时处理的代表不调用它,有时会不小心调用它。