Multi-utterance extension of speech_recognizer.recognize_once() API #397

larschristensen · 2019-10-11T10:25:16Z

I'm using the speech_recognizer.recognize_once() method for synchronous/blocking transcription of audio files. However, this method only does recognition of a single utterance, but the files I wish to transcribe can contain multiple utterances.

What is the recommended approach for synchronous/blocking multi-utterance transcription of audio files? Would it be possible to extend the speech_recognizer.recognize_once() API to also accept multi-utterance audio files?

chlandsi · 2019-10-11T16:07:40Z

Please have a look at the continuous recognition mode. There is a sample here. This comment shows how to accumulate all results from continuous recognition.

larschristensen · 2019-10-14T14:37:09Z

Please have a look at the continuous recognition mode. There is a sample here. This comment shows how to accumulate all results from continuous recognition.

Thanks for the information, the sample code seems to work fine with multiple utterances. However in this continuous recognition mode, I can't seem to delete the audio file immediately after the recognition is complete. Is there something I have to do in order to close the audio file used for recognition?

chlandsi · 2019-10-14T14:51:23Z

You could try calling del recognizer after the recognition is finished (i.e., after you have received a session stopped event) to clean up the recognizer resources. Also have a look at the Batch API (sample), maybe it fits for your needs.

larschristensen · 2019-10-14T19:54:06Z

I have tried deleting both speech_recognizer and others after recognition is completed, but it doesn't make any difference. Could it be that the file access is somehow not correctly released in the SDK after use when using speech_recognizer.start_continuous_recognition()?

chlandsi · 2019-10-15T08:04:05Z

Hi @larschristensen, could you share the code that shows the problem you are having? Also, could you check the discussion in #352 to see whether it might be related? Thanks!

larschristensen · 2019-10-15T08:54:08Z

Hi @larschristensen, could you share the code that shows the problem you are having? Also, could you check the discussion in #352 to see whether it might be related? Thanks!

@chlandsi Thanks for the input. The issue identified in #352 indeed seems to be the same as I'm having: If I do del speech_recognizer._impl, the clean-up correctly does its job and I can delete the audio file afterwards without problems. Maybe you should update the sample code and/or SDK to not have this problem?

chlandsi · 2019-10-15T09:01:30Z

@larschristensen Good to know that this is the underlying issue. You should then also be able to solve the problem by calling recognizer.canceled.disconnect_all() (etc. for the other signals) after recognition has finished, or moving the call to stop_continuous_recognition out of the callback. We do indeed have work items in the backlog to make this easier and clearer, but no ETA yet.

I'll proceed to close the issue, please reopen if you continue to have problems. Thanks!

pankopon assigned chlandsi Oct 15, 2019

chlandsi closed this as completed Oct 15, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Multi-utterance extension of speech_recognizer.recognize_once() API #397

Multi-utterance extension of speech_recognizer.recognize_once() API #397

larschristensen commented Oct 11, 2019

chlandsi commented Oct 11, 2019

larschristensen commented Oct 14, 2019

chlandsi commented Oct 14, 2019

larschristensen commented Oct 14, 2019

chlandsi commented Oct 15, 2019

larschristensen commented Oct 15, 2019

chlandsi commented Oct 15, 2019

Multi-utterance extension of speech_recognizer.recognize_once() API #397

Multi-utterance extension of speech_recognizer.recognize_once() API #397

Comments

larschristensen commented Oct 11, 2019

chlandsi commented Oct 11, 2019

larschristensen commented Oct 14, 2019

chlandsi commented Oct 14, 2019

larschristensen commented Oct 14, 2019

chlandsi commented Oct 15, 2019

larschristensen commented Oct 15, 2019

chlandsi commented Oct 15, 2019