Support speech recognition #835

Open
waj opened this issue Sep 19, 2019 · 0 comments
waj commented Sep 19, 2019

Google Cloud Speech-to-Text looks like a good fit (https://cloud.google.com/speech-to-text/). AWS also has a speech-to-text service, but the non-streaming version of its API is too slow (around 30 seconds to transcribe a single audio file) and the streaming API supports only a few languages. I also found Google's service to be much more flexible: for example, it returns digits (0-9) instead of spelled-out words when a number is read.

Add a transcribe command that uses the Google API to convert the audio to text.
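
As a rough illustration of what the transcribe command would exchange with Google, here is a minimal sketch of the JSON body for the synchronous `speech:recognize` REST method (v1) and of pulling the text out of its response. The helper names, the `LINEAR16` encoding, and the 8000 Hz sample rate are assumptions chosen for telephony-style audio, not something this issue specifies; authentication and the HTTP call itself are left out.

```python
import base64


def build_recognize_request(audio_bytes, language_code="en-US", sample_rate_hz=8000):
    """Build the JSON body for a synchronous speech:recognize call.

    Hypothetical helper: encoding and sample rate are assumed defaults.
    """
    return {
        "config": {
            "encoding": "LINEAR16",
            "sampleRateHertz": sample_rate_hz,
            "languageCode": language_code,
        },
        # The REST API expects the raw audio as a base64 string.
        "audio": {"content": base64.b64encode(audio_bytes).decode("ascii")},
    }


def extract_transcript(response):
    """Join the top alternative of each result into one string."""
    parts = []
    for result in response.get("results", []):
        alternatives = result.get("alternatives", [])
        if alternatives:
            parts.append(alternatives[0].get("transcript", "").strip())
    return " ".join(p for p in parts if p)
```

The body would be POSTed to `https://speech.googleapis.com/v1/speech:recognize`; `extract_transcript` returns an empty string when the service recognizes nothing, which the command can treat as "no input".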

Let's expose this command through the speech recognition feature of the Gather command in TwiML: https://www.twilio.com/docs/voice/twiml/gather#input
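
For reference, a TwiML document that asks for speech input looks roughly like the one generated below. This is only a sketch of the input we would need to accept: the function name, the prompt, and the `/transcribed` action URL are made up for illustration. In Twilio's own implementation the recognized text is sent to the action URL in the `SpeechResult` parameter; whether we mirror that exactly is still open.

```python
import xml.etree.ElementTree as ET


def gather_speech_twiml(prompt, action_url, language="en-US"):
    """Return a TwiML string with a Gather verb that collects speech.

    Hypothetical helper for illustration only.
    """
    response = ET.Element("Response")
    gather = ET.SubElement(
        response,
        "Gather",
        {"input": "speech", "action": action_url, "language": language},
    )
    # The nested Say is played while the caller's speech is being gathered.
    say = ET.SubElement(gather, "Say")
    say.text = prompt
    return ET.tostring(response, encoding="unicode")


if __name__ == "__main__":
    print(gather_speech_twiml("Please say your ID number.", "/transcribed"))
```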

(A new step for using this command from the web designer will come later.)
