Support to Google Speech To Text transcription format #152

eyal13579 · 2019-05-07T14:31:41Z

any plans to support Google?

pietrop · 2019-05-07T18:17:28Z

Hi @eyal13579 ,
At the moment we have been relying on open source contributors to create the adapters they might need, and made a guide to help with the process. As we are only using the BBC Kaldi in our projects.

If you are interested in creating a Google STT one, and need any help to setup a PR let me know.

sshniro · 2019-07-09T18:58:03Z

Hi @pietrop , I'm familiar with Google STT and would love to contribute to this development. I have gone through the helper docs. I have a small clarification regarding which approach to take. Google STT provides speaker diarization as a beta feature, so is it okay if I use the beta results for creating the sample json ?

pietrop · 2019-07-09T19:20:25Z

Hi @sshniro, that's awesome, thanks!

re-speaker info, sounds good to use the beta feature for Google STT speaker diarization.

We have been using a flexible approach in the current module, because a lot of these STT have those optional attributes, so didn't want to impose them as defaults.

You can see in the other modules, that we check if the attribute is available, if it is we break the paragraph down based on the speaker diarization, if it isn't we break the paragraphs down based on punctuation, and add place holder names to the speaker labels. The user can then change the size of the paragraphs and speaker labels in the UI.

for example you can see in the /packages/stt-adapters/bbc-kaldi we check if the attribute is present in the json (/packages/stt-adapters/bbc-kaldi/index.js#L57) and then we either do groupWordsInParagraphs (in/packages/stt-adapters/bbc-kaldi/index.js#L15 ) or groupWordsInParagraphsBySpeakers( in/packages/stt-adapters/bbc-kaldi/group-words-by-speakers.js)

I think it be good to add support for both, creating paragraph based on speaker label as well as on punctuation, as a fallback if speaker labels are not provided. However if you want to priorities the speaker labels, and leave the punctuation one for a separate PR that's also fine.

Let me know if you have any questions.

sshniro · 2019-07-18T09:38:29Z

Hi @pietrop I have created a PR for GCP to Draft JS conversion , The PR does not cover speaker diarization. I will send another PR with the segmentation soon.

pietrop · 2019-07-19T17:21:07Z

closing as addressed by @sshniro in PR #167

eyal13579 added the Enhancement a request for improvement label May 7, 2019

eyal13579 assigned pietrop May 7, 2019

pietrop added the Speech To Text Adapters Speech To Text Adapters label May 7, 2019

pietrop added the help wanted Extra attention is needed label May 7, 2019

sshniro mentioned this issue Jul 18, 2019

Adding speech to text adapter for Google cloud platform #167

Merged

pietrop closed this as completed Jul 19, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support to Google Speech To Text transcription format #152

Support to Google Speech To Text transcription format #152

eyal13579 commented May 7, 2019 •

edited

Loading

pietrop commented May 7, 2019

sshniro commented Jul 9, 2019 •

edited

Loading

pietrop commented Jul 9, 2019

sshniro commented Jul 18, 2019

pietrop commented Jul 19, 2019

Support to Google Speech To Text transcription format #152

Support to Google Speech To Text transcription format #152

Comments

eyal13579 commented May 7, 2019 • edited Loading

pietrop commented May 7, 2019

sshniro commented Jul 9, 2019 • edited Loading

pietrop commented Jul 9, 2019

sshniro commented Jul 18, 2019

pietrop commented Jul 19, 2019

eyal13579 commented May 7, 2019 •

edited

Loading

sshniro commented Jul 9, 2019 •

edited

Loading