mod_google_transcribe: Feature Request: Support for `google::cloud::speech::v2` #149

entenschnabel · 2023-11-14T09:42:43Z

Hi,

I was wondering whether you had any plans to support google::cloud::speech::v2 in addition to or instead of google::cloud::speech::v1p1beta1 inside mod_google_transcribe. If you're interested I might soon have a PR to share regarding this but I just wondered whether you had plans to do this anyway.

The text was updated successfully, but these errors were encountered:

davehorton · 2023-11-14T12:47:52Z

I would love some information about v2 - what it adds, what it breaks etc. Last I looked, some time ago, it seemed to lack some things that were in v1beta1, but I may have been mistaken. If you can point me to some docs that would be appreciated

entenschnabel · 2023-11-16T12:13:06Z

What actually nudged us into thinking about v2 was the migration of the v1 models to the conformer-based models which is due to take place in January. Although it's not going to break the v1 API, it's perhaps an indication that we would need to look at v2 sooner or later, anyway. There's some more details on that here: Migrate from classic to conformer models.

So far, the changes which have been most noticeable to me are the "single utterance" concept and the way multiple languages are dealt with. It seems that "single utterance" is no longer offered as a configuration parameter but is implicitly set if using the "latest_short" model. It's actually described here.
RecognitionConfig no longer has an alternative_language_codes field. Instead it just has a language_codes field so I would be interested to see how effective it is at being able to automatically deduce which language is being spoken.

Anyway, you can read more about it in the reference documentation: google.cloud.speech.v2

entenschnabel changed the title ~~mod_google_transcribe: Feature Request: Support for google::cloud::speech::v2~~ mod_google_transcribe: Feature Request: Support for google::cloud::speech::v2 Nov 16, 2023

entenschnabel mentioned this issue Feb 21, 2024

Add Support for Google Cloud Speech-To-Text v2 in mod_google_transcribe #164

Open

ajgolledge mentioned this issue Mar 20, 2024

Add Support for Google Cloud Speech-To-Text V2 library in mod_google_transcribe jambonz/freeswitch-modules#23

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

mod_google_transcribe: Feature Request: Support for `google::cloud::speech::v2` #149

mod_google_transcribe: Feature Request: Support for `google::cloud::speech::v2` #149

entenschnabel commented Nov 14, 2023

davehorton commented Nov 14, 2023

entenschnabel commented Nov 16, 2023

mod_google_transcribe: Feature Request: Support for google::cloud::speech::v2 #149

mod_google_transcribe: Feature Request: Support for google::cloud::speech::v2 #149

Comments

entenschnabel commented Nov 14, 2023

davehorton commented Nov 14, 2023

entenschnabel commented Nov 16, 2023

mod_google_transcribe: Feature Request: Support for `google::cloud::speech::v2` #149

mod_google_transcribe: Feature Request: Support for `google::cloud::speech::v2` #149