Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add option to select STT engine per project #191

Merged
merged 4 commits into from
Jun 21, 2023
Merged

Conversation

Ashesh3
Copy link
Member

@Ashesh3 Ashesh3 commented Jun 20, 2023

Frontend: ohcnetwork/ayushma_fe#47

This PR adds the ability to select the speech-to-text (STT) engine per Ayushma project. Previously, Ayushma was using Whisper as the default STT engine.

With this change:

A new stt_engine field is added to Project model, with choices of "google" and "whisper". The default is still "whisper".
The converse API now uses the selected STT engine to transcribe audio.
A new STT_API_KEY environment variable is introduced for Google Cloud STT API key.
A speech_to_text.py utils file is added to contain the logic for different STT engines.
Related migrations, model and serializer changes are done.
This adds more flexibility by allowing projects to choose between Whisper and Google Cloud for speech transcription.

Once merged, new projects can choose between whisper and google STT engines. Existing projects will continue using whisper by default.

@vercel
Copy link

vercel bot commented Jun 20, 2023

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name Status Preview Comments Updated (UTC)
ayushma ✅ Ready (Inspect) Visit Preview 💬 Add feedback Jun 20, 2023 10:45am

ayushma/models/project.py Outdated Show resolved Hide resolved
@shivankacker shivankacker merged commit 027a4c7 into master Jun 21, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Archived in project
Development

Successfully merging this pull request may close these issues.

2 participants