Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improve Speech-To-Text (ML Approach) #16

Open
chidiewenike opened this issue Oct 17, 2020 · 1 comment
Open

Improve Speech-To-Text (ML Approach) #16

chidiewenike opened this issue Oct 17, 2020 · 1 comment
Assignees
Labels
enhancement New feature or request help wanted Extra attention is needed

Comments

@chidiewenike
Copy link
Collaborator

chidiewenike commented Oct 17, 2020

Objective

The DeepSpeech Speech-To-Text system needs to be improved to handle uncommon & non-English words. The machine learning approach is to retrain the DeepSpeech model with new audio data and analyze the results.

Key Result

Using the run_stt function of stream_deepspeech.py, return a string of audio input that is correctly transcribed.

def run_stt(time_len=TIME_LEN):

Details

Correctly transcribe all QA pairs from the question-answer pairs Google Sheet. To train a new DeepSpeech model, you can follow these instructions.

You will need the following DeepSpeech model and DeepSpeech scorer to use run_stt.

If in need of assistance, please ask @chidiewenike

Additional Resources

@chidiewenike chidiewenike added enhancement New feature or request help wanted Extra attention is needed labels Oct 17, 2020
@gwholland3 gwholland3 self-assigned this Oct 25, 2020
@snekiam
Copy link
Member

snekiam commented Nov 1, 2020

Here's a (growing) list of words we're having issues with: https://docs.google.com/spreadsheets/d/1rcomLifXhAaMo0zFzv36f1OoWHmShwe3gQKmV-uE71Q/edit?usp=sharing

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request help wanted Extra attention is needed
Projects
None yet
Development

No branches or pull requests

4 participants