Improve Speech-To-Text (ML Approach) #16

chidiewenike · 2020-10-17T01:34:34Z

Objective

The DeepSpeech speech-to-text system needs to handle uncommon and non-English words better. The machine learning approach is to retrain the DeepSpeech model on new audio data and analyze the results.

Key Result

Using the run_stt function of stream_deepspeech.py, return a string that correctly transcribes the audio input.

swanton/stream_deepspeech.py

Line 16 in b8e5502

def run_stt(time_len=TIME_LEN):

Details

Correctly transcribe all QA pairs from the question-answer pairs Google Sheet. To train a new DeepSpeech model, you can follow these instructions.
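One way to make "correctly transcribed" measurable is to score each transcript against the expected text from the sheet with a word error rate (WER). The sketch below is only an illustration: `normalize` and `word_error_rate` are hypothetical helper names, not part of this repo or of DeepSpeech, and the normalization (lowercasing, dropping punctuation) is an assumption about what should count as a match.

```python
import re


def normalize(text):
    """Lowercase and drop punctuation so formatting differences are not counted as errors."""
    return re.sub(r"[^\w' ]+", " ", text.lower()).split()


def word_error_rate(reference, hypothesis):
    """Word-level edit distance between the two texts, divided by the reference length."""
    ref = normalize(reference)
    hyp = normalize(hypothesis)
    if not ref:
        return 0.0 if not hyp else 1.0

    # prev[j] holds the edit distance between the reference words seen so far
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)
```

A score of 0.0 means the transcript matches the expected text after normalization. Tracking the average over all QA pairs before and after retraining gives a simple way to compare models.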

You will need the following DeepSpeech model and DeepSpeech scorer to use run_stt.
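For quick offline checks of a retrained model, something like the following may be handy. It is a minimal sketch, assuming the `deepspeech` Python package (0.7 or later) and `numpy` are installed; `read_pcm16` and `transcribe_wav` are hypothetical helpers and are not how `run_stt` itself is implemented. The file paths are supplied by the caller, and the recording is assumed to be 16-bit mono at the model's sample rate.

```python
import wave


def read_pcm16(path):
    """Return (raw PCM bytes, sample rate) from a 16-bit mono WAV file."""
    with wave.open(path, "rb") as wav:
        if wav.getsampwidth() != 2 or wav.getnchannels() != 1:
            raise ValueError("expected 16-bit mono audio")
        return wav.readframes(wav.getnframes()), wav.getframerate()


def transcribe_wav(wav_path, model_path, scorer_path):
    """Transcribe one WAV file with the given model and scorer files."""
    # Imported here so read_pcm16 stays usable without these packages installed.
    import numpy as np
    from deepspeech import Model

    model = Model(model_path)
    model.enableExternalScorer(scorer_path)

    pcm, rate = read_pcm16(wav_path)
    if rate != model.sampleRate():
        raise ValueError(f"audio is {rate} Hz, model expects {model.sampleRate()} Hz")

    # DeepSpeech takes a 1-D array of 16-bit samples.
    return model.stt(np.frombuffer(pcm, dtype=np.int16))
```

Running this on the same recordings with the original and the retrained model files makes it easy to compare transcripts side by side.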

If you need assistance, please ask @chidiewenike.

Additional Resources

DeepSpeech GitHub repo: https://github.com/mozilla/DeepSpeech
Training the model: https://medium.com/visionwizard/train-your-own-speech-recognition-model-in-5-simple-steps-512d5ac348a5
Learning Git: https://git-scm.com/book/en/v2

snekiam · 2020-11-01T21:29:49Z

Here's a (growing) list of words we're having issues with: https://docs.google.com/spreadsheets/d/1rcomLifXhAaMo0zFzv36f1OoWHmShwe3gQKmV-uE71Q/edit?usp=sharing

chidiewenike added the enhancement and help wanted labels Oct 17, 2020

gwholland3 self-assigned this Oct 25, 2020

chidiewenike assigned Braden50 Oct 29, 2020
