GitHub - ArrushTandon/DSST: A Speech-to-text transcription Model

This Is a Domain Specified Speech Translation Script. It helps in trancripting a audio into text by saving time. It uses Whisper Algorithm, which was Developed by OpenAI, Whisper is a speech recognition model designed for robustness and accuracy across a wide variety of audio data. It has the lowest CER, which is tested and fine tuned using huge amount of LibriSpeech Datasets. [ test-clean Dataset CER - 0.000312535 ] [ test-other Dataset CER - 0.000821592 ] [ train-clean-100 Dataset CER - 0.0001720000 ] [ NPTEL Dataset CER - 0.278767957 ] For Verifying the results, You can run the script by changing the directory according to own setup.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
.idea		.idea
.venv		.venv
Output		Output
README.md		README.md
Robot_Instruction_NLP.py		Robot_Instruction_NLP.py
Robot_Instruction_results.json		Robot_Instruction_results.json
nptel_transcription.py		nptel_transcription.py
whisper_transcribe_cer.py		whisper_transcribe_cer.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Releases

Packages

Languages

ArrushTandon/DSST

Folders and files

Latest commit

History

Repository files navigation

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages