Download speech datasets (English and non-English) for Automatic Speech Recognition
-
Updated
Jan 22, 2023 - Jupyter Notebook
Download speech datasets (English and non-English) for Automatic Speech Recognition
Fine-Tune Whisper for Italian ASR with transformers
The project,being part of Kagglex BIPOC Mentorship Program final project, aims to train two separate Hindi ASR models using the Facebook Wav2Vec2 (300M parameters) and OpenAI Whisper-Small models, respectively. The goal is to compare their performance, with a target WER of less than 13%, across various Hindi accents and dialects.
Analysis and Viewer for Mozilla Common Voice Datasets
Automatic Subtitle Generation for Bengali Multimedia Using Deep Learning.
WepApp for examining Common Voice metadata
A Streamlit web application for Voice recognition using a pre-trained speech embedding model.
Add a description, image, and links to the common-voice-dataset topic page so that developers can more easily learn about it.
To associate your repository with the common-voice-dataset topic, visit your repo's landing page and select "manage topics."