Kid-Whisper

Training code for Kid-Whisper, an adaptation of Whisper for children's speech. View paper

Abstract

Recent advancements in Automatic Speech Recognition (ASR) systems, exemplified by Whisper, have demonstrated the potential of these systems to approach human-level performance given sufficient data. However, this progress doesn’t readily extend to ASR for children due to the limited availability of suitable child-specific databases and the distinct characteristics of children’s speech. A recent study investigated leveraging the My Science Tutor (MyST) children’s speech corpus to enhance Whisper’s performance in recognizing children’s speech. They were able to demonstrate some improvement on a limited testset. This paper builds on these findings by enhancing the utility of the MyST dataset through more efficient data preprocessing. We reduce the Word Error Rate (WER) on the MyST testset from 13.93% to 9.11% with Whisper-Small and from 13.23% to 8.61% with Whisper-Medium and show that this improvement can be generalized to unseen datasets. We also highlight important challenges towards improving children’s ASR performance. The results showcase the viable and efficient integration of Whisper for effective children’s speech recognition.
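For readers unfamiliar with the metric, WER is the word-level edit distance (substitutions, deletions, insertions) between a hypothesis and its reference transcript, divided by the number of reference words. The snippet below is a minimal sketch of that definition, not the code in `calculate_wer.py`; the function names `word_error_rate` and `relative_reduction` are hypothetical, and no text normalization is applied, so scores from it may differ from the figures reported above.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


def relative_reduction(before: float, after: float) -> float:
    """Fraction by which an error rate dropped, relative to its starting value."""
    return (before - after) / before


if __name__ == "__main__":
    # One substitution (sat -> sit) and one deletion (the) over six words.
    print(word_error_rate("the cat sat on the mat", "the cat sit on mat"))
    # Relative WER reduction implied by the Whisper-Small figures above.
    print(round(relative_reduction(13.93, 9.11), 3))
```

Applied to the numbers in the abstract, `relative_reduction` gives roughly a 35% relative drop in WER for both Whisper-Small and Whisper-Medium.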

Contributors and authors

Ahmed Adel Attia, Jing Liu, Wei Ai, Dorottya Demszky, Carol Espy-Wilson

Model Checkpoints

You can find the model checkpoints on my Hugging Face account.

Acknowledgments

Inspiration, code snippets, etc.

Training script adapted from the Hugging Face tutorial
Transcription and evaluation script adapted from an OpenAI code snippet
