All you need to start fine-tuning SpeechBrain models using the Ubenwa CryCeleb dataset!
This code was used for CryCeleb2023 HuggingFace challenge
It reproduces the official baseline model training
train.ipynb
- main code for data preparation and fine-tuning with various configs
evaluate.ipynb
- example of scoring with pre-trained and fine-tuned model
Note that default configurations are optimized for speed and simplicity rather than accuracy
@article{ubenwa2023cryceleb,
title={CryCeleb: A Speaker Verification Dataset Based on Infant Cry Sounds},
author={David Budaghyan and Charles C. Onu and Arsenii Gorin and Cem Subakan and Doina Precup},
year={2023},
journal={preprint arXiv:2305.00969},
}
@misc{speechbrain,
title={{SpeechBrain}: A General-Purpose Speech Toolkit},
author={Mirco Ravanelli and Titouan Parcollet and Peter Plantinga and Aku Rouhe and Samuele Cornell and Loren Lugosch and Cem Subakan and Nauman Dawalatabad and Abdelwahab Heba and Jianyuan Zhong and Ju-Chieh Chou and Sung-Lin Yeh and Szu-Wei Fu and Chien-Feng Liao and Elena Rastorgueva and François Grondin and William Aris and Hwidong Na and Yan Gao and Renato De Mori and Yoshua Bengio},
year={2021},
eprint={2106.04624},
archivePrefix={arXiv},
primaryClass={eess.AS},
note={arXiv:2106.04624}
}