Goodness of Pronunciation Pipelines for OOV Removal
-
Updated
Jan 1, 2023 - Perl
Goodness of Pronunciation Pipelines for OOV Removal
A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.
Voxceleb1 i-vector based speaker recognition system
THEANO-KALDI-RNNs is a project implementing various Recurrent Neural Networks (RNNs) for RNN-HMM speech recognition. The Theano Code is coupled with the Kaldi decoder.
This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and decoding are performed with Kaldi. The current implementation supports dropout and batch normalization. An example for phoneme recognition using the standard TIMIT dataset is provided.
Add a description, image, and links to the kaldi topic page so that developers can more easily learn about it.
To associate your repository with the kaldi topic, visit your repo's landing page and select "manage topics."