Code for converting speech data into text using encoder-decoder model.
-
Updated
Nov 12, 2017 - Python
Code for converting speech data into text using encoder-decoder model.
This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and decoding are performed with Kaldi. The current implementation supports dropout and batch normalization. An example for phoneme recognition using the standard TIMIT dataset is provided.
readers that enable reading kaldi ark in tensorflow
This is the repository for my version of Kaldi for Dummies example.
Contains code for Speaker Recognition.
Create a speech recognition system for programming by voice using Kaldi
This is a Kaldi tutorial for beginners
Keyword Search Recipe for Subword ASR
📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.
scripts to align a given wave to its transcription using trained models by Kaldi
This code repo is in reference to the Medium Article for setting up Kaldi on AWS
The repository contains all the codes necessary for my project - Automatic Speech Recognition System in Hindi Language ( Project description is available at :- https://kunal-dhawan.weebly.com/asr-system-for-hindi-language-from-scratch.html) : It contains the code for the following systems - 1) Monophone-HMM system built using HTK toolkit , 2)Mon…
This repository contains my attempt to use two famous speech recognition frameworks (Kaldi, CMU Sphinx4) for Arabic Language using the publicly-available dataset "Arabic Corpus of Isolated Words"
Add a description, image, and links to the kaldi-asr topic page so that developers can more easily learn about it.
To associate your repository with the kaldi-asr topic, visit your repo's landing page and select "manage topics."