An automatic speech recognition API
-
Updated
Jul 4, 2024 - Python
An automatic speech recognition API
SEPIA server to support open-source speech recognition via WebSocket connection.
Implementing the paper -
Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
Create and adapt n-gram and JSGF language models, e.g. for Kaldi-ASR nnet3 chain models from Zamia-Speech
Chinese text normalization for speech processing
Nevis is a (sort of) all in one speech transcription library backed by Kaldi ASR.
This is a fork of PyTorch-Kaldi, a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit. This repo adds support to use a GAN front-end for an ASR acoustic model.
Tool to transform data from Nemo/Deepspeech format to Kaldi as described here — https://kaldi-asr.org/doc/data_prep.html
simple version of our torch kaldi toolkit, developed at the LIA by 2 apprentices. (@Chaanks & @vbrignatz)
Neural speaker recognition/verification system based on Kaldi and Tensorflow
This code repo is in reference to the Medium Article for setting up Kaldi on AWS
Create a speech recognition system for programming by voice using Kaldi
Contains code for Speaker Recognition.
Code for converting speech data into text using encoder-decoder model.
Add a description, image, and links to the kaldi-asr topic page so that developers can more easily learn about it.
To associate your repository with the kaldi-asr topic, visit your repo's landing page and select "manage topics."