kaldi-asr/kaldi is the official location of the Kaldi project.
-
Updated
May 30, 2024 - Shell
kaldi-asr/kaldi is the official location of the Kaldi project.
Offline private voice assistant for many human languages
Kaldi-based Korean ASR (한국어 음성인식) open-source project
Phonetisaurus G2P
Dockerfile for compiling Kaldi for Android.
A list of publically available audio data that anyone can download for ASR or other speech activities
How to create your own model for vosk
Code samples to Get started quickly with Symbl's Voice SDK and APIs: Node.js, JavaScript, WebSockets, & PSTN.
Code related to the Dutch instance and user groups of the KALDI speech recognition toolkit
Non-blocking Asterisk modules for accessing VoiceKit services for speech recognition and speech synthesis.
Top level code to transcribe English audio/video files into text/subtitles
Open source offline speech recognition for Android using Mozilla's DeepSpeech in Termux
This repository contains my attempt to use two famous speech recognition frameworks (Kaldi, CMU Sphinx4) for Arabic Language using the publicly-available dataset "Arabic Corpus of Isolated Words"
Long audio alignment using Kaldi
PyTorch implementation of Listen Attend and Spell Automatic Speech Recognition (ASR).
OLAMI API Quickstart cURL Samples (in bash)
Video Summarization - Summarized a video lecture and converted it to a slideshow using Speech-to-text, Keyword extraction and OpenCV Shot detection.
the first industrial-scale open-source Kazakh speech corpus. KSC2 corpus subsumes the previously introduced two corpora: KSC and KazakhTTS2 and supplements additional data from other sources. KSC2 contains around 1.2k hours of high-quality transcribed data comprising over 600k utterances.
Kaldi-based audio-visual speech recognition
Add a description, image, and links to the speech-recognition topic page so that developers can more easily learn about it.
To associate your repository with the speech-recognition topic, visit your repo's landing page and select "manage topics."