🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
-
Updated
May 24, 2024 - Python
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
🧠 Leon is your open-source personal assistant.
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Faster Whisper transcription with CTranslate2
Speech recognition module for Python, supporting several engines and APIs, online and offline.
A PyTorch-based Speech Toolkit
End-to-End Speech Processing Toolkit
A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Production First and Production Ready End-to-End Speech Recognition Toolkit
On-device wake word detection powered by deep learning
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
Machine Learning Resources, Practice and Research
End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
Lingvo
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks
Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.
中文语音识别; Mandarin Automatic Speech Recognition;
Add a description, image, and links to the speech-recognition topic page so that developers can more easily learn about it.
To associate your repository with the speech-recognition topic, visit your repo's landing page and select "manage topics."