speech-recognition

Star

Here are 1,950 public repositories matching this topic...

huggingface / transformers

Star

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Updated May 24, 2024
Python

leon-ai / leon

Star

🧠 Leon is your open-source personal assistant.

Updated May 24, 2024
Python

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Updated May 23, 2024
Python

m-bain / whisperX

Star

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

speech speech-recognition speech-to-text whisper asr

Updated May 16, 2024
Python

SYSTRAN / faster-whisper

Star

Faster Whisper transcription with CTranslate2

deep-learning inference transformer speech-recognition openai speech-to-text quantization whisper

Updated May 20, 2024
Python

Uberi / speech_recognition

Star

Speech recognition module for Python, supporting several engines and APIs, online and offline.

audio python speech-recognition speech-to-text

Updated May 16, 2024
Python

speechbrain / speechbrain

Star

A PyTorch-based Speech Toolkit

Updated May 22, 2024
Python

espnet / espnet

Star

End-to-End Speech Processing Toolkit

deep-learning chainer end-to-end machine-translation pytorch speech-synthesis speech-recognition kaldi voice-conversion speaker-diarization speech-separation speech-enhancement spoken-language-understanding speech-translation singing-voice-synthesis

Updated May 24, 2024
Python

nl8590687 / ASRT_SpeechRecognition

Star

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

python tensorflow keras cnn python3 speech-recognition speech-to-text ctc chinese-speech-recognition asrt

Updated Apr 15, 2024
Python

alibaba-damo-academy / FunASR

Star

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

pytorch speech-recognition vad punctuation whisper audio-visual-speech-recognition speaker-diarization voice-activity-detection conformer pretrained-model rnnt dfsmn paraformer speechgpt speechllm

Updated May 23, 2024
Python

wenet-e2e / wenet

Star

Production First and Production Ready End-to-End Speech Recognition Toolkit

pytorch transformer speech-recognition automatic-speech-recognition production-ready whisper asr conformer e2e-models

Updated May 24, 2024
Python

Picovoice / porcupine

Star

On-device wake word detection powered by deep learning

speech-recognition hotword-detection keyword-spotting handsfree wake-word-detection on-device hotword-detector hotword trigger-word-detection keyword-spotter wake-word voice-activation wake-word-engine

Updated May 2, 2024
Python

huggingface / distil-whisper

Star

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

audio speech-recognition whisper

Updated May 21, 2024
Python

yanshengjia / ml-road

Sponsor

Star

Machine Learning Resources, Practice and Research

nlp machine-learning computer-vision deep-learning tensorflow pytorch speech-recognition

Updated Apr 4, 2024
Python

zzw922cn / Automatic_Speech_Recognition

Star

End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow

audio deep-learning tensorflow paper end-to-end evaluation cnn lstm speech-recognition rnn automatic-speech-recognition feature-vector data-preprocessing phonemes timit-dataset layer-normalization rnn-encoder-decoder chinese-speech-recognition

Updated Mar 24, 2023
Python

tensorflow / lingvo

Star

Lingvo

nlp research translation tensorflow machine-translation speech distributed tts speech-synthesis mnist speech-recognition lm seq2seq speech-to-text gpu-computing language-model asr

Updated May 21, 2024
Python

mravanelli / pytorch-kaldi

Star

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.