🚀 AI voice cloning: clone a voice in 5 seconds to generate arbitrary speech in real-time
Updated Jul 6, 2024 - Python
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
SoftVC VITS Singing Voice Conversion
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Officially maintained, supported by PaddlePaddle, including CV, NLP, Speech, Rec, TS, big models and so on.
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
ModelScope: bring the notion of Model-as-a-Service to life.
Foundational model for human-like, expressive TTS
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Lingvo
Data manipulation and transformation for audio signal processing, powered by PyTorch
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by PyTorch, while feature extraction, label computation, and decoding are performed with the Kaldi toolkit.
WaveNet vocoder
Python library and CLI tool to interface with Google Translate's text-to-speech API
MARS5 speech model (TTS) from CAMB.AI
Noise suppression using deep filtering
OpenAI Whisper ASR Webservice API