🚀 AI voice cloning: clone a voice in 5 seconds to generate arbitrary speech in real-time
Updated Nov 15, 2024 - Python
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Just 1 minute of voice data is enough to train a good TTS model (few-shot voice cloning).
A generative speech model for daily dialogue.
Instant voice cloning by MIT and MyShell.
End-to-End Speech Processing Toolkit
🧠 Leon is your open-source personal assistant.
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
Translate a video from one language to another and add dubbing, with support for speech recognition transcription, speech synthesis, and subtitle translation.
DeepMind's Tacotron-2 Tensorflow implementation
eSpeak NG is an open source speech synthesizer that supports more than a hundred languages and accents.
😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for TensorFlow 2 (supports English, French, Korean, Chinese, and German, and is easy to adapt to other languages)
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. A demo is available at https://plachtaa.github.io/vallex/
MARY TTS -- an open-source, multilingual text-to-speech synthesis system written in pure Java
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
WaveRNN Vocoder + TTS
Multilingual large voice generation model with full-stack support for inference, training, and deployment.
Foundational model for human-like, expressive TTS
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine