🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
-
Updated
Aug 16, 2024 - Python
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.
Ultrafast GAN based Vocoder for Text to Speech
Speech synthesis (TTS) in low-resource languages by training from scratch with Fastpitch and fine-tuning with HifiGan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
RADTTS + HiFiGAN vocoder
TTS for Arabic (FastPitch) in the ONNX format
SA-toolkit: Speaker speech anonymization toolkit in python
homework for deep generation. Combine FastSpeech2 with different vocoders ⭐REFERENCE (modify origin repos): https://github.com/ming024/FastSpeech2 https://github.com/NVIDIA/waveglow https://github.com/mindslab-ai/univnet https://github.com/jik876/hifi-gan
Add a description, image, and links to the hifigan topic page so that developers can more easily learn about it.
To associate your repository with the hifigan topic, visit your repo's landing page and select "manage topics."