Convert images to audio using ViT, GPT, and FastSpeech
An Android application that acts as a speaking assistant for hearing-impaired people.
Homework for deep generation. Combines FastSpeech2 with different vocoders. References (modified from the original repos): https://github.com/ming024/FastSpeech2 https://github.com/NVIDIA/waveglow https://github.com/mindslab-ai/univnet https://github.com/jik876/hifi-gan
An Android application that allows visually impaired people to hear which bus lines are passing next to them.
This repository contains the code for the main part of my master's thesis in Data Science & Engineering at Politecnico di Torino.
Multi-speaker FastSpeech2 applicable to Korean, with detailed descriptions of training and synthesis.
Refactored version of https://github.com/ming024/FastSpeech2
Unofficial implementation of ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech
Use FastSpeech2 and HiFi-GAN to easily perform end-to-end Korean speech synthesis.
An implementation of FastSpeech2 based on PyTorch.
LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search
Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech ✊
An implementation of Microsoft's "AdaSpeech: Adaptive Text to Speech for Custom Voice"
A non-autoregressive end-to-end text-to-speech system (text-to-wav), supporting a family of SOTA unsupervised duration modeling methods. This project grows with the research community, aiming to achieve the ultimate E2E-TTS
Continues training on the Biaobei dataset while improving the original FastSpeech2 model by introducing prosody representations and a prosody prediction module, making Chinese pronunciation more vivid and rhythmic
A non-autoregressive Transformer-based text-to-speech system, supporting a family of SOTA transformers with supervised and unsupervised duration modeling methods. This project grows with the research community, aiming to achieve the ultimate TTS
Chinese Mandarin text-to-speech (TTS) with FastSpeech 2, implemented in PyTorch, using WaveGlow as the vocoder, with the Biaobei and AISHELL-3 datasets
PAddle PARAllel text-to-speech toolKIT (supporting Tacotron2, Transformer TTS, FastSpeech2/FastPitch, SpeedySpeech, WaveFlow and Parallel WaveGAN)
😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for TensorFlow 2 (supports English, French, Korean, Chinese, and German, and is easy to adapt to other languages)