Implementation of Transfer Learning from Speaker Verification to Multi-speaker Text-To-Speech Synthesis (SV2TTS) in Persian language.
-
Updated
Jun 3, 2024 - Python
Implementation of Transfer Learning from Speaker Verification to Multi-speaker Text-To-Speech Synthesis (SV2TTS) in Persian language.
Preparation and processing of data for tacotron2.
Tacotron-Korean-Tensorflow2 for ubuntu
Pytorch implementation of Tacotron 2 (https://arxiv.org/pdf/1712.05884.pdf)
Catalan Text to Speech
Code used in conjunction with an implementation of a Seq2Seq LSTM TTS frontend, to process and evaluate Google Research's Wikipedia Homograph Dataset (WHD) and LibriSpeech data, with the aim of improving the TTS frontend's homograph disambiguation abilities.
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
This repository contains the code and resources associated with my Bachelor's Thesis. The project evaluates the performance of various automatic speaker verification (ASV) systems against identity spoofing attacks generated using text-to-speech (TTS) synthesis technologies.
Speech synthesis with conditioning on very small dataset. Using Nvidia's Tacotron2 and WaveGlow models with Pytorch.
Converting text to audio and applying audio augmentation
speech synthesis - common voice polish dataset.
Training Tacotron2 for Persian language as a Persian text-to-speech
A dataset for Mario's voice (Charles Martinet), from the Super Mario franchise. More info here: https://uberduck.ai/about
This repository contain the code of the main part of my master thesis degree at Politecnico di Torino in Data science & Engineering
Add a description, image, and links to the tacotron2 topic page so that developers can more easily learn about it.
To associate your repository with the tacotron2 topic, visit your repo's landing page and select "manage topics."