Universal-TTS

A Universal Multi-Speaker Multi-Style Text-to-Speech via Disentangled Representation Learning based on Rényi Divergence Minimization

Dipjyoti Paul^a, Sankar Mukherjee^b, Yannis Pantazis^c and Yannis Stylianou^a

^aComputer Science Department, University of Crete

^cIstituto Italiano di Tecnologia, Italy

^cInst. of Applied and Computational Mathematics, Foundation for Research and Technology - Hellas

Abstract:

In this paper, we present a universal multi-speaker, multi-style Text-to-Speech (TTS) synthesis system which is able to generate speech from text with speaker characteristics and speaking style similar to a given reference signal. Training is conducted on non-parallel data and generates voices in an unsupervised manner, i.e., neither style annotation nor speaker label are required. To avoid leaking content information into the style embeddings (referred to as "content leakage") and leaking speaker information into style embeddings (referred to as "style leakage") we suggest a novel Rényi Divergence based Disentangled Representation framework through adversarial learning. Similar to mutual information minimization, the proposed approach explicitly estimates via a variational formula and then minimizes the Rényi divergence between the joint distribution and the product of marginals for the content-style and style-speaker pairs. By doing so, content, style and speaker spaces become representative and (ideally) independent of each other. Our proposed system greatly reduces content leakage by improving the word error rate by approximately 17-19% relative to the baseline system. In MOS-speech-quality, the proposed algorithm achieves an improvement of about 16-20% whereas MOS-style-similarly boost up 15% relative performance.

Audio Samples:

Audio samples can be found in here.

Scripts:

Code will be coming soon!!

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
README.md		README.md
_config.yml		_config.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Universal-TTS

Abstract:

Audio Samples:

Scripts:

About

Releases

Packages

dipjyoti92/Universal-TTS

Folders and files

Latest commit

History

Repository files navigation

Universal-TTS

Abstract:

Audio Samples:

Scripts:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages