🧑‍🎤 Expressive Text-to-Speech

This is a repository forked from Coqui-AI (🐸TTS ) used to research about expressive TTS in our AI-Unicamp-CPQD group. The original codes are kept in "main" branch which is not our default visualization.

Here we keep the "unicamp' branch as our main branch, while "main" branch remains as the original and updated. You can see here the original README.md.

🔍 About the group

We are an expressive TTS research group located at Unicamp and CPQD (Brazil).

🔨 Implementations

Expressive Models

Tacotron 2
Fastpitch

Expressive Datasets

EMOVDB
IEMOCAP
ESD

Style Encoders

Look-Up
Reference Encoder (Coarse/Fine-Grained)
GST
VAE
VQ-VAE
VAE+Flow
Diffusion

Disentanglement Blocks

Style Classifier
Speaker Classifier + GRL (Gradient Reversal Layer)

Style Reference Features

Pitch
Energy
Mel-Spectrogram

Agregation Types

Sum, Concat or AdaIN

Enhancing Losses

Orthogonal Loss
CLIP Loss
Cycle consistency Loss(*)

Name		Name	Last commit message	Last commit date
Latest commit History 4,393 Commits
.github		.github
TTS		TTS
debug		debug
docs		docs
images		images
notebooks		notebooks
recipes		recipes
tests		tests
.cardboardlint.yml		.cardboardlint.yml
.compute		.compute
.dockerignore		.dockerignore
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
.pylintrc		.pylintrc
.readthedocs.yml		.readthedocs.yml
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CODE_OWNERS.rst		CODE_OWNERS.rst
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
LICENSE.txt		LICENSE.txt
MANIFEST.in		MANIFEST.in
Makefile		Makefile
README.md		README.md
Unicamp_branch_doc.txt		Unicamp_branch_doc.txt
__init__.py		__init__.py
build_image.sh		build_image.sh
hubconf.py		hubconf.py
pyproject.toml		pyproject.toml
recod_container.sh		recod_container.sh
requirements-recod.txt		requirements-recod.txt
requirements.dev.txt		requirements.dev.txt
requirements.notebooks.txt		requirements.notebooks.txt
requirements.tf.txt		requirements.tf.txt
requirements.txt		requirements.txt
run_bash_tests.sh		run_bash_tests.sh
setup.cfg		setup.cfg
setup.py		setup.py

License

AI-Unicamp/TTS

Folders and files

Latest commit

History

Repository files navigation

🧑‍🎤 Expressive Text-to-Speech

🔍 About the group

🔨 Implementations

Expressive Models

Expressive Datasets

Style Encoders

Disentanglement Blocks

Style Reference Features

Agregation Types

Enhancing Losses

About

Resources

License

Code of conduct

Stars

Watchers

Forks

Languages