Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Tacotron + HIFI GAN Fine tuned: Sounds distorted. #130

Open
Mixomo opened this issue Nov 1, 2022 · 0 comments
Open

Tacotron + HIFI GAN Fine tuned: Sounds distorted. #130

Mixomo opened this issue Nov 1, 2022 · 0 comments

Comments

@Mixomo
Copy link

Mixomo commented Nov 1, 2022

Hello, this is something very strange that has been happening to me more and more frequently.
When fine tuning a tacotron model, the end result sounds distorted.
I have trained it for more than 5k steps (as indicated in the notebook I am using and will leave below).
The dataset is about 40 minutes with very good quality audios. all are in 22 khz, mono, 16 bits.
Tacotron could train it without problems, but HIFI GAN could not.

This is the notebook: https://colab.research.google.com/github/justinjohn0306/FakeYou-Tacotron2-Notebook/blob/main/FakeYou_HiFi_GAN_Fine_Tuning.ipynb?authuser=1#scrollTo=teF-Ut8Z7Gjp

This is the demo distorted audio: https://drive.google.com/file/d/1cuqfWGS1JmSMNlcnyd_PaH3Atv-PPxvB/view?usp=share_link

This is the original dataset audio sample: https://drive.google.com/file/d/1ReqoxwHSRfu3D186jhQynCJXPQ1vhWZx/view?usp=share_link

This is what I get when I synthesize:
image

Thank you!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant