Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Converting generated MEL spectrograms to Tacotron 2 format, in order to use a different vocoder? #217

Open
serg06 opened this issue Nov 28, 2020 · 0 comments

Comments

@serg06
Copy link

serg06 commented Nov 28, 2020

The WaveRNN vocoder is too slow for me (I'm running it on the CPU), so I want to hook up a different vocoder to the Tacotron for inferences.

MelGAN and HiFi-GAN are both very fast vocoders, and they both have pre-trained Tacotron2 models which I can use with Tacotron2 mel spectrograms without issues, but when I try inference Tacotron1 mels, the output is completely off.

It looks like converting Tacotron 2 mels -> Tacotron 1 mels is done like so or like so, so is there maybe some way to reverse this process to get Tacotron 2 mels?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant