New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Tacotron 2 #9
Comments
Hello, Do you mean sampling rate of the waveform from which you extract the target acoustic features for Tacotron2 and input acoustic features for the neural waveform model? ==== For waveform sampling rate:
==== For the frame rate of acoustic feature:
In all: if there is mismatch, you can retrain either Tacotron2 or the neural waveform model. Hope it clarifies. |
Hi thank you for your response now I do understand. One more question please. I notice that the input for NSF tensor size is [1,81,n] while the output of tacotron is [1,80,n] . Is that extra size because of pitch sequence? |
Yes, exactly. NSF requires pitch input (as the source signal) |
Oh thank you for your response I will close this issue |
Hi, thank you, for your great Job. I wondering should I retrain Tacotron 2 with the same sample rate if I want to feed output from Tacotron 2 to this project?
The text was updated successfully, but these errors were encountered: