-
Notifications
You must be signed in to change notification settings - Fork 8.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
WaveRNN modifications? #60
Comments
What I've seen the most significant differences are 16kHz sampling rate and a slightly smaller window size for the batched synthesis. |
I wonder what is lower limit of sample_rate for reasonable speech quality? Can it be tested just by resampling by something like ffmpeg?
|
This is correct. I haven't changed anything else other than the data loader. |
Is this repo uses vanilla version of WaveRNN (https://github.com/fatchord/WaveRNN)? or architecture was modified and model was retrained?
It's quite fast relative to my benchmark of tacotron2 + wavernn from here https://github.com/erogol/WaveRNN but (maybe) have less natural voice.
BTW I don't tried new
universal vocoder
version.The text was updated successfully, but these errors were encountered: