WaveRNN modifications? #60

mrgloom · 2019-07-22T11:55:02Z

Does this repo use the vanilla version of WaveRNN (https://github.com/fatchord/WaveRNN), or was the architecture modified and the model retrained?

It's quite fast relative to my benchmark of tacotron2 + wavernn from https://github.com/erogol/WaveRNN, but the voice (maybe) sounds less natural.

BTW, I haven't tried the new universal vocoder version.


m-toman · 2019-07-23T17:09:17Z

From what I've seen, the most significant differences are the 16 kHz sampling rate and a slightly smaller window size for the batched synthesis.

mrgloom · 2019-07-23T17:30:09Z

I wonder what the lower limit of sample_rate is for reasonable speech quality. Can it be tested just by resampling with something like ffmpeg?
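
A rough way to try this is sketched below. It only builds ffmpeg command lines that downsample a file to a few candidate rates and then resample back to a common rate for side-by-side listening. The file names, the candidate rates, and the helper name `build_resample_commands` are assumptions for illustration; `-ar` is ffmpeg's audio sample rate option. Note that this only shows what is lost by band-limiting existing audio, which is probably not the same as the quality of a vocoder trained at that rate.

```python
import shlex


def build_resample_commands(src, rates, reference_rate=16000):
    """Build ffmpeg argument lists for a downsample/upsample round trip.

    For each candidate rate, one command writes a downsampled copy of `src`
    and a second command resamples that copy back to `reference_rate`, so
    all results can be compared in the same player at the same rate.
    Output file names are hypothetical.
    """
    commands = []
    for rate in rates:
        low = f"down_{rate}.wav"
        back = f"roundtrip_{rate}.wav"
        # -y overwrites existing outputs, -ar sets the output sample rate.
        commands.append(["ffmpeg", "-y", "-i", src, "-ar", str(rate), low])
        commands.append(["ffmpeg", "-y", "-i", low, "-ar", str(reference_rate), back])
    return commands


# Hypothetical input file and candidate rates.
commands = build_resample_commands("sample.wav", [8000, 11025, 16000])
for cmd in commands:
    print(shlex.join(cmd))
```

Each printed line can be pasted into a shell, or the argument lists can be passed to `subprocess.run(cmd, check=True)` if ffmpeg is installed.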

CorentinJ · 2019-07-23T23:43:27Z

> From what I've seen, the most significant differences are the 16 kHz sampling rate and a slightly smaller window size for the batched synthesis.

This is correct. I haven't changed anything else other than the data loader.

CorentinJ closed this as completed Jul 23, 2019
