dataset size / audio generation #1

markhanslip · 2020-06-04T14:21:00Z

Hi, thanks for the implementation :)
Do you have suggestions for how long the input audio chunks should be and what size the dataset should be?
At the moment I'm getting weird negative discriminator loss readings, which I wondered might be because of not enough / inappropriately segmented data. I was working with ~50mins of 4 second wav chunks. Also should the input audio data be downsampled to 16000?
I've only trained as far as ~5000 epochs and at the moment there are only glitches at the very beginning and end of the generated waveform with flat lines in between - is this a normal thing for early iterations that improves in later epochs, or is something wrong?
Sorry for multiple questions!
thanks again,
Mark

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

dataset size / audio generation #1

dataset size / audio generation #1

markhanslip commented Jun 4, 2020

dataset size / audio generation #1

dataset size / audio generation #1

Comments

markhanslip commented Jun 4, 2020