
Has anyone tested the inference process using a trained model? #5

Closed
azraelkuan opened this issue Nov 9, 2018 · 10 comments

@azraelkuan
Contributor

I have trained a model for about 60k iterations.
When I run inference.py with the initial checkpoint waveglow_0, the generated wav is all noise.
But when I use the trained model (60k), the generated wav is almost all zeros; there is nothing in it.
Has anyone else had this problem?

@hcwu1993 commented Nov 9, 2018

60k? The paper says 580k iterations; the model has not converged yet.

@azraelkuan
Contributor Author

@hcwu1993 Even if the model is not well trained, it should generate some noise, not all-zero values.

@Arbaletos

In my case, audible speech already appears at 52k, but I use a smaller model: 256 channels instead of the default 512.

@azraelkuan
Contributor Author

@Arbaletos Do you use distributed.py? What is your loss value at 56k? Mine is -18. Thanks.

@mkolod commented Nov 9, 2018

@azraelkuan Have you tried training all the way until convergence? If you're still getting noise after the indicated number of iterations, please provide the following information for reproducibility of your case:

  1. Driver, GPU type and VBIOS version
    nvidia-smi --query-gpu=gpu_name,vbios_version,driver_version --format=csv

  2. PyTorch build (pip package, from source (if so, git hash), Docker image, etc.)

Also, which distributed approach are you using: default PyTorch DDP or Apex DDP? The latter can be found in NVIDIA's Apex repository (https://github.com/NVIDIA/apex).

Are you doing fp16 or fp32 training?
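
For context, a minimal sketch of how the two wrappers are typically applied. The model and launch setup below are placeholders, not this repo's actual training code, and it assumes a launcher (e.g. torch.distributed.launch) that sets the rendezvous environment variables:

    import os
    import torch
    from torch.nn.parallel import DistributedDataParallel as TorchDDP
    # from apex.parallel import DistributedDataParallel as ApexDDP  # Apex variant

    local_rank = int(os.environ.get("LOCAL_RANK", 0))
    torch.cuda.set_device(local_rank)
    torch.distributed.init_process_group(backend="nccl")

    model = torch.nn.Linear(80, 80).cuda()  # stand-in for the WaveGlow model

    # Default PyTorch DDP: gradients are all-reduced during backward().
    model = TorchDDP(model, device_ids=[local_rank])

    # Apex DDP uses the current CUDA device instead of taking device_ids:
    # model = ApexDDP(model)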

@rafaelvalle (Contributor) commented Nov 9, 2018

The code does not include FP16 training.

@azraelkuan
Contributor Author

@rafaelvalle I may have found the problem.
I load the wav with librosa.load rather than scipy, so the data read in is float in [-1, 1] rather than int16 values. That means in these lines:

waveglow/inference.py

Lines 51 to 52 in f4c04e2

audio = audio.cpu().numpy()
audio = audio.astype('int16')

the output audio is float32 values between -1 and 1, so casting to int16 truncates every value to zero.
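
The failure is easy to reproduce with numpy alone; a minimal sketch (the 2**15 factor matches the int16 full-scale value used elsewhere in the repo):

    import numpy as np

    # Inference output in librosa's convention: float32 in [-1, 1].
    audio = np.array([0.25, -0.5, 0.9], dtype=np.float32)

    # The bare cast truncates every |value| < 1 toward zero -> silence:
    print(audio.astype('int16'))            # [0 0 0]

    # Rescaling to the int16 range first preserves the signal:
    print((audio * 2**15).astype('int16'))  # [ 8192 -16384  29491]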

@rafaelvalle (Contributor) commented Nov 10, 2018

Yes, that's probably it. In the code we load the audio with scipy and divide it by 2^15.
Then, during inference we multiply the output by 2^15.
In your setup, try multiplying your inference output by 2^30.
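
To spell the arithmetic out with a sketch (assuming the training pipeline was left unchanged, i.e. it still divides the already-normalized librosa output by 2**15):

    import numpy as np

    # librosa.load already returns floats in [-1, 1].
    librosa_audio = np.array([0.5, -0.25], dtype=np.float32)

    # Dividing again by 2**15 during training shrinks the targets to
    # [-2**-15, 2**-15], so the model learns to produce that tiny range.
    model_output = librosa_audio / 2**15

    # At inference, one factor of 2**15 undoes the extra division and a second
    # factor of 2**15 maps [-1, 1] onto the int16 range: 2**15 * 2**15 = 2**30.
    print((model_output * 2**30).astype('int16'))  # [ 16384 -8192 ]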

@azraelkuan
Contributor Author

Yes, I removed the max_audio_value scaling and retrained. Thanks.

@HashiamKadhim

@azraelkuan Just wondering if your fix worked; if so, can you please explain exactly how you fixed it?

Thank you!
