Has anyone tested the inference process using a trained model? #5
Comments
60k? The paper says 580k. The model has not converged yet. |
@hcwu1993 Even if the model is not well trained, it should generate some noise rather than all-zero values. |
In my case, audible speech already appears at 52k, but I use a smaller model: 256 channels instead of the default 512. |
@Arbaletos Do you use distributed.py? What is your loss value? At 56k, mine is -18. Thanks. |
@azraelkuan Have you tried training all the way until convergence? If you're still getting noise after the indicated number of iterations, please provide the following information for reproducibility of your case:
Also, which distributed approach are you using - default PyTorch DDP or Apex DDP? The latter can be found here. Are you doing fp16 or fp32 training? |
The code does not include FP16 training. |
@rafaelvalle I may have found the problem. Lines 51 to 52 in f4c04e2
The output audio is float32 with values between -1 and 1, so when we cast it to int16, all the values become zero.
|
Yes, that's probably it. In the code we load the audio with scipy and divide it by 2^15. |
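For anyone hitting the same symptom, here is a minimal sketch of the effect described above. It is not the repository's code: the constant name `MAX_WAV_VALUE`, the helper `float_to_int16`, and the synthetic sine wave standing in for model output are all assumptions made for illustration. It only shows that casting float audio in [-1, 1] straight to int16 truncates every sample to zero, while scaling by 2^15 first keeps the waveform.

```python
import numpy as np

# Assumed scaling constant for 16-bit PCM (2**15); the name is illustrative.
MAX_WAV_VALUE = 32768.0


def float_to_int16(audio: np.ndarray) -> np.ndarray:
    """Scale float audio in [-1, 1] to the int16 range before casting."""
    scaled = audio * MAX_WAV_VALUE
    # Clip so that a sample of exactly +1.0 does not overflow past 32767.
    clipped = np.clip(scaled, -32768, 32767)
    return clipped.astype(np.int16)


# Hypothetical stand-in for model output: a float32 sine wave in [-0.5, 0.5].
t = np.linspace(0.0, 1.0, 22050, endpoint=False, dtype=np.float32)
audio = (0.5 * np.sin(2 * np.pi * 440.0 * t)).astype(np.float32)

# Direct cast truncates toward zero, so every sample with |x| < 1 becomes 0.
naive = audio.astype(np.int16)

# Scaling first preserves the waveform in the int16 range.
fixed = float_to_int16(audio)

print("peak after direct cast:", np.abs(naive).max())
print("peak after scaling:", np.abs(fixed).max())
```

If the audio was divided by 2^15 when it was loaded for training, the model's output lives in the same normalized range, so the write step has to undo that scaling (or the normalization has to be dropped at load time and the model retrained, which is the other way to make the two sides agree).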
Yes, I removed the max_audio_value and retrained it. Thanks. |
@azraelkuan just wondering if your fix worked, and if so, can you please explain exactly how you fixed it? Thank you! |
I have trained a model for about 60k iterations. When I test inference.py using the checkpoint waveglow_0, the wav is all noise. But when I use the trained model (60k), the generated wav is almost all zeros; there is nothing in the wav.
Does anyone have this problem?