Using train_sr for second stage results in out of memory #73
In case you need more information, here is my environment: $ pip3 list
I am wondering which CUDA, driver, and CuPy versions you use, so I can rebuild the environment. Since the recommendations mention a version below 7.0.0, I installed CUDA 10.0 with Cupy100==5.4.0 (I do not see a CuPy version 6). It still gave an out-of-memory error, although training somehow took longer. I am on Ubuntu 18.04, and I could not install drivers lower than 450XX.
I want to add that I had 12 .npy files after the spectrogram pairs step, 55 MB in total (around 4 MB each on average). I removed a few .npy files, reducing the set to 7 files, and it no longer crashes. Is it supposed to be like that? It looks a bit odd, as it now uses 7.4 GB with 7 files. I might be able to squeeze an 8th file in, but isn't it better the more you have? It probably has something to do with CuPy/Chainer; when I looked further into this, their pages say to do it in batches, but I am not sure whether that is possible with your code. I'll await your reply to see what you recommend. (I couldn't use too few files, as it would complain about an epoch division by zero, but more than 8 eats all 11 GB of my video card's memory and crashes.)
Try lowering the batch size here: become-yukarin/recipe/config_sr.json, line 26 in 99a4998
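For anyone who prefers to script the change, here is a minimal sketch of lowering a batch-size value in a JSON config file. It only uses Python's standard `json` module. The key name `"batch_size"` in the usage note is a hypothetical placeholder, since the actual key on line 26 of `config_sr.json` is not shown in this thread; check the file and pass the real key name.

```python
import json
from pathlib import Path


def set_key_everywhere(node, key, value):
    """Set every occurrence of `key` in a nested JSON structure; return the count."""
    changed = 0
    if isinstance(node, dict):
        for name in node:
            if name == key:
                node[name] = value
                changed += 1
            else:
                changed += set_key_everywhere(node[name], key, value)
    elif isinstance(node, list):
        for item in node:
            changed += set_key_everywhere(item, key, value)
    return changed


def lower_batch_size(config_path, key, new_value):
    """Rewrite the config file with `key` set to `new_value`.

    `key` is whatever name the config uses for the batch size
    (an assumption here; it is not confirmed by the thread).
    """
    path = Path(config_path)
    config = json.loads(path.read_text(encoding="utf-8"))
    changed = set_key_everywhere(config, key, new_value)
    if changed == 0:
        raise KeyError(f"{key!r} not found in {path}")
    path.write_text(json.dumps(config, indent=2) + "\n", encoding="utf-8")
    return changed
```

Assuming the key were called `batch_size`, a call would look like `lower_batch_size("recipe/config_sr.json", "batch_size", 5)`. Editing the file by hand works just as well; the point is only that a smaller batch should reduce how much GPU memory each training step needs.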
I will close this issue for now, but please reopen it if you need anything else.
Thanks, it helped! I'm now using about 20 files with the batch size set to 5.
After trying second-stage training, I get an out-of-memory error:
It seems to allocate too much, as there is hardly anything using the videocard memory: