Adding notes on memory requirements? #3
Comments
Can you try decreasing the batch size to 1, and then decreasing the RNN hidden size to, say, 128? 2GB is really very little.
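To give a feel for why these two knobs matter, here is a rough back-of-the-envelope sketch in Python. It is not taken from the project's code: it assumes a standard LSTM with four gates per layer and 32-bit floats, and the function names (`lstm_param_count`, `lstm_activation_floats`, `to_mb`) and the sequence length of 16 are made up for illustration. It only counts the recurrent part and ignores any other layers and framework overhead, so treat the numbers as relative, not absolute.

```python
def lstm_param_count(input_size, hidden_size, num_layers=1):
    """Weights and biases of a standard LSTM stack (4 gates per layer)."""
    total = 0
    for layer in range(num_layers):
        in_size = input_size if layer == 0 else hidden_size
        # Per gate: input weights, recurrent weights, and one bias vector.
        per_gate = hidden_size * in_size + hidden_size * hidden_size + hidden_size
        total += 4 * per_gate
    return total


def lstm_activation_floats(batch_size, seq_len, hidden_size, num_layers=1):
    """Floats kept for backprop: 4 gate outputs + cell + hidden state per step."""
    return batch_size * seq_len * num_layers * 6 * hidden_size


def to_mb(num_floats, bytes_per_float=4):
    """Convert a float count to mebibytes, assuming 32-bit floats by default."""
    return num_floats * bytes_per_float / 1024 ** 2


if __name__ == "__main__":
    # Hypothetical settings: (batch size, hidden size), sequence length fixed at 16.
    for batch_size, hidden in [(16, 512), (1, 128)]:
        params = lstm_param_count(hidden, hidden)
        acts = lstm_activation_floats(batch_size, 16, hidden)
        print(f"batch={batch_size} hidden={hidden}: "
              f"params {to_mb(params):.2f} MB, activations {to_mb(acts):.2f} MB")
```

The point of the sketch is the scaling: activation storage grows linearly with both batch size and hidden size, while the recurrent weights grow roughly with the square of the hidden size, so shrinking both at once cuts the recurrent footprint considerably.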
The CPU error is because you're using my pretrained checkpoint, which was trained on GPU. Try running with CPU from scratch, should work ok.
adding yes, i just noticed the CPU/GPU thing. i tried moving the
ok, you also want to make the batch size as large as you can fit, by the way, and you should expect to probably decrease the learning rate (if training).
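One way to find "as large as you can fit" is to start high and halve on failure. The sketch below is a generic illustration, not part of this project: `largest_fitting_batch_size` and `try_batch` are hypothetical names, and it assumes the trial step signals an out-of-memory condition by raising Python's `MemoryError`. Real frameworks usually raise their own error types, so the `except` clause would need adapting.

```python
def largest_fitting_batch_size(try_batch, start=64, minimum=1):
    """Halve the batch size until try_batch(batch_size) stops raising MemoryError.

    try_batch is a caller-supplied function (hypothetical here) that runs one
    forward/backward step at the given batch size and raises MemoryError if
    the step does not fit in memory.
    """
    batch_size = start
    while batch_size >= minimum:
        try:
            try_batch(batch_size)
            return batch_size
        except MemoryError:
            # Did not fit: retry with half the batch size.
            batch_size //= 2
    raise MemoryError(f"even batch size {minimum} does not fit")


if __name__ == "__main__":
    # Stand-in for a real training step: pretend anything above 10 runs out of memory.
    def fake_step(batch_size):
        if batch_size > 10:
            raise MemoryError

    print(largest_fitting_batch_size(fake_step))
```

Halving keeps the number of failed attempts small; once a size fits, the learning rate can then be tuned for that batch size, as suggested above.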
i don't understand how you have time to be so helpful & encouraging, and work on a phd at the same time. thank you so much! :) edit: for the record,
I was trying to train a CPU-based model and encountered the same OOM error on Ubuntu. I tried adjusting batch_size and it works perfectly now. Thanks for the detailed explanation!
I'm going to test this on a real computer tomorrow, but testing today on the 2GB GPU on my laptop I get an out of memory error with the 600MB pre-trained model.
I tried shutting everything else down in hope that 2GB was almost enough to run the model, but it doesn't seem to help (or even change the error message).
I tried running off the CPU using combinations of `-gpuid -1` and `-backend nn`, but I get different errors. Here are all the errors, in order: