-
Notifications
You must be signed in to change notification settings - Fork 1.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Training IWSLT on CPU #2
Comments
I think SimpleLossCompute should work fine on CPU if you have enough memory. Do you get an error? There is a variant you could use, where you split into chunks like MultiGPULossCompute, but do not use data parallel. Let me know if SimpleLossCompute fails |
it seems like But adapted
|
Did you figure this out? I would like to leave it open. |
Unfortunately
For some reason it doesn't like calculating I've tried to adapt |
I tackled the same problem and found the following codes worked.
After
(I changed BATCH_SIZE for my environment.)
NOTE: I just checked the script doesn't return errors, so I'm not sure whether the training goes well or not (I mean I didn't check the performance of a trained model). |
Hello!
Thank you very much for your contribution.
I wonder how to adapt the code in order to train a model on IWSLT data on my PC without GPUs.
It seems like
MultiGPULossCompute
should be replaced inrun_epoch
, butSimpleLossCompute
doesn't seem like an appropriate candidate.I would appreciate any hint.
The text was updated successfully, but these errors were encountered: