-
Notifications
You must be signed in to change notification settings - Fork 22.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
PicklingError encountered when using multiple GPUs #67681
Comments
Potentially the problem came from your implementation of Dataset based on your trace. Could you please post more detail about it so we can reproduce the error? |
Thank you @ejguan for your quick response. I believe this error is related to multiprocessing used in |
Something is calling
>>> import pickle, sh
>>> pickle.dumps(sh.glob('*'))
Traceback (most recent call last):
File "<stdin>", line 1, in <module>
_pickle.PicklingError: Can't pickle <class 'sh.GlobResults'>: attribute lookup GlobResults on sh failed
>>> pickle.dumps(list(sh.glob('*')))
b'...' |
@ecederstrand Thank you again for your solution. This is the exact reason why there would be a pickling error. I have solved the issue by following your code snippet. |
Seems like this issue is resolved, so I will close this. @TotalVariation let us know if there is any further issue or suggestion for changes. |
🐛 Bug
Hello,
I was using Slurm-based HPC with multiple GPUs to train deep learning models.
I believe this pickle error was caused by multiprocessing but I have little knowledge about this. Thank you so much.
Environment
cc @ssnl @VitalyFedyunin @ejguan @NivekT
The text was updated successfully, but these errors were encountered: