New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Error while saving with EarlyStoppingCallback #29157
Comments
pip3 install deepspeed==0.13.1 is work for me. pip3 install deepspeed==0.13.2 |
Wow thanks @webbigdata-jp ! |
This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread. Please note that issues that do not follow the contributing guidelines are likely to be ignored. |
The error still occur in my situation even with deepspeed==0.13.1 and transformers==4.37.2 Could anyone help? |
|
problem fix. I shouldn't log a tensor object using self.log (working with wandb). After converting my logging variable from tensor to python object via xx.item() solve my problem |
System Info
transformers
version: 4.38.0.dev0 (also in 4.38.0 and 4.39.0.dev0)Who can help?
@muellerzr and @pacman100
Information
Tasks
examples
folder (such as GLUE/SQuAD, ...)Running a standard Causal LM training routine.
Reproduction
This error has appeared in the last few days, likely due to some recent change.
Error is fixed by either rolling back to transformers version 4.37.2 or remove the early stopping callback.
Here's the stack trace:
Expected behavior
No error with 4.38.0.dev0 transformers version.
The text was updated successfully, but these errors were encountered: