Create saving-and-loading-Pytorch-checkpoints.rst #1364
Thanks for the PR, @Linardos! Looks good overall; I've added a few small suggestions. Could you re-compile the docs and check that everything looks good?
Co-authored-by: Daniel J. Beutel <daniel@adap.com>
Changes committed! I look forward to your thoughts after running the code. I hope we can resolve the issue.
@Linardos, lightning fast! Did you have a chance to build the docs and check if the formatting looks good?
Yep, I used a VS Code plugin that let me build them as I worked, and your changes were small, so it was easy to check quickly.
Reference Issues/PRs
What does this implement/fix? Explain your changes.
This PR adds a minimal example for storing a PyTorch checkpoint. It includes several extra steps compared to the original saving-progress example, and it also adds a way to load the latest checkpoint.
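As a rough illustration of the "load the latest checkpoint" step, here is a minimal sketch that only locates the newest checkpoint file on disk. The `model_round_<N>.pth` naming scheme and the `latest_checkpoint` helper are assumptions made for this sketch, not necessarily what the PR uses.

```python
import re
from pathlib import Path
from typing import Optional

# Hypothetical naming scheme (an assumption): model_round_<N>.pth
_ROUND_PATTERN = re.compile(r"model_round_(\d+)\.pth$")


def latest_checkpoint(directory: str) -> Optional[Path]:
    """Return the checkpoint file with the highest round number, or None."""
    best_round = -1
    best_path = None
    for path in Path(directory).iterdir():
        match = _ROUND_PATTERN.search(path.name)
        if match is None:
            continue  # skip files that are not checkpoints
        round_number = int(match.group(1))
        # Compare numerically so that round 10 sorts after round 2.
        if round_number > best_round:
            best_round = round_number
            best_path = path
    return best_path
```

The returned path could then be passed to `torch.load`, and the result to the model's `load_state_dict`. Selecting by round number instead of file timestamp keeps the choice stable if files are copied or touched.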
Any other comments?
This was developed with Daniel's help. We thought the issue was resolved, but apparently performance drops when the checkpoint is loaded, as if training had started from scratch. We need to debug this further.