Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Model checkpointing #234

Merged
merged 9 commits into from
Oct 13, 2023
Merged

Conversation

dhpitt
Copy link
Collaborator

@dhpitt dhpitt commented Oct 11, 2023

This PR implements two model checkpointing callbacks. One simply saves a model every N epochs, and the other monitors a metric from eval_losses to determine when to save a model.

Copy link
Member

@JeanKossaifi JeanKossaifi left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good overall @dhpitt !

neuralop/training/trainer.py Outdated Show resolved Hide resolved
neuralop/training/callbacks.py Outdated Show resolved Hide resolved
@JeanKossaifi JeanKossaifi merged commit 5da5869 into neuraloperator:main Oct 13, 2023
1 check passed
@dhpitt dhpitt deleted the model_checkpointing branch October 30, 2023 21:54
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants