Skip to content

Conversation

@anuragarnab
Copy link
Contributor

What does this PR do?

Fix bug in the case where job was preempted after the temporary checkpoint was written, but before it was renamed to the final checkpoint name.

Currently, this will result in an "InvalidCheckpointError" being thrown.

Fixes # (issue)

Checklist

  • This PR fixes a minor typo or improves the docs (you can dismiss the other
    checks if that's the case).
  • This change is discussed in a Github issue/
    discussion (please add a
    link).
  • The documentation and docstrings adhere to the
    documentation guidelines.
  • This change includes necessary high-coverage tests.
    (No quality testing = no merge!)

Fix bug in the case where job was preempted after the temporary checkpoint was written, but before it was renamed to the final checkpoint name.
@google-cla google-cla bot added the cla: yes label Apr 5, 2021
Copy link
Member

@jheek jheek left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm surprised this didn't pop up before. Thanks for the fix!

@copybara-service copybara-service bot merged commit 95409ec into google:master Apr 7, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants