This repository was archived by the owner on Mar 14, 2024. It is now read-only.

Checkpoint Learning Stats #78

Closed
chandlerzuo wants to merge 1 commit into facebookresearch:master from chandlerzuo:export-D15977931


@chandlerzuo
Contributor

@chandlerzuo chandlerzuo commented Jun 28, 2019

Summary:
Currently, when resuming from a failed training run, the learning curve stats history is lost. This is because the learning stats are saved in a temporary path, which is not picked up by the resumed run.

This diff stores the learning curve stats in the checkpoint file, which the resumed run does pick up.
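
For illustration only, here is a minimal sketch of the idea: keep the stats history inside the same file as the rest of the checkpoint so a resumed run reloads both together. It does not reflect PyTorch-BigGraph's actual checkpoint format or API. The JSON layout, the `model_state` and `stats_history` keys, and the `save_checkpoint`, `load_checkpoint`, and `train` helpers are all hypothetical names chosen for the example.

```python
import json
import os
import tempfile
from typing import Any, Dict, List, Tuple

Stats = Dict[str, float]


def save_checkpoint(
    path: str, model_state: Dict[str, Any], stats_history: List[Stats]
) -> None:
    """Write model state and stats history to one file (hypothetical layout)."""
    payload = {"model_state": model_state, "stats_history": stats_history}
    tmp_path = path + ".tmp"
    with open(tmp_path, "w") as f:
        json.dump(payload, f)
    # Replace the old checkpoint in one step so a crash mid-write
    # does not leave a truncated file behind.
    os.replace(tmp_path, path)


def load_checkpoint(path: str) -> Tuple[Dict[str, Any], List[Stats]]:
    """Return (model_state, stats_history); both empty if no checkpoint exists."""
    if not os.path.exists(path):
        return {}, []
    with open(path) as f:
        payload = json.load(f)
    # Older checkpoints written without stats still load, with an empty history.
    return payload["model_state"], payload.get("stats_history", [])


def train(checkpoint_path: str, total_epochs: int) -> List[Stats]:
    """Toy training loop that resumes from the checkpoint if one is present."""
    model_state, stats_history = load_checkpoint(checkpoint_path)
    start_epoch = len(stats_history)
    for epoch in range(start_epoch, total_epochs):
        loss = 1.0 / (epoch + 1)  # placeholder for a real training step
        model_state["last_epoch"] = epoch
        stats_history.append({"epoch": epoch, "loss": loss})
        save_checkpoint(checkpoint_path, model_state, stats_history)
    return stats_history


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as workdir:
        ckpt = os.path.join(workdir, "checkpoint.json")
        train(ckpt, total_epochs=2)            # first run stops after two epochs
        history = train(ckpt, total_epochs=4)  # resumed run continues from there
        print([s["epoch"] for s in history])
```

Because the history travels with the checkpoint, the resumed run in this sketch continues the learning curve from the stored epochs instead of starting an empty one.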

Differential Revision: D15977931

@facebook-github-bot added the CLA Signed label Jun 28, 2019
chandlerzuo added a commit to chandlerzuo/PyTorch-BigGraph that referenced this pull request Aug 14, 2019
Summary:
Pull Request resolved: facebookresearch#78

Currently, when resuming from a failed training, learning curve stats history is lost.

This diff adds the learning curve stats in the checkpoint file.

Differential Revision: D15977931

fbshipit-source-id: 8b562a540a6ebb5354f824f8bdf59767c8525505
chandlerzuo added a commit to chandlerzuo/PyTorch-BigGraph that referenced this pull request Sep 12, 2019
Summary:
Pull Request resolved: facebookresearch#78

Currently, when resuming from a failed training, learning curve stats history is lost.

This diff adds the learning curve stats in the checkpoint file.

Differential Revision: D15977931

fbshipit-source-id: bb28d9342f52640df9d6478aa979e666825d0c67
chandlerzuo added a commit to chandlerzuo/PyTorch-BigGraph that referenced this pull request Sep 13, 2019
Summary:
Pull Request resolved: facebookresearch#78

Currently, when resuming from a failed training, learning curve stats history is lost.

This diff adds the learning curve stats in the checkpoint file.

Reviewed By: lerks

Differential Revision: D15977931

fbshipit-source-id: d32c7d758cb03a742ed04999a1c169aec620e75f
chandlerzuo added a commit to chandlerzuo/PyTorch-BigGraph that referenced this pull request Sep 13, 2019
Summary:
Pull Request resolved: facebookresearch#78

Currently, when resuming from a failed training, learning curve stats history is lost.

This diff adds the learning curve stats in the checkpoint file.

Reviewed By: lerks

Differential Revision: D15977931

fbshipit-source-id: 39308d85efe8f0921231f58b2f96747732a169a0
Summary:
Pull Request resolved: facebookresearch#78

Currently, when resuming from a failed training, learning curve stats history is lost.

This diff adds the learning curve stats in the checkpoint file.

Reviewed By: lerks

Differential Revision: D15977931

fbshipit-source-id: 1427f447ee388ec38d8aa49f6b568acbc1a8f4d4
@facebook-github-bot
Contributor

This pull request has been merged in f0dd9a4.
