sync AveragedModel buffers when use_buffers=False #84054
Conversation
✅ No Failures (18 Pending) as of commit 8dea46b (more details on the Dr. CI page). 💚 Looks good so far! There are no failures yet. This comment was automatically generated by Dr. CI. Please report bugs/suggestions to the (internal) Dr. CI Users group.
Sorry for the delay in this review @RangiLyu, thanks for doing this!
/easycla As part of the transition to the PyTorch Foundation, this project now requires contributions be covered under the new CLA. See #85559 for additional details. This comment will trigger a new check of this PR. If you are already covered, you will simply see a new "EasyCLA" check that passes. If you are not covered, a bot will leave a new comment with a link to sign.
Hi sorry--since it's approved, feel free to comment with `@pytorchbot merge`
@pytorchbot merge
Merge started. Your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
Merge failed. Reason: This PR is too stale; the last push date was more than 3 days ago. Please rebase and try again. You can rebase by leaving the following comment on this PR: `@pytorchbot rebase`
@pytorchbot rebase
You don't have permissions to rebase this PR; only people with write permissions may rebase PRs.
@pytorchbot rebase
@pytorchbot successfully started a rebase job. Check the current status here.
Successfully rebased 8dea46b to ad4cd0e.
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/84054. Note: Links to docs will display an error until the docs builds have been completed. ✅ No Failures as of commit ad4cd0e. This comment was automatically generated by Dr. CI and updates every 15 minutes.
@pytorchbot merge
Just set up the job so I didn't forget. Sorry about that @RangiLyu, and thanks for the PR.
Merge started. Your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
Hey @RangiLyu.
Fixes #84053
As described in the issue, `AveragedModel` deep copies the model during initialization, which means that the buffers in the averaged model cannot be updated together with the source model. One solution is to copy the buffers from the source model every time `update_parameters` is called.
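
For illustration, here is a minimal, hypothetical usage sketch of the intended effect (this is not the patch itself, and it assumes a PyTorch build that includes this fix): with `use_buffers=False`, `update_parameters` also copies the source model's buffers into the averaged model, so running statistics such as BatchNorm's stay in sync instead of remaining frozen at their deep-copied values.

```python
import torch
import torch.nn as nn
from torch.optim.swa_utils import AveragedModel

# The wrapped module contains a BatchNorm layer, whose running
# statistics live in buffers (running_mean, running_var, ...).
model = nn.Sequential(nn.Linear(4, 4), nn.BatchNorm1d(4))
swa_model = AveragedModel(model, use_buffers=False)

# A few forward passes in train mode update BatchNorm's running stats
# on the source model only.
model.train()
for _ in range(3):
    model(torch.randn(8, 4))

# With this change, update_parameters() also copies the source model's
# buffers into the averaged model (rather than leaving them stale).
swa_model.update_parameters(model)

# The averaged model's buffers should now match the source model's.
for b_avg, b_src in zip(swa_model.module.buffers(), model.buffers()):
    assert torch.equal(b_avg, b_src)
```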