[FSDP][2/N] Remove `params_with_grad` (#87480)
Conversation
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/87480
Note: Links to docs will display an error until the docs builds have been completed. ✅ No Failures, 3 Pending as of commit d69725e. This comment was automatically generated by Dr. CI and updates every 15 minutes.
ghstack-source-id: 0a98d576c4887b164322c01170aad00c927a43f9 Pull Request resolved: #87480
This PR removes the property `params_with_grad` from `FullyShardedDataParallel`. It was introduced when implementing `clip_grad_norm_()` but was not consistently used. Personally, I do not think it makes sense for `FullyShardedDataParallel` to expose this helper because it is not a common paradigm. This PR is technically BC-breaking. However, I checked that no one internally is using this API.
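For anyone who did rely on the removed helper, the same filtering is easy to reproduce on the caller side. A minimal sketch follows; the `Param` class here is a hypothetical stand-in for `torch.nn.Parameter` so the snippet runs without PyTorch installed. With a real module you would filter `module.parameters()` the same way.

```python
class Param:
    """Hypothetical stand-in for torch.nn.Parameter; only .grad matters here."""
    def __init__(self, grad=None):
        self.grad = grad

def params_with_grad(params):
    # Mirrors what the removed FSDP property computed: keep only the
    # parameters whose gradient has been populated (is not None).
    return [p for p in params if p.grad is not None]

# Two of the three parameters below have a populated gradient.
params = [Param(grad=0.5), Param(), Param(grad=-1.0)]
print(len(params_with_grad(params)))  # 2
```

Because this is a one-line comprehension over `parameters()`, keeping it as a dedicated property on `FullyShardedDataParallel` added API surface without saving callers meaningful work.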
Sounds good. Please add the BC-breaking label for release tracking purposes.
@pytorchbot merge
Merge started. Your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
Pull Request resolved: pytorch#87480. Approved by: https://github.com/rohan-varma
Stack from ghstack:
- #87480 [FSDP][2/N] Remove `params_with_grad`
- #87479 [FSDP][1/N] Rework `clip_grad_norm_()` and tests
cc @ezyang @gchanan