[FSDP] Fix `nn.Parameter` usage for 2D and `use_orig_params=True` #89782

awgu · 2022-11-28T20:42:37Z

Stack from ghstack (oldest at bottom):

-> [FSDP] Fix nn.Parameter usage for 2D and use_orig_params=True #89782

This ensures that all elements of `FlatParameter._params` and `FlatParameter._shared_params` are `nn.Parameter`s (as expected). This was violated by the local tensor of a `DTensor` when using 2D parallelism. To fix the breakage, we simply wrap with `nn.Parameter` if needed.
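For readers unfamiliar with the pattern, here is a minimal sketch of "wrap with `nn.Parameter` if needed". It is not the code from this PR: the helper name `as_parameter` is hypothetical, and a plain `torch.Tensor` stands in for the local tensor of a `DTensor`.

```python
import torch
import torch.nn as nn


def as_parameter(tensor: torch.Tensor) -> nn.Parameter:
    """Hypothetical helper: return `tensor` as-is if it is already an
    nn.Parameter, otherwise wrap it in one."""
    if isinstance(tensor, nn.Parameter):
        return tensor
    # nn.Parameter aliases the tensor's storage rather than copying it,
    # so the wrapped parameter still views the same underlying data.
    return nn.Parameter(tensor, requires_grad=tensor.requires_grad)


# Assumption: a plain tensor standing in for a DTensor's local tensor.
local = torch.zeros(4)
wrapped = as_parameter(local)
```

Wrapping only when the `isinstance` check fails keeps existing parameters, and their identity, untouched, which matters for anything that tracks parameters by object identity.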

[ghstack-poisoned]

pytorch-bot · 2022-11-28T20:42:40Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/89782

✅ No Failures

As of commit 93cccd0:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

ghstack-source-id: 5830e8be3148aa7fc3514136ae02f81b7417d50e Pull Request resolved: #89782

awgu · 2022-11-28T20:43:10Z

torch/distributed/fsdp/flat_param.py

@@ -1307,7 +1317,10 @@ def _use_unsharded_views(self, as_params: bool) -> None:
                        assert tensor is not None  # mypy
                        param_var = tensor
                setattr(module, param_name, param_var)
-                if self._use_orig_params and self._training_state == HandleTrainingState.FORWARD:
+                if (


This change and everything below it is just ufmt formatting.
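One way to sanity-check that a reformat like this is behavior-preserving is to compare the parsed ASTs of the two spellings. The sketch below uses only the standard library. The one-line form is the removed line from the hunk above; the wrapped form is an assumption about how a formatter might split it (the hunk is cut off after `if (`), and the `pass` bodies are placeholders.

```python
import ast

# The removed line from the hunk, with a placeholder body.
ONE_LINE = (
    "if self._use_orig_params and self._training_state == HandleTrainingState.FORWARD:\n"
    "    pass\n"
)

# Assumed wrapped form; the actual formatter output is not shown in the hunk.
WRAPPED = (
    "if (\n"
    "    self._use_orig_params\n"
    "    and self._training_state == HandleTrainingState.FORWARD\n"
    "):\n"
    "    pass\n"
)


def same_ast(a: str, b: str) -> bool:
    """Return True if two source snippets parse to the same AST.

    ast.dump omits line/column attributes by default, so differences in
    whitespace, line breaks, and redundant parentheses are ignored."""
    return ast.dump(ast.parse(a)) == ast.dump(ast.parse(b))


formatting_only = same_ast(ONE_LINE, WRAPPED)
```

The names inside the snippets are never evaluated, only parsed, so this runs without importing FSDP.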

fduwjj

Thanks for the fix!

awgu · 2022-11-28T20:51:08Z

@pytorchbot rebase -s

pytorchmergebot · 2022-11-28T20:52:59Z

@pytorchbot successfully started a rebase job. Check the current status here

pytorchmergebot · 2022-11-28T20:53:16Z

Successfully rebased gh/awgu/217/orig onto refs/remotes/origin/viable/strict, please pull locally before adding more changes (for example, via ghstack checkout https://github.com/pytorch/pytorch/pull/89782)

ghstack-source-id: 6282e9a1d4fa734eb46ab8a4f9ed6166208a335f Pull Request resolved: #89782

awgu · 2022-11-28T23:29:44Z

@pytorchbot merge

pytorchmergebot · 2022-11-28T23:31:17Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

…torch#89782) This ensures that all elements of `FlatParameter._params` and `FlatParameter._shared_params` are `nn.Parameter`s (as expected). This was violated by the local tensor of a `DTensor` when using 2D parallelism. To fix the breakage, we simply wrap with `nn.Parameter` if needed. Pull Request resolved: pytorch#89782 Approved by: https://github.com/fduwjj

[FSDP] Fix nn.Parameter usage for 2D and use_orig_params=True

6202b7f

[ghstack-poisoned]

awgu requested review from mrshenli, zhaojuanmao, pritamdamania87, rohan-varma, H-Huang and kwen2501 as code owners November 28, 2022 20:42

pytorch-bot bot added the release notes: distributed (fsdp) label Nov 28, 2022

awgu added the topic: not user facing label Nov 28, 2022

awgu added a commit that referenced this pull request Nov 28, 2022

[FSDP] Fix nn.Parameter usage for 2D and use_orig_params=True

0a368e5

ghstack-source-id: 5830e8be3148aa7fc3514136ae02f81b7417d50e Pull Request resolved: #89782

fduwjj approved these changes Nov 28, 2022

awgu added the ciflow/trunk label Nov 28, 2022

pytorchmergebot pushed a commit that referenced this pull request Nov 28, 2022

[FSDP] Fix nn.Parameter usage for 2D and use_orig_params=True

42282c5

ghstack-source-id: 6282e9a1d4fa734eb46ab8a4f9ed6166208a335f Pull Request resolved: #89782

pytorchmergebot added the Merged label Nov 28, 2022

pytorchmergebot closed this in 943acd4 Nov 28, 2022

awgu mentioned this pull request Nov 29, 2022

[FSDP] Another fix for DTensor, use_orig_params=True #89845

Closed