Do not flatten states when use_orig_params is True and sharding is NO_SHARD #100189
Conversation
[ghstack-poisoned]
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/100189
Note: Links to docs will display an error until the docs builds have been completed.
✅ No failures as of commit 90a560f. This comment was automatically generated by Dr. CI and updates every 15 minutes.
ghstack-source-id: 524ba5724383bd5f4dd7e2a9a6a6fc1de6166d88
Pull Request resolved: #100189
This looks good to me. Maybe @fegin has an opinion about the unit test organization.
```python
)
optim = torch.optim.Adam(model.parameters(), lr=1e-2)

def step():
```
nit: Curious, why do we define step() here if we only call it once?
Copied from other unit tests :)
I think it should be OK; it is neat to use a function.
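For context, the pattern under discussion looks roughly like this; the module, shapes, and learning rate below are stand-ins for illustration, not the test's actual code:

```python
import torch
import torch.nn as nn

# Stand-ins for the FSDP-wrapped model and optimizer used in the test.
model = nn.Linear(4, 4)
optim = torch.optim.Adam(model.parameters(), lr=1e-2)

def step():
    # Wrapping the training step in a function mirrors the other unit
    # tests in the file, even though it is only invoked once here.
    optim.zero_grad()
    model(torch.randn(2, 4)).sum().backward()
    optim.step()

step()
```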
@pytorchbot merge
Merge started. Your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
When use_orig_params is True and the sharding strategy is NO_SHARD, parameters and their states are not flattened, so optimizer states should not be flattened either. The added unit test fails without this fix.
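A minimal sketch of the configuration this fix targets; the single-process gloo setup and the toy module are assumptions for illustration, not part of the PR:

```python
import os
import torch
import torch.distributed as dist
import torch.nn as nn
from torch.distributed.fsdp import FullyShardedDataParallel as FSDP, ShardingStrategy

# Single-process setup so the sketch can run without torchrun (assumption).
os.environ.setdefault("MASTER_ADDR", "127.0.0.1")
os.environ.setdefault("MASTER_PORT", "29500")
dist.init_process_group("gloo", rank=0, world_size=1)

# With use_orig_params=True and NO_SHARD, FSDP keeps the original
# (unflattened) parameters.
model = FSDP(
    nn.Linear(8, 8),
    sharding_strategy=ShardingStrategy.NO_SHARD,
    use_orig_params=True,
)
optim = torch.optim.Adam(model.parameters(), lr=1e-2)

# One training step so Adam materializes its per-parameter states.
model(torch.randn(4, 8)).sum().backward()
optim.step()

# FSDP-aware optimizer state dict; since the parameters were never
# flattened in this configuration, the fix keeps the optimizer states
# unflattened as well.
osd = FSDP.optim_state_dict(model, optim)

dist.destroy_process_group()
```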
Stack from ghstack (oldest at bottom):