
[FSDP] Allow MixedPrecision to skip inputs #90620

Closed
wants to merge 6 commits

Conversation


pytorch-bot bot commented Dec 10, 2022

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/90620

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 Failure

As of commit 85a007b:

The following jobs have failed:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@pytorch-bot pytorch-bot bot added the release notes: distributed (fsdp) release notes category label Dec 10, 2022
mrshenli added a commit that referenced this pull request Dec 10, 2022
ghstack-source-id: a6d90b935b2d22509be120e82277a5130e178572
Pull Request resolved: #90620
Member

@rohan-varma rohan-varma left a comment


LGTM, but curious: what's the use case for skipping inputs?

@mrshenli
Contributor Author

Thanks @rohan-varma. One model I am working on has a forward argument that is sensitive to precision, so it has to stay in fp32 instead of being converted to bfloat16 and back.
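For context, the behavior under discussion is whether FSDP's mixed-precision machinery casts forward inputs to the low-precision dtype before the wrapped forward runs. Its semantics can be sketched in plain Python (an illustrative stand-in, not the actual FSDP implementation; `cast_fn` mimics an fp32 → bf16 cast by rounding):

```python
# Illustrative sketch of the cast_forward_inputs semantics from this PR:
# when True, forward inputs are cast to the low-precision dtype before
# the wrapped forward runs; when False, inputs pass through untouched,
# so a precision-sensitive argument can stay in fp32.
# (Plain-Python stand-in; not the actual FSDP implementation.)

def wrap_forward(forward, cast_forward_inputs, cast_fn):
    def wrapped(*args):
        if cast_forward_inputs:
            args = tuple(cast_fn(a) for a in args)  # downcast every input
        return forward(*args)                       # else: pass through as-is
    return wrapped

# Mimic fp32 -> bf16 precision loss with rounding.
lossy_cast = lambda x: round(x, 2)
identity_forward = lambda *args: args

casting = wrap_forward(identity_forward, True, lossy_cast)
skipping = wrap_forward(identity_forward, False, lossy_cast)

print(casting(3.14159))   # (3.14,)    -- input was downcast
print(skipping(3.14159))  # (3.14159,) -- input preserved in full precision
```

With the real API, the analogous choice is made per `MixedPrecision` config on the FSDP wrapper, letting a submodule that receives the sensitive argument opt out of input casting.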

@mrshenli mrshenli added the ciflow/trunk Trigger trunk jobs on your pull request label Dec 10, 2022
torch/distributed/fsdp/api.py (review thread, outdated)
@awgu awgu changed the title from “[FSDP] Allos MixedPrecision to skip inputs” to “[FSDP] Allow MixedPrecision to skip inputs” Dec 10, 2022
torch/distributed/fsdp/api.py (review thread, outdated)
Contributor

@awgu awgu left a comment


Overall, this looks good to me. Sorry for leaving all of the comments in separate reviews instead of just one.

@awgu
Contributor

awgu commented Dec 11, 2022

My bad for suggesting un-defaulting cast_forward_inputs; I did not notice that it already existed in past tests.

I think you need to add a value for cast_forward_inputs in this line:

model = SaveForwardInputsModel(forward_inputs).cuda()
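For context, `SaveForwardInputsModel` in the referenced test records its forward inputs so the test can later assert on their dtypes. A minimal pure-Python sketch of that pattern, with a `cast_forward_inputs` argument added as suggested above (the signature and body here are assumptions for illustration; the real helper lives in PyTorch's FSDP test suite):

```python
# Toy stand-in for the test helper discussed above: it records every
# forward input into a shared dict so the test can inspect it afterwards.
# The cast_forward_inputs parameter mirrors the suggested fix; its exact
# placement in the real test's signature is an assumption here.

class SaveForwardInputsModel:
    def __init__(self, forward_inputs, cast_forward_inputs=False):
        self.forward_inputs = forward_inputs            # shared dict for recording
        self.cast_forward_inputs = cast_forward_inputs  # the flag being un-defaulted

    def forward(self, x):
        self.forward_inputs[id(self)] = x  # save the input for later checks
        return x

recorded = {}
model = SaveForwardInputsModel(recorded, cast_forward_inputs=False)
model.forward(1.0)
print(list(recorded.values()))  # [1.0]
```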

@facebook-github-bot
Contributor

This pull request has been merged in 80542ad.

mrshenli added a commit to mrshenli/pytorch that referenced this pull request Dec 12, 2022
ghstack-source-id: d9e229fce99a0fc8c422aee8f80405dd21352a17
Pull Request resolved: pytorch#90620
@facebook-github-bot facebook-github-bot deleted the gh/mrshenli/355/head branch June 8, 2023 18:02
Labels: ciflow/trunk (Trigger trunk jobs on your pull request), Merged, release notes: distributed (fsdp)