Store `autocast_gpu_dtype` in `custom_fwd` and `custom_bwd` for BFloat16 autocast #88029

Conversation
Signed-off-by: Masaki Kozuki <mkozuki@nvidia.com>
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/88029
Note: Links to docs will display an error until the docs builds have been completed.
✅ No Failures as of commit 4b580fc.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
Looks good, thanks for the test cleanup!
Signed-off-by: Masaki Kozuki <mkozuki@nvidia.com>
@pytorchbot merge

Merge started. Your change will be merged once all checks pass (ETA 0-4 Hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
Store `autocast_gpu_dtype` in `custom_fwd` and `custom_bwd` for BFloat16 autocast (pytorch#88029)

As per pytorch#87979, `custom_bwd` seems to forcefully use `torch.float16` for `torch.autograd.Function.backward` regardless of the `dtype` used in the forward.

Changes:
- store the `dtype` in `args[0]`
- update tests to confirm the dtype of intermediate result tensors that are outputs of autocast-compatible `torch` functions

cc @ptrblck @ngimel

Pull Request resolved: pytorch#88029
Approved by: https://github.com/ngimel
As per #87979, `custom_bwd` seems to forcefully use `torch.float16` for `torch.autograd.Function.backward` regardless of the `dtype` used in the forward.

Changes:
- store the `dtype` in `args[0]`
- update tests to confirm the dtype of intermediate result tensors that are outputs of autocast-compatible `torch` functions

cc @ptrblck @ngimel
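
For context, here is a minimal sketch of the behavior this PR describes, not the actual PyTorch source: `custom_fwd` records the active autocast GPU dtype on the autograd context (`args[0]` is the `ctx` object), and `custom_bwd` re-enters autocast with that recorded dtype instead of a hard-coded `torch.float16`. Attribute names such as `_dtype` and `_fwd_used_autocast` are illustrative, and the `cast_inputs` option of the real decorator is omitted.

```python
import functools

import torch
from torch.cuda.amp import autocast


def custom_fwd(fwd):
    """Sketch: record the autocast state on ctx so backward can reproduce it."""
    @functools.wraps(fwd)
    def decorate_fwd(*args, **kwargs):
        ctx = args[0]  # first argument of Function.forward is the autograd context
        # Remember whether autocast was active and which GPU dtype it was using
        # (e.g. torch.bfloat16), instead of assuming torch.float16 later.
        ctx._fwd_used_autocast = torch.is_autocast_enabled()
        ctx._dtype = torch.get_autocast_gpu_dtype()
        return fwd(*args, **kwargs)
    return decorate_fwd


def custom_bwd(bwd):
    """Sketch: re-enter autocast in backward with the dtype captured in forward."""
    @functools.wraps(bwd)
    def decorate_bwd(*args, **kwargs):
        ctx = args[0]
        with autocast(enabled=ctx._fwd_used_autocast, dtype=ctx._dtype):
            return bwd(*args, **kwargs)
    return decorate_bwd


class MyMM(torch.autograd.Function):
    """Toy autograd Function used to observe intermediate dtypes (requires CUDA)."""

    @staticmethod
    @custom_fwd
    def forward(ctx, a, b):
        ctx.save_for_backward(a, b)
        return a.mm(b)

    @staticmethod
    @custom_bwd
    def backward(ctx, grad):
        a, b = ctx.saved_tensors
        # Under bfloat16 autocast these mm outputs should now be bfloat16,
        # not float16, because the stored dtype is reused here.
        return grad.mm(b.t()), a.t().mm(grad)


if torch.cuda.is_available():
    a = torch.randn(4, 4, device="cuda", requires_grad=True)
    b = torch.randn(4, 4, device="cuda", requires_grad=True)
    with torch.autocast("cuda", dtype=torch.bfloat16):
        out = MyMM.apply(a, b)
    out.sum().backward()
```

This mirrors the second bullet above: the tests check the dtype of intermediate tensors produced by autocast-compatible `torch` ops inside `backward`, which is where the previously hard-coded `torch.float16` was visible.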