Enable bfloat16 for hardtanh_backward_cuda #91511

cchan · 2022-12-29T17:26:37Z

I'm not sure why this was left out in the first place as all adjacent operations have both Half and BFloat16. Things seem to work as expected and this enables relu6 to be used in bfloat16 training. Hardtanh backward is super simple and precision is not relevant.

import torch
x_fp32 = torch.tensor([-1,2,4,7], requires_grad=True, dtype=torch.float32, device="cuda")
x_bf16 = torch.tensor([-1,2,4,7], requires_grad=True, dtype=torch.bfloat16, device="cuda")
torch.nn.functional.relu6(x_fp32).sum().backward()
torch.nn.functional.relu6(x_bf16).sum().backward()
assert (x_fp32.grad == x_bf16.grad).all()

Previously would fail with:

Traceback (most recent call last):
  File "test_hardtanh_patch.py", line 5, in <module>
    torch.nn.functional.relu6(x_bf16).sum().backward()
  File ".../lib/python3.8/site-packages/torch/_tensor.py", line 396, in backward
    torch.autograd.backward(self, gradient, retain_graph, create_graph, inputs=inputs)
  File ".../lib/python3.8/site-packages/torch/autograd/__init__.py", line 173, in backward
    Variable._execution_engine.run_backward(  # Calls into the C++ engine to run the backward pass
RuntimeError: "hardtanh_backward_cuda" not implemented for 'BFloat16'

pytorch-bot · 2022-12-29T17:26:39Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/91511

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit b5ab130:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

linux-foundation-easycla · 2022-12-29T17:26:42Z

The committers listed above are authorized under a signed CLA.

✅ login: cchan / name: Clive Chan (b4444c8)

cchan · 2022-12-29T17:36:04Z

cc @rohithkrn - just checking, do you recall if there was a reason to not add hardtanh_backward, like some test not passing? or just an oversight in this: #32065

cchan · 2022-12-30T05:36:38Z

@pytorchbot rebase

pytorch-bot · 2022-12-30T05:36:40Z

You don't have permissions to rebase this PR, only people with write permissions may rebase PRs.

rohithkrn · 2022-12-30T18:05:25Z

cc @rohithkrn - just checking, do you recall if there was a reason to not add hardtanh_backward, like some test not passing? or just an oversight in this: #32065

@cchan honestly don't remember. It's been a while.

Chillee · 2023-01-06T20:34:18Z

@pytorchbot merge

pytorchmergebot · 2023-01-06T20:36:00Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

pytorchmergebot · 2023-01-06T20:36:01Z

Merge failed

Reason: This PR is too stale; the last push date was more than 3 days ago. Please rebase and try again. You can rebase by leaving the following comment on this PR:
@pytorchbot rebase

Details for Dev Infra team

Raised by workflow job

cchan · 2023-01-07T00:35:16Z

@pytorchbot rebase

pytorch-bot · 2023-01-07T00:35:18Z

You don't have permissions to rebase this PR, only people with write permissions may rebase PRs.

ngimel · 2023-01-07T00:50:35Z

@pytorchbot rebase

pytorchmergebot · 2023-01-07T00:52:31Z

@pytorchbot successfully started a rebase job. Check the current status here

pytorchmergebot · 2023-01-07T00:52:35Z

Successfully rebased patch-5 onto refs/remotes/origin/viable/strict, please pull locally before adding more changes (for example, via git checkout patch-5 && git pull --rebase)

ngimel · 2023-01-09T18:47:10Z

@pytorchbot merge

pytorchmergebot · 2023-01-09T18:50:23Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

pytorch-bot bot added the release notes: cuda release notes category label Dec 29, 2022

pytorchbot added the open source label Dec 29, 2022

cchan requested review from mruberry and ngimel as code owners December 29, 2022 23:12

ngimel approved these changes Dec 30, 2022

View reviewed changes

cchan force-pushed the patch-5 branch from 7847de6 to a89729c Compare December 30, 2022 07:04

cchan mentioned this pull request Jan 6, 2023

Support BF16 in grid_sample #91800

Closed

pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Jan 6, 2023

cchan added 2 commits January 7, 2023 00:52

Enable bfloat16 for hardtanh_backward_cuda

69110d3

ci fix?

b5ab130

pytorchmergebot force-pushed the patch-5 branch from a89729c to b5ab130 Compare January 7, 2023 00:52

pytorchmergebot added the Merged label Jan 9, 2023

pytorchmergebot closed this in d4aa807 Jan 9, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enable bfloat16 for hardtanh_backward_cuda #91511

Enable bfloat16 for hardtanh_backward_cuda #91511

cchan commented Dec 29, 2022 •

edited

pytorch-bot bot commented Dec 29, 2022 •

edited

linux-foundation-easycla bot commented Dec 29, 2022 •

edited

cchan commented Dec 29, 2022

cchan commented Dec 30, 2022

pytorch-bot bot commented Dec 30, 2022

rohithkrn commented Dec 30, 2022

Chillee commented Jan 6, 2023

pytorchmergebot commented Jan 6, 2023

pytorchmergebot commented Jan 6, 2023

cchan commented Jan 7, 2023

pytorch-bot bot commented Jan 7, 2023

ngimel commented Jan 7, 2023

pytorchmergebot commented Jan 7, 2023

pytorchmergebot commented Jan 7, 2023

ngimel commented Jan 9, 2023

pytorchmergebot commented Jan 9, 2023

Enable bfloat16 for hardtanh_backward_cuda #91511

Enable bfloat16 for hardtanh_backward_cuda #91511

Conversation

cchan commented Dec 29, 2022 • edited

pytorch-bot bot commented Dec 29, 2022 • edited

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/91511

✅ No Failures

linux-foundation-easycla bot commented Dec 29, 2022 • edited

cchan commented Dec 29, 2022

cchan commented Dec 30, 2022

pytorch-bot bot commented Dec 30, 2022

rohithkrn commented Dec 30, 2022

Chillee commented Jan 6, 2023

pytorchmergebot commented Jan 6, 2023

Merge started

pytorchmergebot commented Jan 6, 2023

Merge failed

cchan commented Jan 7, 2023

pytorch-bot bot commented Jan 7, 2023

ngimel commented Jan 7, 2023

pytorchmergebot commented Jan 7, 2023

pytorchmergebot commented Jan 7, 2023

ngimel commented Jan 9, 2023

pytorchmergebot commented Jan 9, 2023

Merge started

cchan commented Dec 29, 2022 •

edited

pytorch-bot bot commented Dec 29, 2022 •

edited

linux-foundation-easycla bot commented Dec 29, 2022 •

edited