
Conversation

yanbing-j
Collaborator

@yanbing-j yanbing-j commented Feb 23, 2023

Motivation

Add prelu to the lower precision cast policy on AutocastCPU to fix #95365:

Before: Within the scope of torch.cpu.amp.autocast(dtype=torch.bfloat16), prelu cannot handle input and weight of different data types and raises a RuntimeError. This scenario is common under autocast: with autocast to bf16, if the op before prelu produces a bf16 output, which becomes the input of prelu while prelu's weight is still fp32, a RuntimeError is raised.

After: Within the scope of torch.cpu.amp.autocast(dtype=torch.bfloat16), prelu is forced to run in the bf16 data type.

Before #91238, when the input was bf16, the weight was forcibly cast to bf16. After #91238, this kind of test scenario raises a RuntimeError. There is no precision loss, since the previously working behavior also cast to bf16.

This also aligns with the Autocast CUDA whitelist.
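For illustration, a minimal sketch of the scenario described above (the shapes and layer choices are illustrative, not taken from the PR's tests):

```python
import torch
import torch.nn.functional as F

x = torch.randn(8, 16)                              # fp32 input
linear = torch.nn.Linear(16, 16)                    # fp32 weights
prelu_weight = torch.nn.Parameter(torch.randn(16))  # fp32 PReLU weight, one per channel

with torch.cpu.amp.autocast(dtype=torch.bfloat16):
    y = linear(x)  # linear is on the lower-precision list, so y comes out bf16
    # Before this PR: F.prelu raised a RuntimeError on CPU because the
    # input (bf16) and weight (fp32) dtypes differ under autocast.
    # After this PR: prelu is also on the lower-precision cast policy,
    # so both operands are cast to bf16 and the call succeeds.
    out = F.prelu(y, prelu_weight)

print(out.dtype)  # torch.bfloat16 with this change
```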

cc @mcarilli @ptrblck @leslie-fang-intel @jgong5 @mingfeima @XiaobingSuper @sanchitintel @ashokei @jingxu10

@pytorch-bot

pytorch-bot bot commented Feb 23, 2023

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/95366

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 64b70f9:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@leslie-fang-intel
Collaborator

@yanbing-j May I ask why we have to put prelu into the whitelist? How much will it affect model performance?

@yanbing-j
Collaborator Author

@yanbing-j May I ask why we have to put prelu into the whitelist? How much will it affect model performance?

I have updated the description, which will answer your questions.

@leslie-fang-intel
Collaborator

Thanks for the comments. It looks like PReLU should support mixed-precision input like BN does, instead of being added to the whitelist?

@yanbing-j
Collaborator Author

This also aligns with the CUDA whitelist.
Since #91238 rewrote all of the PReLU code and merged the CPU and CUDA implementations, I don't think we need the extra effort of splitting the CPU and CUDA code again just to add mixed-precision support.

Collaborator

@jgong5 jgong5 left a comment


I'm wondering what the design philosophy is for adding ops to the autocast whitelist. My understanding is that we only explicitly downcast ops that are very likely to get a perf benefit from low-precision compute even with the downcast cost, i.e., compute-intensive dot-product ops with HW acceleration support. From the autocast lists of CPU and CUDA, it seems "prelu" would be the only exception. In my opinion, "prelu" is more like batch norm and better fits the "fall-through" policy instead.

Can we do the type conversion on weights inside prelu, just like batch norm does, instead of throwing errors on type mismatch? @lezcano
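(For context, a minimal sketch of the "fall-through" behavior referred to above, assuming a CPU build where batch_norm accepts a bf16 input with fp32 parameters:)

```python
import torch

bn = torch.nn.BatchNorm1d(8)                       # fp32 affine weights and running stats
x_bf16 = torch.randn(4, 8, dtype=torch.bfloat16)   # low-precision activation

# batch_norm handles the dtype mismatch internally, so autocast can leave it
# alone ("fall-through") rather than putting it on a cast list.
out = bn(x_bf16)
print(out.dtype)  # torch.bfloat16
```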

@lezcano
Collaborator

lezcano commented Feb 27, 2023

I didn't add support for type promotion in prelu in that PR just because it's a bit annoying to get right when implementing second-order derivatives by hand.
Should I implement this within the operation itself, or should it be done at the autocast level? @ngimel

@ngimel
Collaborator

ngimel commented Feb 27, 2023

prelu doesn't follow standard type promotion rules, so it's indeed cumbersome to add the support at the op level. I don't think there's much harm in just adding prelu to the autocast allowlist (at worst an extra conversion of activations to low precision, but likely not even that, because activations typically come out in low precision from the preceding gemm/convolution).
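(As an aside, on builds that predate this change the mismatch can be worked around by casting the weight manually; this hypothetical snippet performs the same single conversion that autocast does once prelu is on the lower-precision list:)

```python
import torch
import torch.nn.functional as F

x = torch.randn(4, 8, dtype=torch.bfloat16)   # bf16 activation from a preceding op
w = torch.nn.Parameter(torch.randn(8))        # fp32 PReLU weight

out = F.prelu(x, w.to(x.dtype))               # cast weight to the activation dtype
print(out.dtype)                              # torch.bfloat16
```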

Collaborator

@lezcano lezcano left a comment


Let's go with autocast then as per #95366 (comment)

@yanbing-j yanbing-j force-pushed the yanbing/add_prelu_whitelist branch from a2a003c to 502e3c1 on February 28, 2023 02:30
@yanbing-j yanbing-j added the topic: not user facing and ciflow/trunk labels on Feb 28, 2023
@yanbing-j yanbing-j added the intel label on Feb 28, 2023
@yanbing-j
Collaborator Author

@pytorchbot rebase

@pytorchmergebot
Collaborator

@pytorchbot successfully started a rebase job. Check the current status here

@pytorchmergebot
Collaborator

Successfully rebased yanbing/add_prelu_whitelist onto refs/remotes/origin/viable/strict, please pull locally before adding more changes (for example, via git checkout yanbing/add_prelu_whitelist && git pull --rebase)

@pytorchmergebot pytorchmergebot force-pushed the yanbing/add_prelu_whitelist branch from 502e3c1 to 64b70f9 on February 28, 2023 07:05
@yanbing-j
Collaborator Author

@pytorchbot merge

@pytorchmergebot
Collaborator

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status here

cyyever pushed a commit to cyyever/pytorch_private that referenced this pull request Mar 2, 2023
Pull Request resolved: pytorch/pytorch#95366
Approved by: https://github.com/ngimel, https://github.com/lezcano, https://github.com/leslie-fang-intel
cyyever pushed a commit to cyyever/pytorch_private that referenced this pull request Mar 5, 2023
cyyever pushed a commit to cyyever/pytorch_private that referenced this pull request Mar 5, 2023
pruthvistony added a commit to ROCm/pytorch that referenced this pull request May 2, 2023

Labels

ciflow/trunk, intel, Merged, module: amp (automated mixed precision), open source, topic: not user facing

Projects

Status: Done

Development

Successfully merging this pull request may close these issues.

prelu: type promoting not supported

7 participants