[NNC] Some ops have type promotion logic which adds extra casts & does compute in different dtype than eager #49178
Labels
module: bootcamp
We plan to do a full writeup on the issue, and then get someone to do it for onboarding
NNC
triaged
This issue has been looked at by a team member, and triaged and prioritized into an appropriate module
🐛 Bug
Most NNC type promotion goes through a common path which promotes inputs to the highest input dtype and then, after compute, casts the output to whichever output dtype was recorded when the op was run in eager.
Because the output type is specified, all NNC ops will produce the correct dtype. However, the compute may be done in a different dtype than eager, and extraneous casts may be added.
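The common path described above can be sketched in Python (a hypothetical model for illustration, not the actual NNC C++ implementation; `nnc_style_op` is an invented name):

```python
import torch

def nnc_style_op(op, inputs, out_dtype):
    # Hypothetical sketch of NNC's common promotion path:
    # 1) promote every input to the highest input dtype,
    # 2) compute in that dtype,
    # 3) cast the result to the output dtype recorded in eager.
    highest = inputs[0].dtype
    for t in inputs[1:]:
        highest = torch.promote_types(highest, t.dtype)
    promoted = [t.to(highest) for t in inputs]
    result = op(*promoted)       # compute happens in `highest`
    return result.to(out_dtype)  # final cast guarantees the eager dtype

x = torch.ones(2, dtype=torch.int16)
y = torch.ones(2, dtype=torch.float32)
out = nnc_style_op(torch.add, [x, y], torch.float32)
```

Because the recorded output dtype is applied at the end, the result dtype always matches eager even when the intermediate compute dtype does not.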
In the (worst) case of something like an `addcmul` on three int16 tensors with `value=1`, we will cast all three tensor inputs to int32, do the addcmul, then cast the result back to int16. Eager, by contrast, just casts `value=1` to int16.
The advantage of the current approach is that we are guaranteed to have the same output dtype. I think ops could gradually be migrated over to have the correct casting behavior.
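For reference, this is what the eager addcmul case looks like (tensor names are hypothetical; all three inputs are int16):

```python
import torch

# Three int16 tensor inputs.
a = torch.ones(4, dtype=torch.int16)
b = torch.full((4,), 2, dtype=torch.int16)
c = torch.full((4,), 3, dtype=torch.int16)

# Eager keeps the compute in int16: value=1 is converted to int16,
# rather than all three tensors being widened to int32.
out = torch.addcmul(a, b, c, value=1)  # a + value * b * c
print(out.dtype)  # torch.int16
```

Eager and NNC agree on the final dtype here; the difference is only the intermediate compute dtype and the extra casts.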
Most ops do promote to the highest dtype, so this isn't that big of an issue, but it does come up.
TODO:
To Reproduce
1. Comment out the output type casting of the various
comput{number}Operands
2. Run
python test/test_jit_fuser_te.py