[onnx] support attn_mask fp16 type #110306
Conversation
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/110306
Note: Links to docs will display an error until the docs builds have been completed.
✅ No Failures as of commit 6149727 with merge base d04b35e. This comment was automatically generated by Dr. CI and updates every 15 minutes.
This looks good!
LGTM! Thanks!
@pytorchbot merge
Merge started. Your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
When users define a customized attention mask with `dtype=torch.float16`, the ONNX graph cannot be exported. When q, k, v have the fp16 type, we can support an fp16 `attn_mask` as well by extending the exporter's handling of the mask dtype, so the `.onnx` graph can be exported (a minimal repro is sketched below).

Fixes #109336