inductor: add input type check for fuse_attention #99296
Conversation
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/99296. Note: links to docs will display an error until the docs builds have completed. ✅ No failures as of commit 0723f11. This comment was automatically generated by Dr. CI and updates every 15 minutes.
For TIMM `xcit_large_24_p8_224`, the scale factor is a tensor (https://github.com/huggingface/pytorch-image-models/blob/main/timm/models/xcit.py#L205), and `scaled_dot_product_attention` doesn't support a tensor scale, so this PR adds a check that only does the fusion when the scale factor is a float/int value.
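As a rough sketch of the idea, reconstructed from the review hunks quoted below (the merged code may differ in detail): the scale-factor patterns get an extra check that inspects the matched graph and only allows the fusion when the scale is a plain Python scalar. `filter_nodes` and the bmm -> view -> mul(div) walk come from the diff; the import path and overall wiring are assumptions.

```python
# Hedged sketch of the scale-factor check, reconstructed from the review
# hunks below; the exact code that landed may differ.
import torch
from torch._inductor.pattern_matcher import filter_nodes  # assumed location of the helper

aten = torch.ops.aten

def _sfdp_scale_factor_check(scale_factor_op):
    def fn(match):
        # Walk bmm -> view -> mul(div) to find the node applying the scale factor.
        matmuls = filter_nodes(match.nodes, aten.bmm)
        if len(matmuls) < 2:
            return False
        view_node = list(matmuls[0].users.keys())[0]
        scale_factor_node = list(view_node.users.keys())[0]
        if scale_factor_node.target != scale_factor_op:
            return False
        if len(scale_factor_node.args) != 2:
            return False
        # scaled_dot_product_attention only supports a scalar scale, so skip
        # the fusion when the scale factor is a Tensor (e.g. xcit_large_24_p8_224).
        return isinstance(scale_factor_node.args[1], (int, float))
    return fn
```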
```python
        self._check_common(sfdp_pattern_6, contains=False)

    def test_pattern_fails_with_tensor_factor(self):
        # https://github.com/pytorch/pytorch/issues/99124
```
This issue was reported on the CPU side, but I checked and it also happens on the GPU side, so I added this case here.
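For reference, a minimal hedged sketch of such a test, assuming a module with a learnable tensor scale modeled on the linked xcit attention; `TensorScaleAttention` and `check_no_fusion` are hypothetical names used only for illustration, not the actual test in this PR.

```python
# Hedged sketch of a "tensor scale factor must not break compilation" check.
import torch

class TensorScaleAttention(torch.nn.Module):
    def __init__(self, head_dim):
        super().__init__()
        # Learnable (Tensor) scale factor, as in timm's xcit attention.
        self.scale = torch.nn.Parameter(torch.tensor(head_dim ** -0.5))

    def forward(self, q, k, v):
        attn = (q @ k.transpose(-2, -1)) * self.scale
        return attn.softmax(dim=-1) @ v

def check_no_fusion(device="cpu"):
    mod = TensorScaleAttention(64).to(device).eval()
    q, k, v = (torch.randn(2, 4, 16, 64, device=device) for _ in range(3))
    compiled = torch.compile(mod)
    with torch.no_grad():
        # The fusion should be skipped (the scale is a Tensor) and the compiled
        # output should still match eager mode.
        torch.testing.assert_close(
            compiled(q, k, v), mod(q, k, v), atol=2e-3, rtol=2e-3
        )
```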
```python
scale_factor_node = list(view_node.users.keys())[0]
if len(scale_factor_node.args) != 2:
    return False
# make sure the scale_factor is a float/int. SymInt?
```
Should we add a dynamic shape case to verify SymInt?
Yes, we may need to add a dynamic shape case, but the pattern doesn't match in the dynamic shape case yet. Let's do that as a next step.
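If the dynamic-shape patterns do start matching later, the type check could plausibly be widened to accept symbolic scalars as well; a minimal sketch of that direction (not part of this PR), using the public `torch.SymInt`/`torch.SymFloat` types:

```python
import torch

def _is_scalar_scale(scale_factor):
    # Hypothetical follow-up: besides plain Python scalars, also accept
    # symbolic scalars once the dynamic-shape patterns can match.
    return isinstance(scale_factor, (int, float, torch.SymInt, torch.SymFloat))
```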
```python
def _return_true(match):
    return True
```
I'm wondering if we could fold this function into `_sfdp_scale_factor_check`.
```python
# make sure the mul(div) for the scale factor is a scalar mul(div).
# bmm -> view -> mul(div)
matmuls = filter_nodes(match.nodes, aten.bmm)
if len(matmuls) < 2:
```
Shouldn't this already be checked by the pattern? Can you give an example and add a test for when this fails?
Yes, this is already checked by the pattern; I added it just to be safe. I will remove it to simplify the code.
```python
if len(matmuls) < 2:
    return False
if (
    len(matmuls[0].users) != 1
```
Shouldn't this already be checked by the pattern? Can you give an example and add a test for when this fails?
Removed this check.
```python
    return False
view_node = list(matmuls[0].users.keys())[0]
if (
    len(view_node.users) != 1
```
Shouldn't this already be checked by the pattern? Can you give an example and add a test for when this fails?
Removed this check.
```python
    or list(view_node.users.keys())[0].target != scale_factor_op
):
    return False
scale_factor_node = list(view_node.users.keys())[0]
```
Could you get here directly with `filter_nodes(match.nodes, scale_factor_op)`?
Yes, changed.
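In terms of the earlier sketch, the bmm -> view walk then collapses into a direct lookup, roughly (illustration only; not necessarily the exact final code):

```python
# Sketch of the simplified check: filter_nodes goes straight to the mul/div
# node instead of walking bmm -> view -> mul(div).
def fn(match):
    scale_factor_nodes = filter_nodes(match.nodes, scale_factor_op)
    if len(scale_factor_nodes) == 0 or len(scale_factor_nodes[0].args) != 2:
        return False
    return isinstance(scale_factor_nodes[0].args[1], (int, float))
```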
```python
    torch.randn(tensor_shape, device="cuda"),
    torch.randn(tensor_shape, device="cuda"),
]
with torch.no_grad():
```
Can you test training as well?
A training test has been added. Note that even without this PR, the training path works fine (the pattern doesn't match there), though it has an accuracy gap compared with eager mode.
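For reference, a hedged sketch of what exercising the training path could look like; `check_training` is a hypothetical helper, and the actual test added in the PR may be structured differently.

```python
# Hedged sketch of a training-path check: run the compiled module with grads
# enabled and compare outputs and input gradients against eager mode.
import torch

def check_training(mod, device="cpu"):
    q, k, v = (
        torch.randn(2, 4, 16, 64, device=device, requires_grad=True)
        for _ in range(3)
    )
    compiled = torch.compile(mod)

    ref = mod(q, k, v)
    ref.sum().backward()
    ref_grads = [t.grad.clone() for t in (q, k, v)]
    for t in (q, k, v):
        t.grad = None

    out = compiled(q, k, v)
    out.sum().backward()
    torch.testing.assert_close(out, ref, atol=2e-3, rtol=2e-3)
    for t, g in zip((q, k, v), ref_grads):
        torch.testing.assert_close(t.grad, g, atol=2e-3, rtol=2e-3)
```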
@jansel, please help review this PR again. Thanks!

@pytorchbot merge

Merge started. Your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
Stack from ghstack (oldest at bottom):
For TIMM `xcit_large_24_p8_224`, the scale factor is a tensor (https://github.com/huggingface/pytorch-image-models/blob/main/timm/models/xcit.py#L205), and `scaled_dot_product_attention` doesn't support a tensor scale, so this PR adds a check that only does the fusion when the scale factor is a float/int value.

cc @soumith @voznesenskym @penguinwu @anijain2305 @EikanWang @jgong5 @Guobing-Chen @zhuhaozhe @blzheng @Xia-Weiwen @wenzhe-nrv @jiayisunx @peterbell10 @desertfire