[inductor] support _scaled_dot_product_flash_attention fallback #110085
Conversation
Summary: This PR adds support for the _scaled_dot_product_flash_attention fallback kernel. Note that in abi_compatible mode, outputs are retrieved by passing output-argument pointers rather than relying on std::get. It also fixes a dynamic-shapes issue where undefined dynamic symbols were incorrectly queried.
Test Plan: CI
Reviewed By: frank-wei
Differential Revision: D49620191
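For illustration, the sketch below contrasts the two output-retrieval styles mentioned above: unpacking a returned std::tuple with std::get versus writing results through output-argument pointers, which keeps the call a plain C-style function as abi_compatible mode requires. The function names and the use of int stand-ins are hypothetical placeholders for the generated wrapper code, not the actual AOTInductor shim API.

```cpp
#include <iostream>
#include <tuple>

// Hypothetical stand-in for a tensor handle; the real wrapper passes
// opaque tensor pointers, not ints.
using Out = int;

// C++-ABI style: return a std::tuple and let the caller use std::get.
std::tuple<Out, Out> flash_attention_tuple(int q, int k, int v) {
  return {q + k, v};  // placeholder "outputs"
}

// abi_compatible style: results are written through output-argument
// pointers, so the signature stays C-compatible.
void flash_attention_out_ptrs(int q, int k, int v, Out* out0, Out* out1) {
  *out0 = q + k;
  *out1 = v;
}

int main() {
  // Tuple style: unpack the returned outputs with std::get.
  auto t = flash_attention_tuple(1, 2, 3);
  std::cout << std::get<0>(t) << " " << std::get<1>(t) << "\n";

  // Output-pointer style: pass the addresses of the outputs instead.
  Out a = 0, b = 0;
  flash_attention_out_ptrs(1, 2, 3, &a, &b);
  std::cout << a << " " << b << "\n";
}
```

The second style avoids any dependence on the C++ standard library's tuple layout at the call boundary, which is the point of abi_compatible mode.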
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/110085
Note: Links to docs will display an error until the docs builds have been completed.
⏳ No Failures, 2 Pending as of commit 40a0fc0 with merge base d91492a.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
This pull request was exported from Phabricator. Differential Revision: D49620191
@pytorchbot merge -f 'Landed internally' (Initiating merge automatically since the Phabricator diff has merged; using force because this PR might not pass merge_rules.json but landed internally)
Merge started. Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
[inductor] support _scaled_dot_product_flash_attention fallback (pytorch#110085)
Pull Request resolved: pytorch#110085
Approved by: https://github.com/desertfire
cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @Xia-Weiwen @wenzhe-nrv @jiayisunx @peterbell10 @ipiszy @yf225 @kadeng @muchulee8 @aakhundov @ColinPeppler