[SDPA] Fix alignment check for efficient_attention #90413

drisspg · 2022-12-07T21:28:06Z

Fixes a bug found using head_dim_size==100 on an a100 gpu. This PR contains stricter guards on the input shape. These constraints are taken from xformers: https://github.com/facebookresearch/xformers/blob/gh/danthe3rd/60/orig/xformers/ops/fmha/cutlass.py#L23

pytorch-bot · 2022-12-07T21:28:09Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/90413

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit our office hours

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 Failures

As of commit 1c7bc96:

The following jobs have failed:

cuda11.6-py3.10-gcc7-sm86 / test (default, 1, 4, linux.g5.4xlarge.nvidia.gpu)

This comment was automatically generated by Dr. CI and updates every 15 minutes.

aten/src/ATen/native/transformers/cuda/mem_eff_attention/gemm_kernel_utils.h

drisspg · 2022-12-07T22:21:37Z

@pytorchbot merge

pytorchmergebot · 2022-12-07T22:23:40Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

pytorchmergebot · 2022-12-07T23:54:32Z

Merge failed

Reason: The following mandatory check(s) failed (Rule superuser):

pull

Dig deeper by viewing the failures on hud

Details for Dev Infra team

Raised by workflow job

aten/src/ATen/native/transformers/cuda/sdp_utils.h

drisspg · 2022-12-09T05:17:37Z

@pytorchbot merge

pytorchmergebot · 2022-12-09T05:23:45Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

pytorchmergebot · 2022-12-09T05:43:56Z

Merge failed

Reason: The following mandatory check(s) failed (Rule superuser):

pull

Dig deeper by viewing the failures on hud

Details for Dev Infra team

Raised by workflow job

drisspg · 2022-12-09T14:41:39Z

@pytorchbot rebase

pytorchmergebot · 2022-12-09T14:43:29Z

@pytorchbot successfully started a rebase job. Check the current status here

pytorchmergebot · 2022-12-09T14:43:32Z

Tried to rebase and push PR #90413, but it was already up to date

drisspg · 2022-12-09T17:03:40Z

@pytorchbot rebase

pytorchmergebot · 2022-12-09T17:05:25Z

@pytorchbot successfully started a rebase job. Check the current status here

pytorchmergebot · 2022-12-09T17:05:29Z

Tried to rebase and push PR #90413, but it was already up to date

drisspg · 2022-12-09T21:07:44Z

@pytorchbot merge -f "unrelated to my changes"

pytorchmergebot · 2022-12-09T21:09:20Z

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

drisspg marked this pull request as ready for review December 7, 2022 22:04

drisspg requested a review from cpuhrsch December 7, 2022 22:04

drisspg changed the title ~~alignment value should be 8 for sm75+ 16bit types~~ [SDP] Fix alignment check for efficient_attention Dec 7, 2022

mikekgfb approved these changes Dec 7, 2022

View reviewed changes

aten/src/ATen/native/transformers/cuda/mem_eff_attention/gemm_kernel_utils.h Show resolved Hide resolved

drisspg force-pushed the update_sdp_constraints branch from fca3a48 to 114953f Compare December 7, 2022 22:13

pytorch deleted a comment from pytorch-bot bot Dec 7, 2022

pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Dec 7, 2022

drisspg force-pushed the update_sdp_constraints branch from 114953f to 6615758 Compare December 8, 2022 00:46

danthe3rd reviewed Dec 8, 2022

View reviewed changes

aten/src/ATen/native/transformers/cuda/sdp_utils.h Outdated Show resolved Hide resolved

drisspg force-pushed the update_sdp_constraints branch 2 times, most recently from c186cf1 to 6f3e76d Compare December 8, 2022 21:35

drisspg added module: multi-headed-attention with-ssh and removed with-ssh labels Dec 8, 2022

drisspg force-pushed the update_sdp_constraints branch from 5978173 to cf15ace Compare December 9, 2022 05:16

alignment value should be 8 for sm75+ 16bit types

81c79dc

drisspg added 7 commits December 9, 2022 17:05

calc alignment per machine and check

861adae

trying limiting headsize to be greater than or equal to 8 '

927279c

tweaks

4e7a1bb

tests

ac5245f

alignment is 1 on on non sm80 machines

9ddc5e9

see if this turns off the right test failure

f9c869e

skip rocm

1c7bc96

drisspg force-pushed the update_sdp_constraints branch from cf15ace to 1c7bc96 Compare December 9, 2022 17:05

pytorchmergebot added the Merged label Dec 9, 2022

pytorchmergebot closed this in 912748e Dec 9, 2022

drisspg changed the title ~~[SDP] Fix alignment check for efficient_attention~~ [SDPA] Fix alignment check for efficient_attention Jan 10, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SDPA] Fix alignment check for efficient_attention #90413

[SDPA] Fix alignment check for efficient_attention #90413

drisspg commented Dec 7, 2022 •

edited

pytorch-bot bot commented Dec 7, 2022 •

edited

drisspg commented Dec 7, 2022

pytorchmergebot commented Dec 7, 2022

pytorchmergebot commented Dec 7, 2022

drisspg commented Dec 9, 2022

pytorchmergebot commented Dec 9, 2022

pytorchmergebot commented Dec 9, 2022

drisspg commented Dec 9, 2022

pytorchmergebot commented Dec 9, 2022

pytorchmergebot commented Dec 9, 2022

drisspg commented Dec 9, 2022

pytorchmergebot commented Dec 9, 2022

pytorchmergebot commented Dec 9, 2022

drisspg commented Dec 9, 2022

pytorchmergebot commented Dec 9, 2022

[SDPA] Fix alignment check for efficient_attention #90413

[SDPA] Fix alignment check for efficient_attention #90413

Conversation

drisspg commented Dec 7, 2022 • edited

pytorch-bot bot commented Dec 7, 2022 • edited

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/90413

❌ 1 Failures

drisspg commented Dec 7, 2022

pytorchmergebot commented Dec 7, 2022

Merge started

pytorchmergebot commented Dec 7, 2022

Merge failed

drisspg commented Dec 9, 2022

pytorchmergebot commented Dec 9, 2022

Merge started

pytorchmergebot commented Dec 9, 2022

Merge failed

drisspg commented Dec 9, 2022

pytorchmergebot commented Dec 9, 2022

pytorchmergebot commented Dec 9, 2022

drisspg commented Dec 9, 2022

pytorchmergebot commented Dec 9, 2022

pytorchmergebot commented Dec 9, 2022

drisspg commented Dec 9, 2022

pytorchmergebot commented Dec 9, 2022

Merge started

drisspg commented Dec 7, 2022 •

edited

pytorch-bot bot commented Dec 7, 2022 •

edited