
Conversation

@0cc4m (Collaborator) commented Nov 18, 2025

Fixes #17297

To use subgroup operations on Intel, we have to force full subgroups; otherwise the subgroup size, and how many of its threads are actually active, may vary. I don't think this change should have any negative effect on other drivers.

@jeffbolznv (Collaborator)

While I agree this is worth a try, I don't understand why the failing change actually needs it. It doesn't assume a specific subgroup size or mapping of invocations to subgroups.

@0cc4m (Collaborator, Author) commented Nov 18, 2025

I don't fully understand it, but I think if VK_PIPELINE_SHADER_STAGE_CREATE_REQUIRE_FULL_SUBGROUPS_BIT is not set, a Vulkan driver may disable specific threads of a subgroup for performance reasons, which leads to undefined behaviour when subgroup operations are used.

I can't find much information about this in the specification, it's mostly experience from dealing with Intel GPUs which do vary their subgroup size. I guess someone would need to dig deeper into the ANV driver to figure out what exactly it is doing.


Labels

- ggml — changes relating to the ggml tensor library for machine learning
- Vulkan — issues specific to the Vulkan backend


Development

Successfully merging this pull request may close these issues.

Misc. bug: Vulkan\Llama-server.exe (b7064+) hangs during prompt processing if "--flash-attn on"
