[SYCL] fix flash_attention crash for Qwen3-Coder by arthw · Pull Request #21377 · ggml-org/llama.cpp

arthw · 2026-04-03T15:36:56Z

The code branches can't cover the case of Qwen3-Coder-Next-UD-IQ1_M.gguf.
Add code as final handler.
Verified the LLM and all related UT cases are passed.

arthw · 2026-04-05T15:36:13Z

@ggerganov
Please review and merge this PR!
Thank you!

handle other case

89daec0

arthw requested a review from a team as a code owner April 3, 2026 15:37

arthw requested a review from ggerganov April 3, 2026 15:37

NeoZhangJianyu mentioned this pull request Apr 3, 2026

[SYCL] Enhance flash-attention performance #21185

Merged

github-actions bot added ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language labels Apr 3, 2026

ggerganov added the merge ready A maintainer can use this label to indicate that they consider the changes final and ready to merge. label Apr 3, 2026

ggerganov merged commit f51fd36 into ggml-org:master Apr 6, 2026
183 of 190 checks passed

arthw mentioned this pull request Apr 7, 2026

SYCL: flash attention tile kernel crash on 2nd prompt with Qwen3.5 #21396

Open

iamwavecut pushed a commit to iamwavecut/llama-cpp-turboquant that referenced this pull request Apr 8, 2026

sycl : handle other FA case (ggml-org#21377)

fd73ac7

iamwavecut pushed a commit to iamwavecut/llama-cpp-turboquant that referenced this pull request Apr 8, 2026

sycl : handle other FA case (ggml-org#21377)

a86bb32

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SYCL] fix flash_attention crash for Qwen3-Coder#21377

[SYCL] fix flash_attention crash for Qwen3-Coder#21377
ggerganov merged 1 commit intoggml-org:masterfrom
arthw:fix_flash_atten

arthw commented Apr 3, 2026

Uh oh!

arthw commented Apr 5, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

arthw commented Apr 3, 2026

Uh oh!

arthw commented Apr 5, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants