
ggml-wegpu: handle the buffer aliasing for rms fuse #22266

Merged
reeselevine merged 3 commits into ggml-org:master from noumena-labs:fix/rms-fuse
Apr 23, 2026

Conversation

@Constannnnnt
Contributor

Constannnnnt commented Apr 22, 2026

Overview

This PR addresses an edge case of #21983. While loading and running a model in the browser, I hit this error:

ggml_webgpu: Device error! Reason: 2, Message: Writable storage buffer binding aliasing found between [BindGroup "RMS_NORM_MUL"] set at bind group index 0, binding index 0, and [BindGroup "RMS_NORM_MUL"] set at bind group index 0, binding index 2, with overlapping ranges (offset: 5242880, size: 4096) and (offset: 5242880, size: 4096) in [Buffer "tensor_buf3"].
While encoding [ComputePassEncoder (unlabeled)].DispatchWorkgroups(1, 1, 1).
While finishing [CommandEncoder (unlabeled)].
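For context, the rule the validator enforces here is a plain half-open byte-range overlap check between two writable storage bindings in the same buffer. A minimal sketch of that check (illustrative names, not Dawn's actual implementation), using the offsets and sizes from the error message above:

```cpp
#include <cassert>
#include <cstdint>

// Two bindings into the same buffer alias when their half-open byte
// ranges [off, off + size) intersect. This mirrors the condition the
// WebGPU validator reports above; names here are illustrative only.
static bool bindings_alias(uint64_t off_a, uint64_t size_a,
                           uint64_t off_b, uint64_t size_b) {
    return off_a < off_b + size_b && off_b < off_a + size_a;
}
```

With the values from the log, `bindings_alias(5242880, 4096, 5242880, 4096)` is true: the two bindings cover the identical range of `tensor_buf3`, so the dispatch is rejected.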

As the error shows, the problem is buffer aliasing. I used a coding agent to analyze the logs:

  • the previous inplace flag only checked whether mul_src and dst overlapped
  • it missed the case where rn_src overlaps with dst (i.e. rn_src == dst), which is what triggers the tensor_buf3 error above

I reused the convention from the 'binary' shader:

  • inplace means src0 == dst, i.e. rn_src == dst
  • overlap means src1 == dst, i.e. mul_src == dst
  • src_overlap means src0 == src1, i.e. rn_src == mul_src
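The three flags above reduce to plain equality comparisons on the tensors involved in the fused RMS_NORM + MUL. A minimal sketch, where `rn_src`, `mul_src`, and `dst` stand in for the fused ops' tensor data, and the struct and function names are illustrative rather than the actual ggml-webgpu identifiers:

```cpp
#include <cassert>

// Hypothetical sketch of the three aliasing flags for the fused
// RMS_NORM + MUL path, following the 'binary' shader convention.
// Names are illustrative, not the actual ggml-webgpu code.
struct FuseFlags {
    bool inplace;     // rn_src  == dst
    bool overlap;     // mul_src == dst
    bool src_overlap; // rn_src  == mul_src
};

static FuseFlags classify_fuse_aliasing(const void *rn_src,
                                        const void *mul_src,
                                        const void *dst) {
    return {
        rn_src  == dst,
        mul_src == dst,
        rn_src  == mul_src,
    };
}
```

The bug described above corresponds to the `inplace` case (rn_src == dst): the old code only set a flag for `overlap` (mul_src == dst), so when the RMS_NORM source aliased the destination, both were still bound as separate writable storage bindings over the same range.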

Additional information

I didn't run any benchmark tests and only tested the model behaviour in the browser.


@Constannnnnt Constannnnnt requested a review from a team as a code owner April 22, 2026 22:46
@Constannnnnt Constannnnnt changed the title fix(shader): handle the buffer aliasing for rms fuse ggml-wegpu: handle the buffer aliasing for rms fuse Apr 22, 2026
@github-actions github-actions Bot added ggml changes relating to the ggml tensor library for machine learning WebGPU labels Apr 22, 2026
@reeselevine
Contributor

Thanks for the fix. I tested it out, and it looks like nothing in test-backend-ops would have caught this right now, even with skip_validation turned off. Other tests potentially would. We actually should turn skip_validation off; I originally added it for native performance, but I think the impact is minimal. FlashAttention needs to handle some aliasing first, though, which would be a good target for #22199.

@yomaytk fyi, if you're working on other fusion chains it might be good to check to see what other kind of buffer aliasing can occur and add a test for it in test-backend-ops if possible.

@yomaytk
Contributor

yomaytk commented Apr 23, 2026

Thanks for the fix, this looks good to me. How about adding a test case in test-backend-ops and confirming that it passes without skip_validation in this PR? If not, I'm happy to follow up with a separate PR.

> @yomaytk fyi, if you're working on other fusion chains it might be good to check to see what other kind of buffer aliasing can occur and add a test for it in test-backend-ops if possible.

Got it, thanks.

@reeselevine reeselevine requested a review from CISC April 23, 2026 03:15
@reeselevine reeselevine merged commit e5f070a into ggml-org:master Apr 23, 2026
44 of 46 checks passed
