
Conversation

@jiachengjason
Contributor

  1. Patched the failed test case MUL_MAT(type_a=q4_0,type_b=f32,m=576,n=512,k=576,bs=[1,1],nr=[1,1],per=[0,1,2,3],k_v=0,o=1) when enabling WMMA on RDNA4 (verified that all test cases pass when running `./build/bin/test-backend-ops test -o MUL_MAT`).

  2. Quick cleanup of mma.cuh to add ggml_cuda_memcpy_1 back for half2 and bfloat162.

for #17156

@github-actions github-actions bot added labels Nvidia GPU (Issues specific to Nvidia GPUs) and ggml (changes relating to the ggml tensor library for machine learning) on Nov 25, 2025
@meven3000

Can confirm this resolves the incorrect Qwen model output. Thanks!

@JohannesGaessler JohannesGaessler merged commit 3e18dba into ggml-org:master Nov 26, 2025
59 of 63 checks passed
am17an pushed a commit to am17an/llama.cpp that referenced this pull request Nov 27, 2025
…7502)

* patch failed test case MUL_MAT(type_a=q4_0,type_b=f32,m=576,n=512,k=576,bs=[1,1],nr=[1,1],per=[0,1,2,3],k_v=0,o=1) for enabling WMMA on RDNA4

* Quick cleanup on mma.cuh to add ggml_cuda_memcpy_1 back in for half2 and bfloat162



3 participants