vulkan: support solve_tri with larger N/K values #17781

jeffbolznv · 2025-12-05T04:05:37Z

Split N into chunks to fit into shared memory.
If K > 128, use a larger workgroup with enough invocations. Add perf tests matching qwen3next.

See #17751 (comment)

vulkan: support solve_tri with larger N/K values

c50e5cb

jeffbolznv requested review from 0cc4m and ggerganov as code owners December 5, 2025 04:05

loci-dev mentioned this pull request Dec 5, 2025

UPSTREAM PR #17781: vulkan: support solve_tri with larger N/K values auroralabs-loci/llama.cpp#448

Open

github-actions bot added the testing, Vulkan, and ggml labels Dec 5, 2025

0cc4m approved these changes Dec 6, 2025

0cc4m merged commit c6c5e85 into ggml-org:master Dec 6, 2025
72 of 78 checks passed
