[test] chore(turboquant): bump fork pin to rebase/upstream-sync-april-2026 #9493
Move the TurboQuant llama.cpp fork pin from feature/turboquant-kv-cache (627ebbc6) to rebase/upstream-sync-april-2026 (7f320bb8), picking up the upstream-sync work on the fork.
Assisted-by: Claude:claude-opus-4-7
Pulled and sanity-checked on AMD / ROCm 7.2.1 (7900 XTX, gfx1100, wave32). HIP path looks good on the new pin.
Build
Clean build, 0 errors, only a benign
Smoke tests (Qwen3-8B-Q4_K_M, -fa on -ngl 99)
Both ran to the
What this covers from PR #101's community-testing checklist
Not covered here: gfx1200 (RX 9060 XT), head_dim > 256 (Gemma 4 full-attention), multi-GPU, prefill-heavy workloads against long contexts.
CI note
The only
LGTM for the HIP slice. Ship it.
Thank you!
Closing, as this now seems to be fixed and already merged in #9497. Thanks for your support!
Move the TurboQuant llama.cpp fork pin from feature/turboquant-kv-cache (627ebbc6) to rebase/upstream-sync-april-2026 (7f320bb8), picking up the upstream-sync work on the fork.
Testing TheTom/llama-cpp-turboquant#101
cc @TheTom