Conversation
|
@0cc4m omg, omg! Of course!! Thank you. I will try tomorrow |
|
@0cc4m Hello. Done!
And
|
|
Is that good performance? Please compare it to master. |
|
master
pr
|
|
Great, thank you! |
|
I am seeing some significant improvements in token generation speeds on Arrow Lake H, excellent work, many thanks for all your efforts ! Benchmark Model: .\GGUF\qwen\Qwen3-Coder-30B-A3B-Instruct-Q4_K_M.gguf load_backend: loaded RPC backend from C:\AI\bin\llamacpp\Vulkan\8185\ggml-rpc.dll
build: 2afcdb9 (8185) Benchmark Model: .\GGUF\qwen\Qwen3-Coder-30B-A3B-Instruct-Q4_K_M.gguf load_backend: loaded RPC backend from C:\AI\bin\llamacpp\Vulkan\8187\ggml-rpc.dll
build: feefb92 (8187) Benchmark Model: .\GGUF\Qwen3.5-35B-A3B-UD-Q5_K_XL.gguf load_backend: loaded RPC backend from C:\AI\bin\llamacpp\Vulkan\8185\ggml-rpc.dll
build: 2afcdb9 (8185) Benchmark Model: .\GGUF\Qwen3.5-35B-A3B-UD-Q5_K_XL.gguf load_backend: loaded RPC backend from C:\AI\bin\llamacpp\Vulkan\8187\ggml-rpc.dll
build: feefb92 (8187) Benchmark Model: .\GGUF\qwen\Qwen3.5-27B-Q4_K_M.gguf load_backend: loaded RPC backend from C:\AI\bin\llamacpp\Vulkan\8185\ggml-rpc.dll
build: 2afcdb9 (8185) Benchmark Model: .\GGUF\qwen\Qwen3.5-27B-Q4_K_M.gguf load_backend: loaded RPC backend from C:\AI\bin\llamacpp\Vulkan\8187\ggml-rpc.dll
build: feefb92 (8187) Benchmark Model: .\GGUF\GLM-4.7-Flash-Q4_K_M.gguf load_backend: loaded RPC backend from C:\AI\bin\llamacpp\Vulkan\8185\ggml-rpc.dll
build: 2afcdb9 (8185) Benchmark Model: .\GGUF\GLM-4.7-Flash-Q4_K_M.gguf load_backend: loaded RPC backend from C:\AI\bin\llamacpp\Vulkan\8187\ggml-rpc.dll
build: feefb92 (8187) |


Tune MMVQ use for Intel Windows according to #17628 (comment)
@savvadesogle Please try it and see if performance is good.