Issue explained in this PR : https://github.com/ggml-org/llama.cpp/pull/9742 The comment : https://github.com/ggml-org/llama.cpp/pull/9742#discussion_r2379589613 I'm working on a really simple PR to fix it