Eval bug: crash when pooling_type == LLAMA_POOLING_TYPE_MEAN #12543

ivan-tkatchev · 2025-03-24T10:00:01Z

Linux

CPU

ARM Ampere

Qwen2.5-14B-Instruct-1M-Q5_K_M

Setting pooling_type = LLAMA_POOLING_TYPE_MEAN and calling llama_init_from_model() causes this crash:

/build/source/ggml/src/ggml.c:2738: GGML_ASSERT(ggml_can_mul_mat(a, b)) failed

Setting to LLAMA_POOLING_TYPE_LAST and changing nothing else works correctly.

No response

/build/source/ggml/src/ggml.c:2738: GGML_ASSERT(ggml_can_mul_mat(a, b)) failed

The text was updated successfully, but these errors were encountered:

ivan-tkatchev · 2025-03-24T10:01:16Z

Possibly a duplicate of #12517

ivan-tkatchev added the bug-unconfirmed label Mar 24, 2025

Provide feedback