Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Eval bug: crash when pooling_type == LLAMA_POOLING_TYPE_MEAN #12543

Open
ivan-tkatchev opened this issue Mar 24, 2025 · 1 comment
Open

Eval bug: crash when pooling_type == LLAMA_POOLING_TYPE_MEAN #12543

ivan-tkatchev opened this issue Mar 24, 2025 · 1 comment

Comments

@ivan-tkatchev
Copy link

Name and Version

Revision 9b169a4

Operating systems

Linux

GGML backends

CPU

Hardware

ARM Ampere

Models

Qwen2.5-14B-Instruct-1M-Q5_K_M

Problem description & steps to reproduce

Setting pooling_type = LLAMA_POOLING_TYPE_MEAN and calling llama_init_from_model() causes this crash:

/build/source/ggml/src/ggml.c:2738: GGML_ASSERT(ggml_can_mul_mat(a, b)) failed

Setting to LLAMA_POOLING_TYPE_LAST and changing nothing else works correctly.

First Bad Commit

No response

Relevant log output

/build/source/ggml/src/ggml.c:2738: GGML_ASSERT(ggml_can_mul_mat(a, b)) failed
@ivan-tkatchev
Copy link
Author

Possibly a duplicate of #12517

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant