convert : fix squeeze for ssm_conv tensors #12573

ggerganov · 2025-03-25T17:55:30Z

Broken in #10784.

compilade

Thanks!

I've fixed the name matching in 20b256e (using tensor types with self.match_model_tensor_name, and also the name of that tensor ends with .weight), and I've tested this with https://huggingface.co/state-spaces/mamba-130m-hf, which works.

@dodekapod Can you confirm this fixes the problem?

dodekapod · 2025-03-26T11:04:43Z

Thanks !
Yes, this fixes the problem. Tested with https://huggingface.co/tiiuae/Falcon3-Mamba-7B-Instruct .

convert : fix squeeze for ssm_conv tensors

9c60fc4

ggerganov requested a review from compilade March 25, 2025 17:55

github-actions bot added the python label Mar 25, 2025

ggerganov mentioned this pull request Mar 25, 2025

Misc. bug: Falcon3-Mamba-7B fails on ggml_ssm_conv #12572

Closed

convert : match ssm_conv tensors by type

20b256e

compilade approved these changes Mar 25, 2025

View reviewed changes

compilade merged commit df4d20c into master Mar 26, 2025
7 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

convert : fix squeeze for ssm_conv tensors #12573

convert : fix squeeze for ssm_conv tensors #12573

ggerganov commented Mar 25, 2025

compilade left a comment

dodekapod commented Mar 26, 2025

convert : fix squeeze for ssm_conv tensors #12573

convert : fix squeeze for ssm_conv tensors #12573

Conversation

ggerganov commented Mar 25, 2025

compilade left a comment

Choose a reason for hiding this comment

dodekapod commented Mar 26, 2025