Bugfix for MiniCPM model support (undo HF model tensor permute) #5392

runfuture · 2024-02-07T15:07:33Z

There is a lack of undoing the huggingface model tensor permute when replacing convert-minicpm.py with convert-hf-to-gguf.py in the previous PR.
This PR fixes the major problems and some minor issues.
Currently, everything is working well. I have performed a self-test and a welcome more test by checking out this branch.

ggerganov

Waiting for confirmation that it works and can merge

sweetcard · 2024-02-08T09:57:09Z

It works now.

* fix bug for norm_rms_eps missing * to align with the same order as convert.py for model write * fix: undo HF models permute tensor * update for flake8 lint

runfuture added 3 commits February 7, 2024 23:01

fix bug for norm_rms_eps missing

e4bd73c

to align with the same order as convert.py for model write

526d517

fix: undo HF models permute tensor

d56b638

This was referenced Feb 7, 2024

MiniCPM 2b model support? #5276

Open

Support MiniCPM #5346

Merged

update for flake8 lint

5b0cec5

ggerganov approved these changes Feb 8, 2024

View reviewed changes

ggerganov merged commit 4aa43fa into ggerganov:master Feb 8, 2024
50 of 54 checks passed

Chaunice mentioned this pull request Feb 8, 2024

[Bad Case]: 在LM Studio 用不了 OpenBMB/MiniCPM#57

Closed

hodlen pushed a commit to hodlen/llama.cpp that referenced this pull request Apr 1, 2024

llama : fix MiniCPM (ggerganov#5392)

80f0aa3

* fix bug for norm_rms_eps missing * to align with the same order as convert.py for model write * fix: undo HF models permute tensor * update for flake8 lint

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bugfix for MiniCPM model support (undo HF model tensor permute) #5392

Bugfix for MiniCPM model support (undo HF model tensor permute) #5392

runfuture commented Feb 7, 2024 •

edited

Loading

ggerganov left a comment

sweetcard commented Feb 8, 2024

Bugfix for MiniCPM model support (undo HF model tensor permute) #5392

Bugfix for MiniCPM model support (undo HF model tensor permute) #5392

Conversation

runfuture commented Feb 7, 2024 • edited Loading

ggerganov left a comment

Choose a reason for hiding this comment

sweetcard commented Feb 8, 2024

runfuture commented Feb 7, 2024 •

edited

Loading