Bugfix for MiniCPM model support (undo HF model tensor permute) #5392

Merged 4 commits on Feb 8, 2024

runfuture
Contributor

@runfuture runfuture commented Feb 7, 2024

The previous PR replaced convert-minicpm.py with convert-hf-to-gguf.py but did not undo the Hugging Face model tensor permute.
This PR fixes that major problem along with some minor issues.
Everything now works in my own testing. More testing is welcome: just check out this branch.
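
For readers unfamiliar with the permute mentioned above: as I understand it, Hugging Face LLaMA-style checkpoints store the rows of the Q/K projection weights in a rearranged per-head order, and a converter that targets GGUF has to restore the original order. The sketch below illustrates the idea in pure Python on a list of row indices only. The function names are hypothetical and this is not the code from this PR, which works on real tensors.

```python
def hf_permute_rows(rows, n_head):
    """Mimic the HF-side permute: within each head, view the rows as
    (half, 2) pairs and transpose to (2, half), so even rows come first."""
    per_head = len(rows) // n_head
    half = per_head // 2
    out = []
    for h in range(n_head):
        head = rows[h * per_head:(h + 1) * per_head]
        for j in range(2):
            for i in range(half):
                out.append(head[i * 2 + j])
    return out


def reverse_hf_permute_rows(rows, n_head):
    """Undo hf_permute_rows: within each head, view the rows as
    (2, half) and transpose back to (half, 2) interleaved pairs."""
    per_head = len(rows) // n_head
    half = per_head // 2
    out = []
    for h in range(n_head):
        head = rows[h * per_head:(h + 1) * per_head]
        for i in range(half):
            for j in range(2):
                out.append(head[j * half + i])
    return out


if __name__ == "__main__":
    original = list(range(12))            # 2 heads, 6 rows each
    permuted = hf_permute_rows(original, n_head=2)
    restored = reverse_hf_permute_rows(permuted, n_head=2)
    print(permuted)
    print(restored == original)
```

If the reverse step is skipped, the weights load without error but the rows no longer match the layout the runtime expects, which is consistent with the broken output reported before this fix.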

Owner

@ggerganov ggerganov left a comment

Waiting for confirmation that it works, then we can merge.

@sweetcard

It works now.

@ggerganov ggerganov merged commit 4aa43fa into ggerganov:master Feb 8, 2024
50 of 54 checks passed
jordankanter pushed a commit to jordankanter/llama.cpp that referenced this pull request Mar 13, 2024
* fix bug for norm_rms_eps missing

* to align with the same order as convert.py for model write

* fix: undo HF models permute tensor

* update for flake8 lint
hodlen pushed a commit to hodlen/llama.cpp that referenced this pull request Apr 1, 2024