Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We鈥檒l occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add Mistral 7B model support #111

Closed
flaneur2020 opened this issue Mar 14, 2024 · 4 comments 路 Fixed by #155
Closed

Add Mistral 7B model support #111

flaneur2020 opened this issue Mar 14, 2024 · 4 comments 路 Fixed by #155
Assignees
Labels
good first issue Good for newcomers

Comments

@flaneur2020
Copy link
Collaborator

No description provided.

@flaneur2020 flaneur2020 added the good first issue Good for newcomers label Mar 14, 2024
@b41sh
Copy link
Contributor

b41sh commented Mar 20, 2024

Hi, @flaneur2020 I'd like to work on this, can you assign this issue to me?

@flaneur2020
Copy link
Collaborator Author

@b41sh thank you for the contribution! you could download the mistral model from links like https://huggingface.co/TheBloke/Mistral-7B-v0.1-GGUF/tree/main

i recommend using a q8_0 model, as it's the most optimized one and tested quantization format

(but i'm a bit afraid of there's a risk that we might already supported mistral model as it's also considered as a llama model, it need a confirmation though)

This was referenced Apr 7, 2024
@flaneur2020
Copy link
Collaborator Author

flaneur2020 commented Apr 7, 2024

I'm sorry, I grabbed this issue in #155 , mistral actually did not make difference with llama, but there's a bug in GQA implementation, it works well after this bug get resolved.

i've splitted more model support related tasks in: #157 , we can make some investigations together to make these models work with crabml

@b41sh
Copy link
Contributor

b41sh commented Apr 7, 2024

Ok, I will continue work on those models.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
good first issue Good for newcomers
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants