I recommend using a q8_0 model, as it's the most optimized and best-tested quantization format.
(But I'm a bit afraid there's a chance we already support Mistral, since it's also treated as a LLaMA-family model; that needs confirmation though.)
Sorry, I grabbed this issue in #155. Mistral actually turned out to behave the same as LLaMA, but there was a bug in the GQA implementation; it works well now that the bug is resolved.
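For context on why GQA was the tripping point: in grouped-query attention, several query heads share one key/value head, and a common bug is mapping query heads to KV heads incorrectly when `n_heads != n_kv_heads`. A minimal sketch of the mapping (illustrative names, not crabml's actual API):

```rust
/// Map a query-head index to the KV head it reads from under GQA.
/// Assumes n_heads is a multiple of n_kv_heads, as in LLaMA/Mistral configs.
fn kv_head_for_query_head(query_head: usize, n_heads: usize, n_kv_heads: usize) -> usize {
    assert!(
        n_heads % n_kv_heads == 0,
        "n_heads must be divisible by n_kv_heads"
    );
    let group_size = n_heads / n_kv_heads;
    query_head / group_size
}

fn main() {
    // Mistral 7B uses 32 query heads and 8 KV heads, so 4 query heads
    // share each KV head; LLaMA-1 7B uses 32/32 (plain multi-head attention).
    assert_eq!(kv_head_for_query_head(0, 32, 8), 0);
    assert_eq!(kv_head_for_query_head(5, 32, 8), 1);
    assert_eq!(kv_head_for_query_head(31, 32, 8), 7);
    // With n_heads == n_kv_heads, GQA degenerates to standard attention.
    assert_eq!(kv_head_for_query_head(13, 32, 32), 13);
    println!("ok");
}
```

If the divisor is dropped (using `n_kv_heads` instead of `group_size`, say), LLaMA-style 32/32 models still work by accident while Mistral's 32/8 layout breaks, which matches the symptom described above.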
I've split out more model-support tasks in #157; we can investigate together how to make these models work with crabml.