model: Mistral Nemo #19

offgridtech · 2024-07-27T03:11:58Z

I have searched the existing issues

Current behavior

I've seen a number of reports on HuggingFace and the llama.cpp GitHub about pre-tokenizer issues when the quantized Mistral Nemo model was first released, but those seemed to be cleared up over the last few days by a llama.cpp update. What worked for other people didn't work for Jan: I've tried several quant versions, and the model fails to start. KoboldCPP and LMStudio say they made some updates and it's fixed now, so I'm guessing you all need to do the same. Thanks

More information here:
ggerganov/llama.cpp#8579
ggerganov/llama.cpp#8604

Minimum reproduction step

The model doesn't start. Other models like Llama 3.1 start fine.

Expected behavior

The model starts

Screenshots / Logs

This log looks like it is the pre-tokenizer issue they were talking about.

Jan version

v0.5.2

In which operating systems have you tested?

macOS
Windows
Linux

Environment details

AppImage on Linux

dan-homebrew · 2024-08-30T03:41:57Z

@nguyenhoangthuan99 Can you look into this:

Is this the Tekken tokenizer?
This would need to be refactored into tokenizer.cpp?
I've scheduled this for this sprint: the scope is just to investigate and articulate what the long-term path is
However: if there's a fast solution, we should go for it

nguyenhoangthuan99 · 2024-09-04T02:00:09Z

Mistral Nemo can now be supported by the cortex.llamacpp engine. I tested with the current llama.cpp source, and it loads the model and answers questions correctly.
Next steps:
- Create the model hub entry for Mistral Nemo and upload the model
- Integrate with cortex; investigate the chat template, stop tokens, etc.

dan-homebrew · 2024-09-10T08:15:21Z

@offgridtech I am transferring this issue to cortex.cpp repo. We should be working on it, ETA 2 weeks

nguyenhoangthuan99 · 2024-09-25T09:59:11Z

I think Mistral Nemo is the first model for which we will run this pipeline automatically. To add support for a new model from Hugging Face:

Create a model repo mistral-nemo under cortexso (cc @0xSage @dan-homebrew to help create it, since my account doesn't have permission to do so)
Prepare ReadMe.md and model.yml for this model arch
Run the CI with this instruction to automatically pull, convert, and quantize the model at different quantization levels.
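
For readers unfamiliar with the pull, convert, and quantize flow, here is a rough sketch of the commands such a pipeline typically chains together. This is not the actual cortexso CI. The function name, the directory layout, the script and binary locations, the example repo id, and the quantization levels are all assumptions made for illustration. The sketch only builds the argument lists for `huggingface-cli download`, llama.cpp's `convert_hf_to_gguf.py`, and `llama-quantize`, and runs nothing.

```python
from pathlib import Path


def build_pipeline_commands(
    hf_repo,
    work_dir,
    quant_types,
    convert_script="llama.cpp/convert_hf_to_gguf.py",  # assumed location
    quantize_bin="llama-quantize",                      # assumed to be on PATH
):
    """Return pull, convert, and quantize commands as argument lists.

    Hypothetical helper: nothing is executed here, so the commands can be
    reviewed before being handed to subprocess.run or a CI step.
    """
    work = Path(work_dir)
    src_dir = work / "hf"              # downloaded Hugging Face checkpoint
    f16_file = work / "model-f16.gguf"  # intermediate full-precision GGUF

    commands = [
        # 1. Pull the original weights from Hugging Face.
        ["huggingface-cli", "download", hf_repo, "--local-dir", str(src_dir)],
        # 2. Convert the checkpoint to a single f16 GGUF file.
        ["python", convert_script, str(src_dir),
         "--outfile", str(f16_file), "--outtype", "f16"],
    ]
    # 3. One llama-quantize invocation per requested quantization level.
    for quant in quant_types:
        out_file = work / f"model-{quant.lower()}.gguf"
        commands.append([quantize_bin, str(f16_file), str(out_file), quant])
    return commands


if __name__ == "__main__":
    # Repo id and levels are example values, not the ones used by the CI.
    for cmd in build_pipeline_commands(
        "mistralai/Mistral-Nemo-Instruct-2407", "work", ["Q4_K_M", "Q8_0"]
    ):
        print(" ".join(cmd))
```

Returning argument lists instead of running them keeps the sketch free of side effects; a real pipeline would also need to upload the resulting files to the model repo.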

nguyenhoangthuan99 · 2024-09-26T14:53:48Z

Mistral-nemo is now supported at cortexso, with 10 quantization levels available. All models are created and uploaded automatically through CI.

You can try mistral-nemo with cortex-nightly.

0xSage · 2024-10-13T09:57:50Z

closing as done and QA'd

offgridtech added the type: bug Something isn't working label Jul 27, 2024

Van-QA assigned vansangpfiev and nguyenhoangthuan99 Jul 29, 2024

dan-homebrew transferred this issue from janhq/jan Sep 10, 2024

dan-homebrew changed the title ~~bug: Unable to Run Mistral Nemo~~ model: Mistral Nemo Sep 10, 2024

dan-homebrew mentioned this issue Sep 10, 2024

epic: Model Converter Pipeline #22

Closed

2 tasks

nguyenhoangthuan99 mentioned this issue Sep 26, 2024

chat with mistral nemo return empty message janhq/cortex.cpp#1338

Closed

dan-homebrew mentioned this issue Sep 29, 2024

epic: Built-in Model Library #21

Open

10 tasks

dan-homebrew transferred this issue from janhq/cortex.cpp Sep 29, 2024

dan-homebrew added the type: model request label Sep 29, 2024

0xSage added the P1: important Important feature / fix label Sep 29, 2024

0xSage closed this as completed Oct 13, 2024
