Skip to content

model: Mistral Nemo #19

@offgridtech

Description

@offgridtech

  • I have searched the existing issues

Current behavior

I see a bunch of stuff on HuggingFace and llama.cpp Git about pre-tokenizers causing issues upon initial release of the quantizied Mistal Nemo model, but it seemed everything was cleared up over the last few days due to a llama.cpp update. What worked for other people didn't work for Jan. I've tried several quant versions, and it fails to start. Saw KoboldCPP and LMStudio say they made some updates, and it's fixed now. I'm guessing you all need to do the same. Thanks

More information here:
ggml-org/llama.cpp#8579
ggml-org/llama.cpp#8604

Minimum reproduction step

It doesn't start. Other models like llama 3.1 start fine.

Expected behavior

The model starts

Screenshots / Logs

image

This log looks like it is the pre-tokenizer issue they were talking about.

Jan version

v0.5.2

In which operating systems have you tested?

  • macOS
  • Windows
  • Linux

Environment details

AppImage on Linux

Metadata

Metadata

Type

No type

Projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions