
feat: Jan supports safetensors #1056

Closed
0xSage opened this issue Dec 18, 2023 · 4 comments
Labels

engineering: Jan Inference Layer (Jan can serve models locally: with correct data structs, APIs, multi-inference engines, multi-model)
P1: important (Important feature / fix)
type: feature request (A new feature)

Comments

0xSage (Contributor) commented Dec 18, 2023

@Van-QA quoted this from feature request #2723:

Problem
Jan can only run GGUF models, not the wider range of model formats out there. If it could run Transformers models, most models would be usable.

Success Criteria
Take any Hugging Face Transformers model and make it available in Jan.

[Two screenshots attached: 無題 2024-04-15 16-27-20, 無題 2024-04-15 16-28-09]

@0xSage 0xSage added the type: epic label Dec 18, 2023
@0xSage 0xSage added this to the Jan supports multiple Inference Engines milestone Dec 18, 2023
@0xSage 0xSage changed the title epic: jan supports safetensors epic: Jan supports safetensors Dec 18, 2023
@0xSage 0xSage added the type: feature request label and removed the type: epic label Dec 18, 2023
@0xSage 0xSage changed the title epic: Jan supports safetensors feat: Jan supports safetensors Dec 18, 2023
@0xSage 0xSage added the engineering: Jan Inference Layer label Dec 22, 2023
@0xSage 0xSage removed this from the Jan supports multiple Inference Engines milestone Dec 27, 2023
hiro-v (Contributor) commented Feb 11, 2024

Support added in #1972 for converting a Hugging Face safetensors model to GGUF and using it.
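
A minimal sketch of how an importer could decide whether a repo needs that safetensors-to-GGUF conversion path, assuming the huggingface_hub Python package; the repo id in the usage note is a hypothetical example, not something from this thread:

# Check a Hugging Face repo's file list to see whether it already ships
# GGUF or only safetensors (and therefore needs the #1972 conversion path).
from huggingface_hub import list_repo_files

def needs_conversion(repo_id: str) -> bool:
    files = list_repo_files(repo_id)
    has_gguf = any(f.endswith(".gguf") for f in files)
    has_safetensors = any(f.endswith(".safetensors") for f in files)
    return has_safetensors and not has_gguf

# Hypothetical usage:
# needs_conversion("TinyLlama/TinyLlama-1.1B-Chat-v1.0")  # True: safetensors only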

@Van-QA Van-QA added this to the v0.4.8 milestone Feb 18, 2024
@Van-QA Van-QA added the P1: important label Feb 18, 2024
@Van-QA Van-QA modified the milestones: v0.4.8, v0.4.10 Mar 1, 2024
Van-QA (Contributor) commented Mar 6, 2024

Although issue #2167 is resolved, Import via Hugging Face is on hold until the epic janhq/cortex#571 is complete.

@hiro-v hiro-v removed their assignment Mar 14, 2024
@Van-QA Van-QA modified the milestones: v0.4.10, v0.4.11 Mar 25, 2024
@Van-QA Van-QA modified the milestones: v0.4.11, v0.4.12 Apr 4, 2024
@louis-jan louis-jan modified the milestones: v0.4.12, v0.4.13 Apr 16, 2024
@louis-jan louis-jan assigned Inchoker and unassigned namchuai and louis-jan Apr 16, 2024
hiro-v (Contributor) commented May 17, 2024

I have checked the technical possibilities for this.

Please read more in this doc (draft): https://f1da82fe.docs-9ba.pages.dev/guides/glossaries/gguf

Basically, there are two steps to produce a single GGUF model (see the sketch after this list):

  • Convert the Hugging Face .safetensors model to GGUF BF16 (normally takes around 2 minutes). This requires the convert-hf-to-gguf script in Python (which can be executed using the cortex Python runtime). Example command: python llama.cpp/convert-hf-to-gguf.py models --outtype bf16 --outfile "${{ env.MODEL_NAME }}/${{ env.bf16 }}"
  • Once we have the GGUF BF16 model, the user can choose the quantization type they want and run the quantization (around 2 minutes). llama.cpp exposes a low-level C++ API for this in quantize. Example command: ./llama.cpp/quantize "${{ env.MODEL_NAME }}/${{ env.bf16 }}" "${{ env.MODEL_NAME }}/$qtype" "$method"
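
A minimal Python sketch of that two-step pipeline, wrapping the two commands above, assuming a local llama.cpp checkout containing convert-hf-to-gguf.py and the quantize binary; the model directory, output file names, and default quantization type are hypothetical:

# Step 1 converts a Hugging Face safetensors snapshot to GGUF BF16;
# step 2 quantizes the BF16 file to the type the user picked.
import subprocess
from pathlib import Path

LLAMA_CPP = Path("llama.cpp")            # local llama.cpp checkout (assumption)
MODEL_DIR = Path("models/my-model")      # hypothetical downloaded HF snapshot
BF16_FILE = MODEL_DIR / "model-bf16.gguf"

def convert_to_bf16() -> None:
    # ~2 minutes: safetensors -> GGUF BF16 via the Python converter
    subprocess.run(
        ["python", str(LLAMA_CPP / "convert-hf-to-gguf.py"), str(MODEL_DIR),
         "--outtype", "bf16", "--outfile", str(BF16_FILE)],
        check=True,
    )

def quantize(method: str = "Q4_K_M") -> Path:
    # ~2 minutes: BF16 GGUF -> user-chosen quantization via the quantize binary
    out_file = MODEL_DIR / f"model-{method}.gguf"
    subprocess.run(
        [str(LLAMA_CPP / "quantize"), str(BF16_FILE), str(out_file), method],
        check=True,
    )
    return out_file

convert_to_bf16()
print("quantized model at", quantize())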

I think this would help a lot with the adoption of cortex-cli and the Jan app.

0xSage (Contributor, Author) commented Jun 11, 2024

related: janhq/cortex#555

@0xSage 0xSage closed this as completed Jun 11, 2024