feat: add vicuna-13b-v1.5-cuda libraries #15

Joelkang · 2023-08-25T19:42:21Z

Model weights are here: https://huggingface.co/Dala/mlc-chat-vicuna-13b-v1.5, in the various branches. They have all been built with target=cuda-multiarch and max-seq-len=4096.

Quantized with:
- autogptq_llama_q4f16_1
- q4f16_1
- q4f16_2
- q8f16_1

And target: cuda-multiarch
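For reference, a minimal sketch of the library names this set of quantizations would produce. It assumes every library follows the `<model>-<quantization>-cuda` pattern seen in this PR's original title (`vicuna-13b-v1.5-q4f16_1-cuda`); the helper `library_names` is hypothetical and not part of any MLC tooling, and the actual file names and extensions in the PR may differ.

```python
# Assumption: names follow "<model>-<quantization>-<suffix>", inferred from
# the PR's original title "vicuna-13b-v1.5-q4f16_1-cuda".
MODEL = "vicuna-13b-v1.5"
QUANTIZATIONS = ["autogptq_llama_q4f16_1", "q4f16_1", "q4f16_2", "q8f16_1"]
TARGET_SUFFIX = "cuda"  # built with target=cuda-multiarch


def library_names(model, quantizations, suffix):
    """Hypothetical helper: one library name per quantization mode."""
    return [f"{model}-{q}-{suffix}" for q in quantizations]


for name in library_names(MODEL, QUANTIZATIONS, TARGET_SUFFIX):
    print(name)
```

This only enumerates names for checking against the files in the PR; it does not download or build anything.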

Joelkang force-pushed the main branch from 987ff1c to da1fa5b Compare August 26, 2023 00:02

Joelkang changed the title ~~feat: add vicuna-13b-v1.5-q4f16_1-cuda library~~ feat: add vicuna-13b-v1.5-cuda libraries Aug 26, 2023

feat: add vicuna-13b-v1.5 for cuda libraries

fcefd48

Quantized with:
- autogptq_llama_q4f16_1
- q4f16_1
- q4f16_2
- q8f16_1

And target: cuda-multiarch

Joelkang force-pushed the main branch from 443ebf4 to fcefd48 Compare August 30, 2023 18:28

Merge branch 'mlc-ai:main' into main

3ec5c52

tqchen closed this Feb 15, 2024
