Replies: 3 comments · 3 replies
Thanks for reporting this, @mtdphn. We don't formally support Carnice just yet. I previously added it but couldn't finish the work.
Carnice is back ✅: revived, validated end-to-end, and merged in #406 (now on master). One heads-up on the link: it pointed at the old, archived dual-vLLM AutoRound attempt, which is why it 404s. The revived path is single-card beellama.

Quick start:

    WEIGHTS=carnice-v2 bash scripts/setup.sh qwen3.6-27b
    bash scripts/switch.sh beellama/carnice-v2-single-q5km-mtp --force  # :8068

Full results and the n=1-vs-n=2 finding are in the announcement → #407 (it edges out the Qwopus-Coder sibling on the agentic packs, 110 vs 103 out of 150). If you specifically want a 2×3090 Carnice for a bigger window or a higher quant (e.g. Q8), say the word and I'll look at adding a dual config. Thanks for the nudge, @mtdphn! 🙏
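If it helps anyone following the quick start, here is a minimal sketch for checking that something is actually listening after the switch. It assumes the `# :8068` comment refers to the local TCP port the server binds to; `wait_for_port` is a hypothetical helper written for this reply, not part of the repo's scripts, and it relies on bash's `/dev/tcp` feature.

```shell
#!/usr/bin/env bash
# Sketch only: wait until something accepts TCP connections on a port.
# Assumption: the "# :8068" comment in the quick start is the serving port.
# wait_for_port is a hypothetical helper, not one of the repo's scripts.

# wait_for_port HOST PORT [TRIES]
# Returns 0 as soon as HOST:PORT accepts a connection, or 1 if it never
# does within TRIES attempts (one second apart, default 30).
wait_for_port() {
  local host="$1" port="$2" tries="${3:-30}" i=0
  while [ "$i" -lt "$tries" ]; do
    # bash's /dev/tcp pseudo-device opens a TCP connection; the subshell
    # closes the descriptor again as soon as the check is done.
    if (exec 3<>"/dev/tcp/${host}/${port}") 2>/dev/null; then
      return 0
    fi
    i=$((i + 1))
    sleep 1
  done
  return 1
}

# Usage (commented out so sourcing this file has no side effects):
# WEIGHTS=carnice-v2 bash scripts/setup.sh qwen3.6-27b
# bash scripts/switch.sh beellama/carnice-v2-single-q5km-mtp --force
# wait_for_port 127.0.0.1 8068 60 && echo "port 8068 is accepting connections"
```

This only confirms that the port accepts connections; it says nothing about whether the model has finished loading or which API is served there.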
Thanks for the flag, @mtdphn. That link pointed at the old vLLM BF16-MTP path, which has since been restructured. Rather than just repair the link, we built the dual Carnice config folks have been asking for, and went up a quant tier while at it. It has now shipped; to switch to it:

    bash scripts/switch.sh --owui beellama/carnice-v2-dual-q8-mtp  # :8070 + Open WebUI

The full results card (serving + quality + how to run) is in the announcement → #417
The following link is missing: https://github.com/noonghunna/club-3090/blob/master/models/qwen3.6-27b/vllm/compose/dual/carnice-bf16mtp/bf16-mtp.yml