feat(curated): refresh FirstRun chat picks (qwen3 family + gpt-oss-20b) #27

Merged: thinmintdev merged 1 commit into main from feat/curated-chat-refresh-2026-05-16 on May 18, 2026

@thinmintdev
Contributor

Summary

  • Adds 5 new chat picks at the top of `CURATED_MODELS`: qwen3-coder-next, qwen3.6-27b, gpt-oss-20b, qwen3.5-9b, qwen3.5-0.8b
  • Keeps qwen3-4b + phi3-mini in the featured set (explicit ask)
  • llama32-3b stays in catalogue but renders below the refresh
  • No test churn — old ids remain resolvable

Why

Old picks (qwen3-4b/llama32-3b/phi3-mini) were sized for sub-8 GB hosts. The reference deployment is Strix Halo with a ~100 GB unified pool, so we should default to picks that exercise it.

Notes / follow-ups

  • `gpt-oss-20b` HF coords (`unsloth/gpt-oss-20b-GGUF`) are best-effort — verify before v1 cut
  • Follow-up PR will restructure the wizard into chat → embed/rerank → voice steps with FLM-multiplex detection (Phase 2)

🤖 Generated with Claude Code

Old picks (qwen3-4b, llama32-3b, phi3-mini) were sized for sub-8 GB
VRAM hosts.  The reference deployment is Strix Halo with a ~100 GB
unified memory pool, so those picks leave most of it idle.

This adds the 2026-05 chat refresh at the top of CURATED_MODELS so the
wizard surfaces the new picks first:

  qwen3-coder-next   49 GB  frontier coder, full Strix Halo pool
  qwen3.6-27b        20 GB  general-purpose default, Q5_K_XL
  gpt-oss-20b        12 GB  OpenAI open-weights
  qwen3.5-9b          6 GB  lean default — fits alongside embed/voice
  qwen3.5-0.8b      0.6 GB  smoke-test pick, sub-second cold start
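
To make the sizing argument concrete, here is a minimal sketch of which picks fit a given memory budget. The sizes are copied from the table above; `picks_that_fit`, `pool_gb`, and `reserved_gb` are hypothetical names, and treating the listed size as the whole memory cost (no KV cache or runtime overhead) is a simplifying assumption.

```python
# Sizes in GB are copied from the table above; the helper itself is hypothetical.
PICK_SIZES_GB: dict[str, float] = {
    "qwen3-coder-next": 49.0,
    "qwen3.6-27b": 20.0,
    "gpt-oss-20b": 12.0,
    "qwen3.5-9b": 6.0,
    "qwen3.5-0.8b": 0.6,
}


def picks_that_fit(pool_gb: float, reserved_gb: float = 0.0) -> list[str]:
    """Return pick ids whose listed size fits in the pool, in table order.

    reserved_gb models memory already claimed by other services such as
    embed or voice models. Assumption: the listed size is the full cost;
    KV cache and runtime overhead are ignored here.
    """
    available = pool_gb - reserved_gb
    return [name for name, size in PICK_SIZES_GB.items() if size <= available]


print(picks_that_fit(8))    # a sub-8 GB host
print(picks_that_fit(100))  # roughly the Strix Halo pool
```

Under these assumptions an 8 GB budget admits only the two smallest picks, while a ~100 GB pool admits all five.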

qwen3-4b + phi3-mini stay in the featured set (explicit user ask) for
mid-tier Vulkan hosts and the MIT-licensed pick.  llama32-3b stays in
the catalogue but moves below them; the wizard renders in list order.
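
The ordering described above can be sketched as follows. This is an illustration, not the actual `CURATED_MODELS` definition: the ids come from this PR, but the plain-list shape, the `render_order` helper, and the relative order of qwen3-4b and phi3-mini are assumptions.

```python
# Ids come from this PR; the list shape and render_order() are hypothetical.
NEW_PICKS = [
    "qwen3-coder-next",
    "qwen3.6-27b",
    "gpt-oss-20b",
    "qwen3.5-9b",
    "qwen3.5-0.8b",
]
KEPT_FEATURED = ["qwen3-4b", "phi3-mini"]  # relative order is assumed
MOVED_DOWN = ["llama32-3b"]

# New picks first, then the kept picks, then llama32-3b.
CURATED_MODELS = NEW_PICKS + KEPT_FEATURED + MOVED_DOWN


def render_order(models: list[str]) -> list[str]:
    """Return display rows in list order: no sorting, no regrouping."""
    return [f"{position}. {model_id}" for position, model_id in enumerate(models, start=1)]


for row in render_order(CURATED_MODELS):
    print(row)
```

Because the sketch renders strictly in list order, moving an entry within the list is the only change needed to move it in the wizard.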

No test churn: the old ids remain resolvable, so registry rows that
reference them keep working.  Follow-up (separate PR / session) will
restructure the wizard into chat → embed/rerank → voice steps with
FLM-multiplex detection.
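
As a rough illustration of why reordering does not break existing references, the sketch below resolves registry rows against a catalogue by id. The dict shapes, the `model_id` field, and the helper names are all hypothetical; the real registry schema is not shown in this PR.

```python
# Hypothetical shapes: catalogue entries and registry rows are plain dicts here.
CURATED_MODELS: list[dict[str, str]] = [
    {"id": "qwen3-coder-next"},
    {"id": "qwen3.6-27b"},
    {"id": "gpt-oss-20b"},
    {"id": "qwen3.5-9b"},
    {"id": "qwen3.5-0.8b"},
    {"id": "qwen3-4b"},
    {"id": "phi3-mini"},
    {"id": "llama32-3b"},
]


def build_index(catalogue: list[dict[str, str]]) -> dict[str, dict[str, str]]:
    """Map each model id to its catalogue entry; position in the list is irrelevant."""
    return {entry["id"]: entry for entry in catalogue}


def unresolved_rows(
    registry_rows: list[dict[str, str]], catalogue: list[dict[str, str]]
) -> list[dict[str, str]]:
    """Return registry rows whose model_id is missing from the catalogue."""
    index = build_index(catalogue)
    return [row for row in registry_rows if row["model_id"] not in index]


# Rows that reference pre-refresh ids (illustrative data only).
registry_rows = [
    {"name": "chat-default", "model_id": "llama32-3b"},
    {"name": "chat-small", "model_id": "phi3-mini"},
]
print(unresolved_rows(registry_rows, CURATED_MODELS))
```

Lookup is keyed on id rather than list position, so an entry that moves down the list still resolves; only removing an id would surface a row here.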

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

@thinmintdev force-pushed the feat/curated-chat-refresh-2026-05-16 branch from f2947da to 2667664 on May 18, 2026 00:04
@thinmintdev merged commit 07cf56b into main on May 18, 2026
@thinmintdev deleted the feat/curated-chat-refresh-2026-05-16 branch on May 18, 2026 00:04