v3.7.0

@Pavelevich released this 20 Jun 09:20 · 11 commits to main since this release · 4977b92

Published to npm as llm-checker@3.7.0. This release adds a packaged multi-source model registry and wires it into the recommendation flow. The full test suite passes (44/44).

Highlights:

  • Multi-source registry: a packaged snapshot of ~3,259 repos / ~33,736 exact installable/downloadable artifacts from Hugging Face, Ollama, and GPT4All, with per-source install commands (hf download ..., ollama pull ...). New registry-sync, registry-search, and registry-recommend commands.
  • recommend (and the check recommendation card) now sources candidates from the registry through the canonical deterministic scoring core, with --runtime auto plus Ollama / vLLM / MLX / llama.cpp / Transformers targeting. It falls back to the Ollama catalog when the registry is empty or unavailable.
  • Mixture-of-Experts memory sizing fixed: MoE models (e.g. Mixtral-8x7B, Qwen3-397B-A17B) are sized by their total parameter count, because all experts are resident under Ollama / Metal / vLLM. The total is re-derived from the model name, so a stale or under-reported DB value can never make a huge model falsely "fit" small hardware. The packaged seed DB was regenerated so stored MoE totals are correct (Mixtral-8x7B 7B→56B; Qwen3.5-397B-A17B 17B→397B total / 17B active).
  • Packaged src/data/seed/models.db is ~45 MB unpacked (tarball ~6.5 MB).
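
The MoE sizing rule above can be illustrated with a minimal sketch. It is written in Python for brevity and is not llm-checker's actual code: the function name total_params_b, both regular expressions, and the choice to take the larger of the stored and name-derived values are assumptions made for this example.

```python
import re

# Hypothetical name patterns; not taken from llm-checker's source.
# "<experts>x<size>B", e.g. Mixtral-8x7B
EXPERTS_X_SIZE = re.compile(r"(\d+)x(\d+(?:\.\d+)?)b", re.IGNORECASE)
# "<total>B-A<active>B", e.g. Qwen3-397B-A17B
TOTAL_ACTIVE = re.compile(r"(\d+(?:\.\d+)?)b-a(\d+(?:\.\d+)?)b", re.IGNORECASE)


def total_params_b(model_name: str, stored_b: float) -> float:
    """Return the parameter count (in billions) to use for memory sizing.

    If the name encodes an MoE total, that total is used as a floor so an
    under-reported stored value cannot shrink the estimate (an assumption
    of this sketch).
    """
    derived = None
    match = EXPERTS_X_SIZE.search(model_name)
    if match:
        # experts * per-expert size, matching the 8x7B -> 56B figure above
        derived = int(match.group(1)) * float(match.group(2))
    else:
        match = TOTAL_ACTIVE.search(model_name)
        if match:
            # group 1 is the total; group 2 (active params) is ignored for sizing
            derived = float(match.group(1))
    if derived is None:
        # Name carries no MoE hint: trust the stored value
        return stored_b
    return max(stored_b, derived)


print(total_params_b("Mixtral-8x7B", 7.0))
print(total_params_b("Qwen3-397B-A17B", 17.0))
```

With the stale stored values quoted in the notes (7B and 17B), the sketch yields 56B and 397B respectively, while a name with no MoE pattern keeps its stored value.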

Carries everything from 3.6.1 (issue #88 scoring unification, #95 hardware VRAM, #97 MCP hardening, #86/#98 Windows UI). Full notes: docs/reference/changelog.md