v0.9.36: llmfit doctor, honest fit verdicts, and a self-updating model catalog — a stability milestone #679
AlexsJones announced in Announcements
v0.9.36 is out, and it's a bigger release than the version number suggests — this one is all about making llmfit's answers trustworthy and making the project easier to maintain and contribute to. Here's what landed.
🩺 `llmfit doctor`: hardware bugs become one-command reports
Hardware detection has been our biggest source of recurring bugs (multi-GPU boxes, ROCm quirks, Intel Arc, APUs…). Each fix used to depend on whatever partial output a reporter happened to paste, and fixes for one GPU topology sometimes regressed another.
`llmfit doctor` ends that cycle. It captures the raw output of every tool detection relies on (nvidia-smi, rocm-smi, sysfs, lspci, system_profiler, WMI, vulkaninfo, npu-smi) plus what llmfit concluded, as one paste-ready report. The new bug-report template asks for it, and each report can be dropped verbatim into our regression test suite; two real user systems from #638 are already fixtures. Detection bugs now stay fixed.
And the fixes themselves:
📏 Honest fit verdicts
Two long-requested changes to make the score mean what you think it means:
- 262k→14k: native window → what actually fits on your machine after weights and KV cache, with a warning colour when it drops below 4k. Sorting by Ctx uses the achievable number, and it's in the JSON output as `usable_context`. Credit to @MrMarble for the design.
- `--use-case coding`: parameter count no longer dominates. The table ships in the repo and corrections are welcome as PRs. Thanks @mvanhorn for the proposal this is built on.
📦 The model catalog keeps itself fresh
Put a `custom_models.json` into llmfit's data directory (or point `LLMFIT_CUSTOM_MODELS` at a file) to add or override any model: no rebuild, no release, and it works with brew/scoop installs. See the "Adding your own models locally" section in the README.
🤝 Contribution pipeline
We also cleared the review backlog — long-stalled PRs have been merged or closed with proper explanations, and several ideas from older PRs (Windows paths, sharded downloads, Ollama pulls, MoE scoring) had already shaped what's in main; that credit is now recorded on each PR. If your PR sat quiet for a while: sorry, and it won't be the pattern going forward — the new templates, weekly automation, and regression-fixture flow are exactly about keeping the loop fast.
Get it
Full changelog: https://github.com/AlexsJones/llmfit/releases/tag/v0.9.36
If llmfit misdetects your hardware, run `llmfit doctor` and open an issue with the output; it'll likely become a regression test within days. That's the stability loop we're most excited about. 🚀
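If you'd rather attach the report as a file than copy it from a terminal, here is a minimal Python sketch of one way to capture it. It assumes `llmfit` is on your PATH and that `llmfit doctor` writes its report to standard output, which this announcement doesn't spell out; `save_report` and the output file name are hypothetical helpers for illustration, not part of llmfit.

```python
import subprocess
import sys
from pathlib import Path


def save_report(command, out_path):
    """Run `command`, write its stdout (plus any stderr) to `out_path`.

    Returns the command's exit code. Raises FileNotFoundError if the
    executable cannot be found.
    """
    result = subprocess.run(command, capture_output=True, text=True)
    report = result.stdout
    if result.stderr:
        # Keep stderr too, since tool warnings can matter for a bug report.
        report += "\n--- stderr ---\n" + result.stderr
    Path(out_path).write_text(report, encoding="utf-8")
    return result.returncode
```

Calling `save_report(["llmfit", "doctor"], "llmfit-doctor-report.txt")` would then leave a file you can paste from or attach to the issue. Plain shell redirection does the same job; the helper is only convenient if you're already scripting around llmfit.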