0.29.0: Sideloaded models
This release opens LlamaBarn up beyond the curated catalog: any GGUF model in your Hugging Face cache now shows up in the installed list with the same one-click load, run, and delete as curated models, with context tiers sized to your device automatically.
- Detect and support sideloaded GGUF models from the Hugging Face cache
- Match llama-server format for sideloaded model IDs so IDs are portable
- Default every model to the 4K context tier for a smaller footprint
- Show the model's native max context alongside the device-fit tier
- Show every size in catalog family drawers, with installed ones badged
- Keep deprecated families like Qwen3 visible for already-installed models
- Add a caption under Launch at login explaining idle resource use
- Show friendlier HTTP download errors
- Fix Gemma 4 download URLs after Hugging Face repo file renames
- Update llama.cpp to b8797