Skip to content

0.29.0: Sideloaded models

Choose a tag to compare

@erusev erusev released this 15 Apr 08:53
· 137 commits to main since this release

This release opens LlamaBarn up beyond the curated catalog: any GGUF model in your Hugging Face cache now shows up in the installed list with the same one-click load, run, and delete as curated models, with context tiers sized to your device automatically.

  • Detect and support sideloaded GGUF models from the Hugging Face cache
  • Match llama-server format for sideloaded model IDs so IDs are portable
  • Default every model to the 4K context tier for a smaller footprint
  • Show the model's native max context alongside the device-fit tier
  • Show every size in catalog family drawers, with installed ones badged
  • Keep deprecated families like Qwen3 visible for already-installed models
  • Add a caption under Launch at login explaining idle resource use
  • Show friendlier HTTP download errors
  • Fix Gemma 4 download URLs after Hugging Face repo file renames
  • Update llama.cpp to b8797