Release 0.29.0: Sideloaded models · ggml-org/Llama-macOS

This release opens LlamaBarn up beyond the curated catalog: any GGUF model in your Hugging Face cache now shows up in the installed list with the same one-click load, run, and delete as curated models, with context tiers sized to your device automatically.

Detect and support sideloaded GGUF models from the Hugging Face cache
Match llama-server format for sideloaded model IDs so IDs are portable
Default every model to the 4K context tier for a smaller footprint
Show the model's native max context alongside the device-fit tier
Show every size in catalog family drawers, with installed ones badged
Keep deprecated families like Qwen3 visible for already-installed models
Add a caption under Launch at login explaining idle resource use
Show friendlier HTTP download errors
Fix Gemma 4 download URLs after Hugging Face repo file renames
Update llama.cpp to b8797
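
For readers curious how sideloaded-model detection could work, here is a minimal sketch in Python. It is not LlamaBarn's actual implementation. It assumes the standard Hugging Face hub cache layout, where each repo lives in a folder named `models--<org>--<name>` with files under `snapshots/<revision>/`, and the default cache path `~/.cache/huggingface/hub`. The function name `find_gguf_models` is hypothetical, and the `org/name` IDs it builds are only illustrative; they are not claimed to match the llama-server ID format mentioned above.

```python
from pathlib import Path


def find_gguf_models(cache_dir: Path) -> dict[str, list[Path]]:
    """Map each cached repo id (e.g. "org/name") to the GGUF files in its snapshots.

    Hypothetical helper for illustration; assumes the standard hub cache layout.
    """
    found: dict[str, list[Path]] = {}
    if not cache_dir.is_dir():
        return found
    for repo_dir in sorted(cache_dir.glob("models--*")):
        # Folder names look like "models--<org>--<name>" (or "models--<name>").
        # Splitting at most twice keeps any later "--" inside the repo name.
        repo_id = "/".join(repo_dir.name.split("--", 2)[1:])
        # Search every snapshot revision, including nested subfolders.
        files = sorted(repo_dir.glob("snapshots/*/**/*.gguf"))
        if files:
            found[repo_id] = files
    return found


if __name__ == "__main__":
    # Assumed default cache location; it can be overridden in a real setup.
    default_cache = Path.home() / ".cache" / "huggingface" / "hub"
    for repo_id, files in find_gguf_models(default_cache).items():
        print(repo_id, [f.name for f in files])
```

Repos with no `.gguf` files are skipped, so only models a GGUF runtime could load are listed.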
