Headline
- ARM64 (aarch64) Linux packages now ship for Debian, Fedora, the portable embeddable archive, and Docker.
- Users can set custom system prompts for Lemonade omni models, as demonstrated in the new RPG-HaloTales-V1 narrative experience model.
- A new OTLP telemetry subsystem exports traces using OpenInference and OpenTelemetry GenAI conventions, driven by a lemonade telemetry command.
- The vLLM backend now works with Claude Code through the Anthropic Messages API, and adds Qwen3.6 and GLM-4.7-Flash models.
Breaking Changes
- The Linux system tray is now disabled by default (REQUIRE_LINUX_TRAY=OFF), so Linux server packages ship headless; rebuild with the tray option enabled to restore it.
Lemonade Server
| Operating System | Downloads |
|---|---|
| Windows | lemonade.msi |
| Ubuntu 24.04+ | Launchpad PPA |
| Debian 13 (x86_64) | lemonade-server_10.9.0-debian13_amd64.deb |
| Debian 13 (ARM64) | lemonade-server_10.9.0-debian13_arm64.deb |
| Fedora 43 (x86_64) | lemonade-server-10.9.0-fc43.x86_64.rpm |
| Fedora 43 (ARM64) | lemonade-server-10.9.0-fc43.aarch64.rpm |
| Fedora 44 (x86_64) | lemonade-server-10.9.0-fc44.x86_64.rpm |
| Fedora 44 (ARM64) | lemonade-server-10.9.0-fc44.aarch64.rpm |
| macOS | Lemonade-10.9.0-Darwin.pkg |
Other platforms? See our Installation Options for Docker, Snap, Arch, Debian, and more.
Embeddable Lemonade
Portable binaries for bundling into your own installer. Run lemond ./ as a subprocess.
| Platform | Download |
|---|---|
| Ubuntu x64 | lemonade-embeddable-10.9.0-ubuntu-x64.tar.gz |
| Ubuntu arm64 | lemonade-embeddable-10.9.0-ubuntu-arm64.tar.gz |
| Windows x64 | lemonade-embeddable-10.9.0-windows-x64.zip |
| macOS arm64 | lemonade-embeddable-10.9.0-macos-arm64.tar.gz |
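As a rough illustration of the "run lemond ./ as a subprocess" note above, a host application could launch the embedded binary as sketched below. This is only a sketch under assumptions: the helper names build_lemond_command and start_lemond are hypothetical, the binary is assumed to be named lemond and to sit at the root of the unpacked archive (on Windows it would likely carry an .exe suffix), and the ./ argument is passed through exactly as written in these notes without assuming what it configures.

```python
import subprocess
from pathlib import Path


def build_lemond_command(install_dir: str, arg: str = "./") -> list[str]:
    """Return the argv for launching lemond from an unpacked embeddable archive.

    Hypothetical helper: assumes the binary is called "lemond" and lives at
    the root of install_dir. The "./" argument is taken from the release note.
    """
    # Resolve to an absolute path so the launch still works after cwd changes.
    binary = Path(install_dir).resolve() / "lemond"
    return [str(binary), arg]


def start_lemond(install_dir: str) -> subprocess.Popen:
    """Start lemond as a child process with install_dir as its working directory."""
    # cwd is set so that the relative "./" argument refers to install_dir.
    return subprocess.Popen(build_lemond_command(install_dir), cwd=install_dir)
```

A caller would keep the returned Popen handle and call terminate() followed by wait() on it when the host application shuts down, so the server does not outlive its parent.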
What's Changed
Thanks @GabrielReusRodriguez, @SlawomirNowaczyk, @abn, @bitgamma, @blackdeathdrow, @bong-water-water-bong, @ckuethe, @clemperorpenguin, @eddierichter-amd, @fl0rianr, @gchriswill, @github-actions, @ianbmacdonald, @jeremyfowers, @kenvandine, @lucifer-vali, @ramkrishna2910, @sawansri, @siavashhub, @superm1, @volen-silo for your awesome contributions to this release!
- fix(packaging): don't auto-enable the per-user lemond.service (Closes #2125) by @ianbmacdonald in #2354
- fix(config): accept flat backend keys like vllm_args in config set (#1824) by @ramkrishna2910 in #2394
- test/ci: harden CLI endpoint suites against shared-server flakes by @fl0rianr in #2365
- Lemonade cli list models with size in Gb by @GabrielReusRodriguez in #2329
- Handle separate sd-cpp CUDA release pins by @fl0rianr in #2369
- fix: refuse to start a second lemond when the port is already in use (#2255) by @siavashhub in #2258
- Fix sd-cpp:metal install (resolve asset filename wildcard) by @jeremyfowers in #2402
- Allow HTTP custom backends with warnings by @sawansri in #2370
- Add comment and PR description guidelines to AGENTS.md by @superm1 in #2409
- [Router] Interfaces + fixtures stub (foundation) by @ramkrishna2910 in #2408
- fix: Prevent the Rosetta prompt on macOS installation by @clemperorpenguin in #2297
- fix(cli): resolve option inheritance bug in launch help options by @abn in #2396
- Move omni collection system prompt to per-model JSON by @ramkrishna2910 in #2395
- fix: ROCm Support for RDNA2 Device Models by @gchriswill in #2410
- feat: expand system stats with storage infos by @fl0rianr in #2436
- test: fix whisper realtime macOS VAD flake by @abn in #2442
- test: fix stale docstring and add VAD rationale comment by @bong-water-water-bong in #2447
- feat(telemetry): add unified OTLP telemetry with OpenInference and OpenTelemetry GenAI support by @abn in #2330
- Router/2378 2379 evaluator registry by @eddierichter-amd in #2440
- test/ci: follow up on hosted Ollama tool-calling timeouts by @fl0rianr in #2399
- website: Overhaul the front page by @jeremyfowers in #2427
- refactor(backends): self-describing WrappedServer backends (#2287) by @jeremyfowers in #2320
- Fix startup self-deadlock with a thread-local recursion guard (minimal alt to #2443) by @jeremyfowers in #2488
- fix(launch): resolve npm .cmd launcher on Windows for pi and codex by @jeremyfowers in #2489
- Fix three broken doc links found by zensical by @lucifer-vali in #2486
- Enable vLLM + Messages API and per model family vLLM args by @sawansri in #2143
- [Router] 2381 unified interface for semantic_similarity and model classifier by @SlawomirNowaczyk in #2483
- Fix flaky Ollama tool-calling CI by disabling default thinking for tool requests by @fl0rianr in #2485
- Merge release-v10.8.1 back to main for proper release tag handling by @kenvandine in #2493
- fix(backends): fallback to installed backend when offline by @fl0rianr in #2393
- Fix (#2412) : Added offline check before pulling a model on server.cpp by @GabrielReusRodriguez in #2434
- Add arm64 builds to all Linux release packages and PPA by @kenvandine in #2430
- fix(rocm): resolve backend assets to published gfx family names (#2415) by @superm1 in #2490
- Add lemonade and lemond options to bug report template by @ckuethe in #2444
- add per-architecture options by @bitgamma in #2496
- Publish rolling :edge and immutable :sha- container images on merge to main by @siavashhub in #2397
- Detect externally-installed ROCm before downloading TheRock by @volen-silo in #2498
- docs(release): add step to reconcile release tag into main by @jeremyfowers in #2504
- fix(server): recover stalled HF model downloads by @fl0rianr in #2421
- [Router] Deterministic conditions: keywords / regex / min_chars / has_* / metadata (M4, #2380) by @ramkrishna2910 in #2491
- fix(packaging): strip lemonade-tray from RPM by @lucifer-vali in #2484
- Update stable-diffusion.cpp to master-721-8caa3f9 and CUDA master-726-db48014 by @github-actions[bot] in #2477
- feat: detect HuggingFace model updates and show Update button in web UI by @blackdeathdrow in #2243
- docs: list ARM64 Linux release artifacts left out by #2430 by @jeremyfowers in #2505
- chore(release): bump version to v10.9.0 by @jeremyfowers in #2506
- feat(models): add RPG-HaloTales-V1 omni collection by @jeremyfowers in #2509
- Fix: Update CUDA version for llama.cpp by @fl0rianr in #2507
New Contributors
- @gchriswill made their first contribution in #2410
- @bong-water-water-bong made their first contribution in #2447
- @SlawomirNowaczyk made their first contribution in #2483
- @volen-silo made their first contribution in #2498
Full Changelog: v10.8.1...v10.9.0
Windows installers are signed. Free code signing provided by SignPath.io, certificate by SignPath Foundation. See our Code Signing Policy.