LLM capacity planner + fleet ledger + desktop inference runtime.
Status: pre-alpha, Phase 0 bootstrap. See PLAN.md for the implementation roadmap.
hwLedger is an Apache-2.0 desktop app + agent/server pair that:
- Plans VRAM and throughput for any HF / GGUF / MLX / Ollama model, correctly handling dense, MoE, MLA, GQA, sliding-window, SSM/Mamba, and hybrid-attention architectures — with a slider UX over a live per-layer breakdown.
- Reconciles predictions against live telemetry from MLX, mistral.rs, llama.cpp, vLLM, or TGI.
- Runs inference locally on Apple Silicon via a forked oMlx sidecar with SSD-paged KV cache.
- Ledgers a heterogeneous fleet — local NVIDIA/AMD boxes, Apple Silicon laptops, cheap cloud rentals (Vast.ai, RunPod, Lambda) — with a shared event-sourced audit log, dispatch planner, and spot-price-aware cost model.
- Ships as per-OS native GUIs (SwiftUI / WinUI 3 / Qt 6 + Slint) over a shared Rust FFI core.
A hobbyist-sized fleet with enterprise bones.
Every existing public VRAM calculator (HF Accelerate, can-it-run-llm, LM Studio's gauge) gets MoE and MLA wrong — they under-count KV cache and over-count MoE throughput. hwLedger's math core is architecture-keyed: it dispatches per AttentionKind (MHA / GQA / MQA / MLA / Sliding / SSM / Hybrid / Sink) and treats resident-vs-active parameters separately for MoE. See PLAN.md §5.
- Core: Rust workspace (
hwledger-core,-arch,-ingest,-probe,-inference,-ledger,-fleet-proto,-agent,-server,-cli,-ffi) - Sidecar:
sidecars/omlx-fork/— fat fork of jundot/omlx, Apache-2.0 - Native apps:
apps/macos/(SwiftUI + UniFFI + XCFramework),apps/windows/(WinUI 3 + .NET 9 + csbindgen),apps/linux-qt/(Qt 6 + cxx-qt + QML),apps/linux-slint/(Rust-native) - Fleet wire: Axum + rustls mTLS for agents; russh + deadpool for SSH agentless; reqwest for Vast/RunPod/Lambda/Modal;
tailscale status --jsonfor tailnet discovery
See the component diagram in PLAN.md §4.1.
One-liner to build FFI + launch server, docs-site, and Streamlit:
cargo run -p hwledger-dev-harness -- upSee docs-site/getting-started/dev-setup.md for ports, log locations, and troubleshooting (FFI auto-build, Swift "engine missing" sheet, streamlit hot-reload).
- PLAN.md — phased WBS + DAG + risks + reuse opportunities
- PRD.md — product requirements (forthcoming)
- ADR.md — index of architecture decisions (see
docs/adr/) - CHARTER.md — scope + principles (forthcoming)
- AGENTS.md — AI-agent operating notes (forthcoming)
- docs/research/ — archived Haiku research briefs (oMlx, MLX IPC, inference engines, KV cache formulas, config ingestion, GPU telemetry, Swift/WinUI/Qt FFI, fleet wire, competitor survey)
| Phase | Status |
|---|---|
| P0 Foundation | in progress |
| P1 Math core | planned |
| P2 Ingestion + probe | planned |
| P3 macOS GUI MVP | planned |
| P4 Inference | planned (macOS only in MVP) |
| P5 Fleet | planned |
| P6 Windows GUI | deferred |
| P7 Linux GUI | deferred |
| WP21 macOS Release | code complete (waiting notarization creds) |
Tracked in AgilePlus: feature hwledger-v1-macos-mvp (see agileplus status).
WP21 deliverables (macOS distribution):
- Codesigning infrastructure: READY (Developer ID cert installed, entitlements defined, scripts complete)
- GitHub Actions release workflow: READY (release.yml deployed)
- DMG + notarization flow: READY (scripts deployed, awaiting App Store Connect credentials)
- Sparkle integration: READY (Package.swift updated, updater wired, key generation documented)
- Documentation: READY (docs/reports/WP21-APPLE-DEV-SECRETS.md with step-by-step setup)
Apache-2.0. See LICENSE.