Skip to content

KooshaPari/hwLedger

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

251 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

hwLedger

LLM capacity planner + fleet ledger + desktop inference runtime.

Status: pre-alpha, Phase 0 bootstrap. See PLAN.md for the implementation roadmap.

hwLedger is an Apache-2.0 desktop app + agent/server pair that:

  1. Plans VRAM and throughput for any HF / GGUF / MLX / Ollama model, correctly handling dense, MoE, MLA, GQA, sliding-window, SSM/Mamba, and hybrid-attention architectures — with a slider UX over a live per-layer breakdown.
  2. Reconciles predictions against live telemetry from MLX, mistral.rs, llama.cpp, vLLM, or TGI.
  3. Runs inference locally on Apple Silicon via a forked oMlx sidecar with SSD-paged KV cache.
  4. Ledgers a heterogeneous fleet — local NVIDIA/AMD boxes, Apple Silicon laptops, cheap cloud rentals (Vast.ai, RunPod, Lambda) — with a shared event-sourced audit log, dispatch planner, and spot-price-aware cost model.
  5. Ships as per-OS native GUIs (SwiftUI / WinUI 3 / Qt 6 + Slint) over a shared Rust FFI core.

A hobbyist-sized fleet with enterprise bones.

Why

Every existing public VRAM calculator (HF Accelerate, can-it-run-llm, LM Studio's gauge) gets MoE and MLA wrong — they under-count KV cache and over-count MoE throughput. hwLedger's math core is architecture-keyed: it dispatches per AttentionKind (MHA / GQA / MQA / MLA / Sliding / SSM / Hybrid / Sink) and treats resident-vs-active parameters separately for MoE. See PLAN.md §5.

Architecture

  • Core: Rust workspace (hwledger-core, -arch, -ingest, -probe, -inference, -ledger, -fleet-proto, -agent, -server, -cli, -ffi)
  • Sidecar: sidecars/omlx-fork/ — fat fork of jundot/omlx, Apache-2.0
  • Native apps: apps/macos/ (SwiftUI + UniFFI + XCFramework), apps/windows/ (WinUI 3 + .NET 9 + csbindgen), apps/linux-qt/ (Qt 6 + cxx-qt + QML), apps/linux-slint/ (Rust-native)
  • Fleet wire: Axum + rustls mTLS for agents; russh + deadpool for SSH agentless; reqwest for Vast/RunPod/Lambda/Modal; tailscale status --json for tailnet discovery

See the component diagram in PLAN.md §4.1.

Dev setup

One-liner to build FFI + launch server, docs-site, and Streamlit:

cargo run -p hwledger-dev-harness -- up

See docs-site/getting-started/dev-setup.md for ports, log locations, and troubleshooting (FFI auto-build, Swift "engine missing" sheet, streamlit hot-reload).

Documentation

  • PLAN.md — phased WBS + DAG + risks + reuse opportunities
  • PRD.md — product requirements (forthcoming)
  • ADR.md — index of architecture decisions (see docs/adr/)
  • CHARTER.md — scope + principles (forthcoming)
  • AGENTS.md — AI-agent operating notes (forthcoming)
  • docs/research/ — archived Haiku research briefs (oMlx, MLX IPC, inference engines, KV cache formulas, config ingestion, GPU telemetry, Swift/WinUI/Qt FFI, fleet wire, competitor survey)

Development status

Phase Status
P0 Foundation in progress
P1 Math core planned
P2 Ingestion + probe planned
P3 macOS GUI MVP planned
P4 Inference planned (macOS only in MVP)
P5 Fleet planned
P6 Windows GUI deferred
P7 Linux GUI deferred
WP21 macOS Release code complete (waiting notarization creds)

Tracked in AgilePlus: feature hwledger-v1-macos-mvp (see agileplus status).

WP21 deliverables (macOS distribution):

  • Codesigning infrastructure: READY (Developer ID cert installed, entitlements defined, scripts complete)
  • GitHub Actions release workflow: READY (release.yml deployed)
  • DMG + notarization flow: READY (scripts deployed, awaiting App Store Connect credentials)
  • Sparkle integration: READY (Package.swift updated, updater wired, key generation documented)
  • Documentation: READY (docs/reports/WP21-APPLE-DEV-SECRETS.md with step-by-step setup)

License

Apache-2.0. See LICENSE.

About

LLM capacity planner, fleet ledger, and desktop inference runtime

Topics

Resources

License

Code of conduct

Contributing

Security policy

Stars

Watchers

Forks

Packages

 
 
 

Contributors