docs(lemonade): ADR-0008/0009/0010 + adoption plan + spike #2 + grill outputs #154

Merged: thinmintdev merged 1 commit into main from docs/lemonade-adr-plan-2026-05-22 on May 23, 2026

@thinmintdev
Contributor

Summary

Post-grill source of truth for v0.2 Lemonade migration. Supersedes ADR-0006 + ADR-0007; locks the 22-PR implementation sequence.

Nothing here ships code — pure docs PR. Lands the contract before PRs 3–22.

Key documents

  • ADR-0008 (0008-lemonade-adoption.md) — Lemonade as unified runtime (Path 4). Single `lemond` per host, all modalities via Lemonade, per-type LRU concurrency, mandatory `--threads N`, FLM trio for NPU, OmniRouter client-side, no `extra.*` namespace, no v0.1.x migration. §3 explicitly rescinds ADR-0007's preload validation — per-type LRU + nuclear-evict's not-found exemption list reduce the original hazard.
  • ADR-0009 — FLM trio NPU packing (chat + asr + embed in one AMDXDNA HW context via `--asr 1 --embed 1`).
  • ADR-0010 — Bundle picker first-run UX, no default stack at install.
  • ADR-0006 / ADR-0007 — Status → Superseded by ADR-0008.
  • `lemonade-adoption-plan-2026-05-22.md` — the 13-section, 22-PR contract. Every implementation PR (3–22) cites a §.
  • `lemonade-spike-2-findings-2026-05-22.md` — empirical Phase A/B/C, `/diagnose` chain that uncovered `--threads` deadlock, FLM trio verification.
  • `lemonade-spike-2-runbook-2026-05-22.md` — repro recipe; reusable post-v0.2 regression test.
  • `lemonade-research-2026-05-22/{researcher,architect,api,ui}.md` — 4-agent design pass.
  • `CONTEXT.md` — glossary additions from grill (slot type, group, FLM trio, bundle tiers, model namespace, fresh install, v0.1.x → v0.2 upgrade).
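The "per-type LRU concurrency" named in ADR-0008 is not spelled out in this PR description, so the following is only a minimal sketch of one plausible reading: each slot type keeps its own recency order, and loading a model evicts only within that type. The class name `PerTypeLRU`, the capacity map, and the slot-type strings are all hypothetical and are not taken from Lemonade or the ADRs.

```python
from collections import OrderedDict
from typing import Dict, List


class PerTypeLRU:
    """Hypothetical per-type LRU: one independent recency list per slot type."""

    def __init__(self, capacity_per_type: Dict[str, int]) -> None:
        # Assumption: every slot type has a fixed maximum of loaded models.
        self._capacity = dict(capacity_per_type)
        self._loaded: Dict[str, "OrderedDict[str, None]"] = {
            slot_type: OrderedDict() for slot_type in capacity_per_type
        }

    def touch(self, slot_type: str, model: str) -> List[str]:
        """Mark `model` as most recently used; return models evicted to make room."""
        models = self._loaded[slot_type]
        if model in models:
            models.move_to_end(model)  # already loaded: refresh recency only
            return []
        models[model] = None
        evicted: List[str] = []
        while len(models) > self._capacity[slot_type]:
            oldest, _ = models.popitem(last=False)  # drop least recently used
            evicted.append(oldest)
        return evicted

    def loaded(self, slot_type: str) -> List[str]:
        """Models currently loaded for `slot_type`, oldest first."""
        return list(self._loaded[slot_type])


if __name__ == "__main__":
    cache = PerTypeLRU({"chat": 1, "embed": 1})
    cache.touch("chat", "chat-a")
    cache.touch("embed", "embed-a")
    # Loading a second chat model evicts only within the "chat" type.
    print(cache.touch("chat", "chat-b"))  # ['chat-a']
    print(cache.loaded("embed"))          # ['embed-a']
```

Under this reading, pressure in one modality never unloads a model of another type. Whether that matches the behavior ADR-0008 relies on is something the ADR itself would need to confirm.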

Heads-up — parallel-session collision

PRs #143 (SlotConfig.device + schema_v2) and #144 (preload validation + idle-unload, refs ADR-0007) landed on `main` before this doc PR was visible. ADR-0008 §3 specifically removes the preload validation work that #144 implements. The decision on whether to revert #144, scope it down, or keep it defensively will be raised in a follow-up PR or issue once this docs PR lands.

Test plan

  • All files render in GitHub MD viewer
  • All cross-references resolve (`docs/internal/lemonade-adoption-plan-2026-05-22.md` §N, CONTEXT.md entries)
  • ADR-0006 / 0007 Status fields show "Superseded by ADR-0008"
  • CI passes (docs-only PR — typecheck + tests should remain green)
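The Status-field item in the test plan could be checked mechanically. The sketch below assumes each ADR file carries a single line beginning with `Status` (for example `Status: Superseded by ADR-0008` or `**Status:** ...`); that layout is an assumption, since the ADR files are not shown here, and the helper names `status_of` and `is_superseded_by` are hypothetical.

```python
from typing import Optional


def status_of(adr_text: str) -> Optional[str]:
    """Return the value of the first 'Status' line, or None if there is none."""
    for line in adr_text.splitlines():
        # Tolerate markdown decoration such as '## Status:' or '**Status:**'.
        stripped = line.strip().lstrip("#*- ").strip()
        if stripped.lower().startswith("status"):
            value = stripped.partition(":")[2].strip(" *")
            return value or None
    return None


def is_superseded_by(adr_text: str, adr_id: str) -> bool:
    """True when the Status line says 'Superseded by <adr_id>' (case-insensitive)."""
    status = status_of(adr_text)
    return status is not None and f"superseded by {adr_id}".lower() in status.lower()


if __name__ == "__main__":
    sample = "# ADR-0007\n\n**Status:** Superseded by ADR-0008\n"
    print(is_superseded_by(sample, "ADR-0008"))  # True
```

In practice one would read the ADR-0006 and ADR-0007 files from the repository and pass their contents to `is_superseded_by`; the file paths are left out here because this PR description does not give them.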

🤖 Generated with Claude Code

… outputs

Post-grill source of truth for v0.2 Lemonade migration. Supersedes
ADR-0006 + ADR-0007; locks the 22-PR implementation sequence.

- ADR-0008: Lemonade adoption as unified inference runtime (Path 4).
  Rescinds ADR-0007's preload validation (per-type LRU + nuclear-evict
  exemption list make it unnecessary). Locks --threads N mandatory.
- ADR-0009: FLM trio NPU packing (chat + asr + embed in one AMDXDNA HW
  context via --asr 1 --embed 1).
- ADR-0010: bundle picker first-run UX (no default stack).
- ADR-0006/0007: Status -> Superseded by ADR-0008.
- CONTEXT.md: glossary additions from grill (slot type, group, FLM trio,
  bundle tiers, model namespace, fresh install, v0.1.x -> v0.2 upgrade).
- lemonade-adoption-plan-2026-05-22.md: 13 sections, 22-PR roadmap,
  service topology, slot model, NPU+FLM trio, model layout, OmniRouter
  spec, bundle picker, v0.1.x->v0.2 clean break, slot architecture
  migration, implementation sequence, operational caveats.
- lemonade-spike-2-findings + runbook: empirical Phase A/B/C results,
  /diagnose chain that uncovered --threads deadlock, FLM trio
  verification.
- lemonade-research-2026-05-22/{researcher,architect,api,ui}.md:
  4-agent design pass with deep code references.

Implementation contract: docs/internal/lemonade-adoption-plan-2026-05-22.md.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
@thinmintdev merged commit a3abaaf into main on May 23, 2026
4 checks passed
@thinmintdev deleted the docs/lemonade-adr-plan-2026-05-22 branch on May 23, 2026 01:16