-
Notifications
You must be signed in to change notification settings - Fork 0
state
Phase: 2 (remediation in progress)
Next finding to address: F12 (ProjectLauncher allowlist) — bundled into Wave-1 implementation swarm
Last finding closed: F10 (executor-safety allowlist on LLM-emitted actions)
Last commit: audit F10: validators + runtime filter for LLM-emitted action shapes
Eight commits landed, in order. The data-loss + security + observability tiers are closed; the correctness/cost/maintainability tiers are still open. See audit/execution-log.md for full per-commit detail.
| Commit | Finding | Tier | Test cases |
|---|---|---|---|
| 6eca4c1 | F19 | observability foundation | 8 |
| 3268a04 | F07 | security | 8 |
| 65d110a | F08+F07b | security | 7 |
| 7169750 | F02 | data-loss | 5 |
| f0b95b0 | F03 | data-loss | 4 |
| 5828fa7 | F01 | data-loss | 3 |
| 2a02194 | F11 | security | 3 |
| 2a02194... actually next: 8th commit | F09 | security | 9 |
Remaining priority on the Ledger (run-order):
F10 → F12 → F38+F39 → F53
→ F20 → F30 → F25 → F26 → F18 → F27
→ F06 → F16 → F17 → F22 → F34
→ (then maintainability cohort)
→ (then F40 + F19b + F07c/F08b — TS test infra unblocks the extension half)
Outstanding new findings surfaced during Phase 2 (not yet closed):
- F07b — daemon-side closed in F08; extension wiring split as F08b.
- F08b — extension fetches token via native host (deferred, needs F40).
- F19b — correlation IDs in browser extension (deferred, needs F40). Resume protocol on fresh invocation:
- Read
audit/findings.md— authoritative Ledger. - Read
audit/execution-log.md— what has already shipped. - Read this file — what is next.
- Pick up from the finding ID below; do not re-diagnose.
Locked sequence — see findings.md §VIII for rationale.
F19 → F07 → F08 → F02 → F03 → F38 → F39 → F53
→ F01 → F09 → F10 → F11 → F12
→ F20 → F30 → F25 → F26 → F18 → F27
→ F06 → F16 → F17 → F22 → F34
→ (then maintainability tier — F31, F32, F33, F35, F36, F46, F47, F48, F49, F50, F51, F52, F54, F55, F56)
→ (cross-cutting tier — F40, F41, F42–F45, requires Debt-1 design)
- Debt-1 (shared schema source of truth) — generator + codegen.
- Debt-2 (capability-token trust model) — supersedes F07/F08 piecemeal patches; for Phase 2 we ship the localhost token gate as a tactical fix, not the full client-bootstrap rework.
Next: F19 — end-to-end correlation IDs. Adds a request_id to WSMessage, threads it through controller.py, routes.py, websocket_server.py, anthropic_planner.py, and back into the popup/overlay error surfaces. Test: a single user click on the popup produces log lines from popup.tsx, background.ts, native_host.py, routes.py, state_engine, and anthropic_planner.py — all sharing one request_id.