docs(phase3): blueprint — drop-a-project autonomy (Steps 8/9/10) by AVADSA25 · Pull Request #15 · AVADSA25/codec

AVADSA25 · 2026-05-03T08:47:59Z

Summary

Phase 3 design doc — drop-a-project autonomy with computer use, plan-and-grant safety model, proactive messaging. Three steps, sequential, sign-off-gated, kill-switched. Mirrors Phase 1+2 cadence.

This is the blueprint (analogous to PHASE2-BLUEPRINT.md). No code, no implementation. Sets the design before writing implementation plans for each step.

Design produced via brainstorming session 2026-05-03

15 questions resolved with the user. Notable choices captured in §8:

Q1 Qwen-3.6 ALWAYS (no Gemini Flash fallback — user explicit "local always best")
Q4 Global allowlist tier (~/.codec/agent_global_grants.json) for repeat permissions across agents
Q5 Resume from last atomic checkpoint after PM2 restart (worst case: one op re-fires; safe via idempotency + strict-consent for destructive)
Q9 PWA-only notifications via existing notifications.json (no macOS banner / iMessage / Telegram in v1)
Q12 Proactive intelligence overlay deferred to Phase 3.5 (Step 10 already substantial)

Step breakdown

Step	Scope	New module / service
8	Plan + Permission Contract	`codec_agent_plan.py`, PWA approval flow, global allowlist tier, plan-hash tamper detection
9	Background Execution + Audit Envelope	`codec_agent_runner.py` + `codec-agent-runner` PM2 service, permission gate, 3-concurrent cap, resume-after-restart, heartbeat monitoring
10	Proactive Messaging + Project Mode UI	Agent → user message types, chat thread integration, mode dropdown, status pills, Projects sidebar tab, chat auto-escalation w/ silence

Reuses from Phase 1 + 2 (no rebuilds)

Step 1 audit envelope · Step 2 plugin hooks · Step 3 ask_user + strict-consent + step budget · Step 4 self-recursion guard · Step 5 observer · Step 7 shift_report. Each Phase 1+2 component is leveraged; nothing rebuilt.

Estimated Phase 3 contribution

1 new PM2 service (codec-agent-runner) + 1 modified (codec-heartbeat)
~14 new audit events
+85 to +100 passing tests
3 PRs (one per step) + closeout doc

Each step deployable independently:

Step 8 alone: draft + approve plans, no execution. Useful for refining planning UX.
Step 8+9: end-to-end execution, results in audit log + notifications.json (no UI).
Step 8+9+10: full feature.

Test plan

No code changes — docs-only PR, no test impact
User reviews PHASE3-BLUEPRINT.md, requests changes if needed
After approval: invoke superpowers:writing-plans to generate per-step implementation plans

Out of scope (explicitly deferred)

Proactive intelligence overlay → Phase 3.5
OS-level sandbox (chroot/AppArmor) → Phase 4 only if needed
Multi-channel notifications (macOS / iMessage / Telegram) → Phase 4 if requested
Memory / context system overhaul → user said current is good enough
Marketplace UX overhaul → user said already in place
Total LLM cap per agent → per-checkpoint budget sufficient for v1
Full Projects sidebar overhaul → small tab is enough; bigger UX deferred

🤖 Generated with Claude Code

Phase 3 substrate: long-running autonomous agents with computer use, plan-and-grant safety model, and proactive messaging. Mirrors Phase 1+2 sequential-step + sign-off-gated + kill-switched cadence. Anchor example (user-supplied): "Build me a Telegram bot that monitors Marbella property listings under €X and pings me on new ones." 3 steps: - Step 8: Plan + Permission Contract — codec_agent_plan.py + PWA approval flow + global allowlist tier + plan-hash tamper detection - Step 9: Background Execution — codec-agent-runner PM2 daemon + per-checkpoint exec loop + permission gate + Qwen-3.6 driver + resume-after-restart + 3-concurrent cap + heartbeat monitoring - Step 10: Proactive Messaging + Project Mode UI — agent-to-user messages + chat thread integration + mode dropdown + status pills + small Projects sidebar tab + chat auto-escalation w/ silence 15 Q&A resolved during 2026-05-03 brainstorming session — all captured in §8 of the blueprint. Notable choices: - Q1: Qwen-3.6 ALWAYS (no cloud fallback, user explicit) - Q4: Global allowlist tier across agents - Q5: Resume from last atomic checkpoint (idempotent re-fire safe) - Q9: PWA-only notifications (no macOS/iMessage/Telegram in v1) - Q12: Proactive intelligence overlay deferred to Phase 3.5 Reuses Phase 1+2 substrate: audit envelope (Step 1), plugin hooks (Step 2), ask_user + strict-consent + step budget (Step 3), self-recursion guard (Step 4), observer (Step 5), shift_report (Step 7). Net Phase 3 contribution estimate: 3 PM2 services modified (codec-dashboard, codec-heartbeat, codec-agent-runner NEW), ~14 audit events, +85 to +100 passing tests, 3 PRs (one per step) + closeout doc. Each step deployable independently. Per brainstorming skill workflow: this doc replaces the "docs/superpowers/specs/" default location with the established docs/PHASE*-BLUEPRINT.md pattern from Phase 2. Ready for implementation planning via superpowers:writing-plans once user approves the spec.

AVADSA25 merged commit 2da2d35 into main May 3, 2026
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs(phase3): blueprint — drop-a-project autonomy (Steps 8/9/10)#15

docs(phase3): blueprint — drop-a-project autonomy (Steps 8/9/10)#15
AVADSA25 merged 1 commit intomainfrom
docs/phase3-blueprint

AVADSA25 commented May 3, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

AVADSA25 commented May 3, 2026

Summary

Design produced via brainstorming session 2026-05-03

Step breakdown

Reuses from Phase 1 + 2 (no rebuilds)

Estimated Phase 3 contribution

Test plan

Out of scope (explicitly deferred)

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants