Drop-in multi-agent sprint for Claude Code.
No binaries. No servers. No external dependencies. Just Markdown skill files.
micro-squad turns Claude Code into a coordinated engineering team. An orchestrator delegates to parallel sub-agents that think, plan, build, review, and ship — using Claude Code's native Agent tool.
You: /squad add dark mode support
THINK ─── Forcing questions, explore alternatives
You pick scope: EXPAND / SELECTIVE / HOLD / REDUCE
│
PLAN ─────── Architect ──┐ parallel Unified plan with
Scout ─────┘ effort estimates
│
BUILD ─────── Builder implements with atomic commits
Auto-reverts on regression, 3-strike escalation
│
VERIFY ───── Judge A ───┐
Judge B ───┤ parallel Consensus table:
QA Agent ──┘ FIX / TRIAGE / DISMISS
│
Fix Agent → Re-judge (max 2 rounds)
│
SHIP ──────── Commit + PR with full evidence trail
AI makes completeness cheap. The last 10% that teams used to skip costs seconds now. micro-squad is built on three principles:
- Boil the Lake — When 100% costs minutes more than 90%, do 100%.
- Search Before Building — Check what exists before designing from scratch.
- User Sovereignty — AI recommends, users decide. Always.
Read ETHOS.md for the full philosophy.
- Dual blind judges — Two independent reviewers, orchestrator synthesizes consensus
- CRITICAL override — A single CRITICAL finding is never dismissed, even by one judge
- Real parallelism — Architect + Scout, Judge A + Judge B + QA launch simultaneously
- Context isolation — Each agent gets fresh context, reads only what it needs
- Self-regulating builder — Atomic commits, auto-revert on regression, 3-strike rule
- State machine — Enforced phase dependencies, resumable sprints, skip/stop support
- Artifact trail — Everything in
.squad/, inspectable, diffable, git-friendly - Artifact size budgets — Agents produce concise output, no token bloat downstream
- Learning loop —
/verifycaptures findings, future agents start smarter - Retrospectives —
/retroanalyzes git stats, hotspots, and patterns - Agent failure handling — Missing output detected, retry/skip options
- User stays in control — Confirmation between every phase, explicit scope modes
git clone https://github.com/SebastianPuchet/micro-squad.git ~/.claude/skills/micro-squad
cd ~/.claude/skills/micro-squad && ./setup| Command | What it does |
|---|---|
/squad <task> |
Full sprint — think, plan, build, verify, ship |
/think <topic> |
Challenge assumptions with forcing questions |
/plan |
Parallel architect + scout → unified plan |
/build |
Implementation with guardrails |
/verify |
Judgment day: dual blind review + QA + fix loop |
/secure |
Infrastructure-first security audit |
/investigate <bug> |
3-strike root cause debugging |
/ship |
PR with full evidence trail |
/retro |
Sprint retrospective — git stats, patterns, learnings |
Each command works standalone or as part of a full /squad sprint.
micro-squad/
├── squad/SKILL.md # Full sprint orchestrator
├── think/SKILL.md # Forcing questions + scope selection
├── plan/SKILL.md # Parallel architect + scout
├── build/SKILL.md # Builder with guardrails
├── verify/SKILL.md # Judgment day (dual judges + QA)
├── secure/SKILL.md # Security audit
├── investigate/SKILL.md # Root cause debugging
├── ship/SKILL.md # PR with evidence
├── retro/SKILL.md # Sprint retrospective
├── _shared/
│ ├── orchestrator-contract.md # State machine, artifact contract, rules
│ └── agent-prompts.md # All sub-agent prompt templates
├── ETHOS.md # Builder philosophy
├── CLAUDE.md # Project context for Claude Code
├── AGENTS.md # Skill index + dependency graph
└── setup # Installs all skills
Each sprint creates .squad/<sprint-id>/:
.squad/2025-03-27-dark-mode/
├── state.md # Phase tracking + project context
├── think.md # Problem, alternatives, scope decision
├── architecture.md # Data flows, file changes, edge cases
├── scout-report.md # Existing patterns, reusable code, risks
├── plan.md # Unified plan with effort estimates
├── build-log.md # Commit-by-commit progress
├── build-summary.md # Final build status + blockers
├── review-a.md # Judge A findings
├── review-b.md # Judge B findings
├── qa-report.md # Test results
├── verdict.md # Consensus table + final verdict
├── security.md # Security audit (optional)
├── investigation.md # Root cause analysis (if debugging)
└── retro.md # Sprint retrospective (if run)
Two independent code reviewers plus a QA agent run in parallel. Neither judge knows the other exists. The orchestrator builds a consensus table:
| Finding | Severity | Judge A | Judge B | QA | Verdict |
|---|---|---|---|---|---|
| Null check missing | CRITICAL | YES | YES | — | FIX |
| Race condition | WARNING | YES | no | FAIL | FIX |
| Naming inconsistency | SUGGESTION | YES | no | — | TRIAGE |
- FIX: Found by 2+ agents, or ANY single CRITICAL finding
- TRIAGE: Found by 1 agent (non-critical) — user decides
- DISMISS: Contradicted + SUGGESTION severity only
If FIX items exist, a Fix Agent addresses confirmed issues, then all three agents re-run (max 2 rounds). If still broken, escalated to user with full history.
micro-squad is opinionated about how to build with AI. Three principles guide every agent:
- Boil the Lake — Completeness is cheap. Do 100% when it costs minutes more than 90%.
- Search Before Building — Three layers: tried-and-true, new-and-popular, first-principles.
- User Sovereignty — AI recommends, users decide. No exceptions.
See ETHOS.md for details.
MIT