sheldon

A Claude Code plugin that runs Missions — a multi-agent workflow with three roles:

Orchestrator (opus): plans the feature, writes a validation contract, drives the loop. Runs as the main Claude Code thread.
Worker (sonnet): implements the feature on a dedicated mission/<id> branch, in a fresh-context subagent.
Validator (haiku): adversarially verifies the implementation against the contract, in a fresh-context subagent. Read-only.

Inspired by the "Missions" architecture from Factory.ai's Alvoeiro talk. Serial execution per feature, validation-contract-first, branch-per-mission, fresh worker contexts.

How it works

User: /sheldon:mission-new "add user profile editing"
  │
  ▼
Orchestrator (main thread)
  ├─ creates .missions/<id>/, branches mission/<id>
  ├─ writes contract.md (numbered, executable assertions)
  └─ waits for /sheldon:mission-approve
       │
       ▼
   spawns Worker subagent  ──► implements + commits + missions.handoff(...)
       │
       ▼
   spawns Validator subagent ──► reads contract + diff, runs assertions, missions.validate(...)
       │
       ▼
   pass → merge mission/<id> → main
   fail → re-spawn Worker with findings

Mission state lives in plain files under .missions/<id>/ (state.json, contract.md, handoffs/, validations/) — git-friendly, easy to inspect, easy for the TUI to watch.
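
Because the state is plain JSON on disk, it can be inspected without going through the plugin. The following is a minimal sketch in Python, not part of Sheldon itself. It assumes each state.json is a JSON object with phase and updated_at keys (the field names this README uses elsewhere); the real schema is not documented here and may differ. The function name list_missions is hypothetical.

```python
import json
from pathlib import Path


def list_missions(repo_root: str) -> list[dict]:
    """Return one summary dict per mission found under <repo_root>/.missions/.

    Assumption: state.json holds a JSON object with "phase" and
    "updated_at" keys. Missing keys are reported as None.
    """
    missions = []
    for state_file in sorted(Path(repo_root, ".missions").glob("*/state.json")):
        try:
            state = json.loads(state_file.read_text(encoding="utf-8"))
        except (OSError, json.JSONDecodeError):
            continue  # skip unreadable or half-written state files
        if not isinstance(state, dict):
            continue  # not the object shape this sketch assumes
        missions.append({
            "id": state_file.parent.name,  # directory name is the mission id
            "phase": state.get("phase"),
            "updated_at": state.get("updated_at"),
        })
    return missions
```

Calling list_missions(".") from a project root returns an empty list when no .missions/ directory exists.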

Install

Recommended: install via the Claude Code plugin marketplace

claude plugin marketplace add sebiko3/sheldon
claude plugin install sheldon

That's it — Sheldon is now available in every Claude Code session, no --plugin-dir needed. Update later with claude plugin update sheldon.

For development (working on Sheldon itself)

git clone https://github.com/sebiko3/sheldon "$SHELDON_DIR"   # SHELDON_DIR: any checkout path you choose
cd "$SHELDON_DIR"
npm install
claude --plugin-dir "$(pwd)"

Usage

A full mission has six beats. You drive two of them; the agents drive the rest.

You: start a mission. Run /sheldon:mission-new "<one-sentence goal>". The Orchestrator (the main Claude Code thread) creates .missions/<id>/, branches mission/<id> off main, and asks any clarifying questions it needs to write the validation contract.
Orchestrator: writes the contract. A YAML frontmatter block listing numbered, executable assertions — each one a bash -c one-liner whose exit code 0 means the assertion holds. The contract is the spec; the implementation will be validated against it strictly.
You: approve. Skim the contract, then run /sheldon:mission-approve (or /sheldon:mission-approve <id> if you have more than one mission in contract_review). This transitions the phase to implementing and the Orchestrator spawns the Worker.
Worker: implements. A fresh-context subagent that only sees the mission id, reads the contract, edits code on mission/<id>, commits atomically, and hands off.
Validator: verifies. Another fresh-context subagent, read-only. Runs each assertion's check: command from the contract, reviews the diff, and returns either pass (every assertion green) or fail with concrete findings. On the first fail, the Orchestrator re-spawns the Worker once with the findings; a second fail aborts the mission.
Orchestrator: merges. On pass, mission/<id> merges into main and the brain learns. You're done.
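
The validation and retry beats above can be sketched in a few lines of Python. This is an illustration of the stated rules (exit code 0 means the assertion holds; one re-spawn, then abort), not Sheldon's actual Validator. The assertion shape (dicts with id and check keys), the return structure, and the names run_assertions and next_step are assumptions made for the example. It requires bash on PATH.

```python
import subprocess


def run_assertions(assertions: list[dict]) -> dict:
    """Run each assertion's check with `bash -c`; exit code 0 means it holds.

    Assumption: every assertion is a dict with "id" and "check" keys.
    """
    findings = []
    for assertion in assertions:
        proc = subprocess.run(
            ["bash", "-c", assertion["check"]],
            capture_output=True,
            text=True,
        )
        if proc.returncode != 0:
            findings.append({
                "id": assertion["id"],
                "exit_code": proc.returncode,
                "stderr": proc.stderr.strip(),
            })
    verdict = "pass" if not findings else "fail"
    return {"verdict": verdict, "findings": findings}


def next_step(verdict: str, failed_before: int) -> str:
    """Retry policy from the list above: one re-spawn, then abort."""
    if verdict == "pass":
        return "merge"
    return "respawn_worker" if failed_before == 0 else "abort"
```

For example, run_assertions([{"id": "a1", "check": "test -f package.json"}]) yields a pass verdict only when package.json exists in the current directory.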

While a mission is running, you can check progress with:

/sheldon:mission-status [id]          # phase + diff summary
/sheldon:mission-list                 # everything still in flight
/sheldon:mission-retro <id>           # one-paragraph postmortem (after termination)

A few power tools worth knowing:

/sheldon:epic-new "<vague brief>" — when the work is exploratory ("look at this repo and pull useful ideas"), the Epic Planner decomposes it into 3–7 candidate sub-missions you can selectively promote with /sheldon:epic-promote <epic_id> <issue_id>.
/sheldon:brain-recall [topic] — surfaces what the brain has learned about this project (conventions, lessons, capability proposals). The Orchestrator and Worker consult it automatically before planning and implementing; you can read it directly too.
/sheldon:contract-lint <path> — lint a draft contract before approval. The Orchestrator runs this automatically; you can run it manually if you're hand-editing a contract.
/sheldon:missions-report — health snapshot of the mission loop (throughput, rework rate, time-to-merge percentiles).
bin/sheldon doctor — diagnose install issues (Node version, MCP server build, plugin manifest, git availability) without launching Claude Code.

For a worked end-to-end example covering the happy path, a validator rejection, a contamination event, and an abort-with-cleanup, see docs/walkthrough.md.

Slash commands

Command	What it does
`/sheldon:mission-new <goal>`	Orchestrator creates a mission, branches `mission/<id>`, writes a validation contract, and waits for approval.
`/sheldon:mission-approve [id]`	Approve the contract → spawn Worker → Validator loop → merge on pass / reopen on fail.
`/sheldon:mission-status [id]`	Show mission phase, contract, handoffs, validation runs, and diff summary.
`/sheldon:mission-list [--phase=<phase>]`	List all missions, optionally filtered by phase.
`/sheldon:mission-abort <id> [reason] [--delete-branch]`	Cancel an in-flight mission (destructive; requires confirmation).
`/sheldon:epic-new <brief>`	Decompose a vague brief into 3–7 candidate sub-missions; Epic Planner researches the codebase in parallel and writes `.epics/<id>/epic.md` for review.
`/sheldon:epic-list [--status=<status>]`	List all epics and their proposed issues.
`/sheldon:epic-promote <epic_id> <issue_id>`	Promote one epic issue into a real mission (creates a mission in `planning` phase).
`/sheldon:missions-report`	Print a one-screen health snapshot of the mission loop — phase breakdown, throughput, time-to-merge percentiles, rework + abort rate, recently merged. Pure stdlib Python; safe to run any time.
`/sheldon:missions-gc [--days <N>] [--apply]`	List (or delete with `--apply`) stale `mission/<id>` branches whose phase is `aborted`/`done` and `updated_at` is older than `--days` days (default 14). Never deletes the currently checked-out branch.
`/sheldon:contract-lint <path>`	Lint a draft mission contract before approval — flags the gray-matter colon-space gotcha, missing executable assertions, duplicate or non-kebab-case ids, and prints assertion counts. Stdlib Python; non-zero exit on errors.
`/sheldon:brain-recall [topic]`	Surface what Sheldon has learned about this project — conventions, lessons, agent improvements, and capability proposals from past missions.
`/sheldon:brain-learn <mission_id>`	After a mission terminates (merge/abort/twice-fail), distill its contract + handoffs + validations into durable brain entries the next mission inherits.
`/sheldon:brain-list`	Dump every active brain entry plus per-type counts; pointer to the human-readable digest at `.sheldon/brain/README.md`.
`/sheldon:brain-dedup [--threshold <float>] [--type <type>]`	Scan the brain for near-duplicate entries within each type group and report candidate pairs above the overlap threshold (default 0.6). Read-only; to retire a duplicate use `brain_observe` with `supersedes`.
`/sheldon:mission-retro <mission_id>`	Print a one-paragraph narrative postmortem for a terminated mission — what was built, validator outcome, time-to-terminal.
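
To make the /sheldon:missions-gc selection rule concrete, here is a hedged sketch of the staleness test described in the table: terminal phase, updated_at older than the cutoff, and never the checked-out branch. It is not the shipped script. It assumes updated_at is an ISO-8601 timestamp with an explicit UTC offset, and the function name stale_branches and the mission dict shape are hypothetical.

```python
from datetime import datetime, timedelta, timezone

TERMINAL_PHASES = {"aborted", "done"}


def stale_branches(missions, current_branch, now, days=14):
    """Return mission/<id> branch names that qualify for cleanup.

    Assumption: each mission is a dict with "id", "phase" and an
    ISO-8601 "updated_at" that includes a UTC offset.
    """
    cutoff = now - timedelta(days=days)
    stale = []
    for mission in missions:
        branch = f"mission/{mission['id']}"
        updated = datetime.fromisoformat(mission["updated_at"])
        if (
            mission["phase"] in TERMINAL_PHASES
            and updated < cutoff
            and branch != current_branch  # never touch the checked-out branch
        ):
            stale.append(branch)
    return stale
```

Passing now explicitly (for example datetime.now(timezone.utc)) keeps the function deterministic and easy to test.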

The brain: how Sheldon learns

Sheldon keeps a small persistent learning layer at .sheldon/brain/ (per project, JSONL-backed). It stores four kinds of entries:

Conventions — project-specific facts (build tool, test runner, style rules, file layout).
Lessons — meta-rules distilled from past mission failures or near-misses (e.g., "quote contract YAML descriptions containing : ").
Capability proposals — net-new skills/hooks/scripts/agents the brain has identified as worth shipping; surfaced via /sheldon:brain-recall --type proposal and ready for promotion into missions.
Agent improvements — proposed tweaks to agents/*.md that would have prevented a prior defect. These never auto-apply; the Orchestrator promotes them into normal missions.

The Orchestrator calls brain_recall before writing each contract and /sheldon:brain-learn <id> after each mission terminates. The Worker calls brain_recall before implementing. The Validator does NOT consult the brain — it validates strictly against the contract, so pass/fail stays mechanically reproducible.

Tools exposed by the MCP server: mcp__plugin_sheldon_missions__brain_observe, mcp__plugin_sheldon_missions__brain_recall, mcp__plugin_sheldon_missions__brain_list. The brain lives in two files:

.sheldon/brain/seed.jsonl (tracked) — the curated baseline that ships with the plugin and is shared across contributors.
.sheldon/brain/entries.jsonl (gitignored) — per-environment observations the local Orchestrator accumulates via brain_observe. Never committed.

listEntries() returns the union; observe() only ever writes to entries.jsonl. Tombstones in entries.jsonl can supersede seed entries via last-write-wins fold. .sheldon/brain/README.md is a regenerated human-readable digest of the active set.
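
The last-write-wins fold can be pictured with a short Python sketch. The real server is TypeScript and its record format is not shown in this README, so the field names here are assumptions: each JSONL line is an object with an id, an optional supersedes pointing at the id it retires, and an optional tombstone flag. The function name fold is hypothetical.

```python
import json


def fold(seed_lines, entry_lines):
    """Fold seed records, then local records, into the active set.

    Later lines win. Assumed record fields: "id", optional
    "supersedes" (id of the entry being retired), optional "tombstone".
    """
    active = {}
    for line in [*seed_lines, *entry_lines]:
        line = line.strip()
        if not line:
            continue  # tolerate blank lines in the JSONL files
        record = json.loads(line)
        if record.get("supersedes"):
            active.pop(record["supersedes"], None)  # retire the older entry
        if record.get("tombstone"):
            active.pop(record["id"], None)  # a tombstone removes its own id
        else:
            active[record["id"]] = record  # same id: the later line replaces it
    return list(active.values())
```

Reading seed.jsonl before entries.jsonl is what lets a local tombstone hide a seed entry without modifying the tracked file.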

Epics: turning vague briefs into missions

Not every request is a single well-scoped mission. When the work is exploratory — "look at this repo and pull useful ideas," "refactor this subsystem," "design feature X" — start with /sheldon:epic-new <brief>. The Epic Planner agent:

Researches the codebase in parallel via Explore sub-agents.
Decomposes the brief into 3–7 candidate sub-missions (each independently scope-able and assertable).
Writes .epics/<id>/epic.md with rationale + acceptance sketches per issue.
Returns the table for you to review.

You then promote any subset via /sheldon:epic-promote <epic_id> <issue_id>. Each promoted issue becomes a normal mission in planning phase, ready for the standard Orchestrator → Worker → Validator loop. Issues you don't promote stay as proposed for later.

Layout

Path	Purpose
`.claude-plugin/plugin.json`	Plugin manifest (name/version/license)
`.claude-plugin/marketplace.json`	Single-plugin marketplace manifest — what `claude plugin marketplace add` consumes
`settings.json`	Activates Orchestrator as the main thread
`agents/`	Orchestrator / Worker / Validator / Epic Planner definitions
`skills/`	Slash commands (`/sheldon:mission-*`, `/sheldon:epic-*`, `/sheldon:brain-*`, tooling)
`hooks/hooks.json`	PreToolUse (contract immutability + pre-merge scope-creep advisory) + PostToolUse (touched-file tracking) + SubagentStop (state-transition log)
`mcp/missions-server/`	The shared-state MCP server (stdio, TypeScript) — built by `npm install` postinstall
`tui/`	Mission Control terminal UI
`scripts/hooks/`	Shell scripts invoked by the hook config
`scripts/`	Stdlib Python helpers (`missions-report`, `contract-lint`, `mission-retro`, `brain-dedup`, `missions-gc`, `gen-og-image`)
`bin/sheldon`	CLI launcher; `bin/sheldon doctor` diagnoses install issues
`.missions/<id>/`	Per-mission state files (gitignored) — state.json, contract.md, handoffs/, validations/, touched.list
`.epics/<id>/`	Per-epic proposal files (tracked; audit trail) — epic.md with candidate sub-missions
`.sheldon/brain/`	Learning layer — `seed.jsonl` (tracked baseline) + `entries.jsonl` (gitignored, per-env) + regenerated README.md digest
`docs/`	GitHub Pages landing (`index.html` + `assets/`) plus walkthrough, PLATFORM, RELEASING

Platform

macOS is supported today; see docs/PLATFORM.md for Linux compatibility status.

Statusline

Sheldon ships a Claude Code statusline at scripts/statusline.mjs. It renders a single line summarising the most-recently-updated active mission plus the size of the brain:

sheldon | mission:<id-short> phase:<phase> brain:<n> last:<pass|fail|—>

<id-short> is the first 8 characters of the mission ULID, <phase> is the lifecycle phase, <n> is the total number of lines across seed.jsonl and entries.jsonl in .sheldon/brain/, and last: reflects the most recent validator verdict (or an em-dash if there is none yet). When no mission exists, every field falls back to an em-dash.

The script is wired automatically via .claude/settings.json (top-level statusLine block; node scripts/statusline.mjs). It is stdlib-only, never throws, and caches its rendered output for 5 seconds at .sheldon/cache/statusline.json so rapid statusline repaints don't re-scan disk on every frame.
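
The line format and the 5-second cache can be illustrated with a small Python sketch. The shipped script is Node (scripts/statusline.mjs); this is only a model of the behaviour described above. The cache file layout (at and line keys) and the names render_statusline and cached_render are assumptions.

```python
import json
import time
from pathlib import Path

DASH = "\u2014"  # placeholder used when a field has no value
CACHE_TTL_SECONDS = 5


def render_statusline(mission=None, brain_count=0, last_verdict=None):
    """Build the one-line summary; every field is a dash without a mission."""
    if mission is None:
        return f"sheldon | mission:{DASH} phase:{DASH} brain:{DASH} last:{DASH}"
    verdict = last_verdict if last_verdict in ("pass", "fail") else DASH
    return (
        f"sheldon | mission:{mission['id'][:8]} phase:{mission['phase']} "
        f"brain:{brain_count} last:{verdict}"
    )


def cached_render(cache_path, render, now=None):
    """Return the cached line if it is younger than the TTL, else re-render.

    Any I/O or parse problem falls through to a fresh render, so the
    function never raises because of a bad cache file.
    """
    now = time.time() if now is None else now
    path = Path(cache_path)
    try:
        cached = json.loads(path.read_text(encoding="utf-8"))
        if now - cached["at"] < CACHE_TTL_SECONDS:
            return cached["line"]
    except (OSError, ValueError, KeyError, TypeError):
        pass  # missing, corrupt, or unexpected cache: ignore it
    line = render()
    try:
        path.parent.mkdir(parents=True, exist_ok=True)
        path.write_text(json.dumps({"at": now, "line": line}), encoding="utf-8")
    except OSError:
        pass  # a failed cache write must not break the statusline
    return line
```

Swallowing cache errors mirrors the "never throws" property: a statusline that crashes is worse than one that re-scans disk.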

Environment variables

Var	Default	Purpose
`SHELDON_REPO_ROOT`	`process.cwd()` (which Claude Code sets to the project directory)	Override the directory the MCP server treats as the project root. Unexpanded `${...}` placeholders and non-existent paths are ignored and the fallback is used.
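
The fallback rule in the table can be sketched as follows. This is a Python model of the described behaviour, not the MCP server's TypeScript code. The function name resolve_repo_root is hypothetical, and treating "non-existent path" as "not an existing directory" is an assumption.

```python
import os


def resolve_repo_root(env=None, cwd=None):
    """Pick the project root: SHELDON_REPO_ROOT if usable, else the cwd."""
    env = os.environ if env is None else env
    cwd = os.getcwd() if cwd is None else cwd
    candidate = env.get("SHELDON_REPO_ROOT", "")
    if not candidate:
        return cwd  # variable unset or empty
    if "${" in candidate:
        return cwd  # placeholder was never expanded by the launcher
    if not os.path.isdir(candidate):
        return cwd  # path does not exist (assumed: must be a directory)
    return candidate
```

Passing env and cwd as parameters keeps the rule testable without mutating the real process environment.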

Auth note

This plugin runs inside Claude Code, which uses the user's Claude Pro/Max/Team/Enterprise subscription for inference — ToS-compliant first-party use. Sheldon never handles OAuth tokens itself.

Community

Landing page: sebiko3.github.io/sheldon — overview, animated walkthrough, install instructions (served by GitHub Pages from docs/).
Contributing: CONTRIBUTING.md — fork/branch/PR workflow, coding conventions, testing expectations.
Code of conduct: CODE_OF_CONDUCT.md — adopts the Contributor Covenant 2.1.
Security: SECURITY.md — please report vulnerabilities via private GitHub security advisory, not public issues.
Changelog: CHANGELOG.md — release notes, Keep-a-Changelog format.
Releasing: docs/RELEASING.md — maintainer reference for cutting a release.

Credits

Architectural inspiration: the "Missions" architecture from Factory.ai's Alvoeiro talk — serial execution per feature, validation-contract-first, branch-per-mission, fresh worker contexts.
Borrowed skills (planned via epic 01KRCE0QJX3QFT8MCS8B2R9YDE): the following skills are being ported into skills/ from obra/superpowers (MIT-licensed). Per project convention, attribution is centralized here rather than in SKILL.md footers (trailing metadata inside a skill body competes with its instructions for the model's attention).
- systematic-debugging — adapted from obra/superpowers/skills/systematic-debugging (MIT).
- verification-before-completion — adapted from obra/superpowers/skills/verification-before-completion (MIT).
- test-driven-development — adapted from obra/superpowers/skills/test-driven-development (MIT).
- brainstorming — adapted from obra/superpowers/skills/brainstorming (MIT).
