You move fast with AI agents and ship a lot, and the reasoning behind a decision evaporates almost immediately. You knew what you wanted and how you wanted it; the agent took a different path; and a week later, mid-debug, you are trying to reconstruct what actually happened. closedtab keeps that record: what you asked for, what the agent did, and what the agent understood the directive to be.
It is how you pass the baton between agents, keep a multi-agent run legible, track the decisions and the bugs and fixes along the way, and get better at the handoff itself. As much about working well with agents as it is about provenance.
It works best when you stay hands-on: you as the product owner, agents as engineer or PM. Run it feature by feature, on the work you are steering, and the records add up into a history of how the project actually got built.
npm install -g closedtab
closedtab new
It scaffolds a dated Agent Action Record to fill in: a six-part record of one agent run.
| Part | What it captures |
|---|---|
| Intent | the instruction, how success was defined, the authority the agent had, what was out of scope |
| Action | what it did (from logs), what it produced, where it diverged, whether it stayed within authority |
| Judgment | the decisions it made alone, which a human should have seen, where it should have escalated, who is accountable |
| Deviation | the gaps and their root cause, the good deviations, the confidently-wrong |
| Consequence | the outcome, the harm or cost or risk, who was affected, the expected failure at 100 runs |
| Change | what changes before the next run, what to keep, what belongs with a human, the signal it worked |
The record-template.md in this repo is the same form by hand: work through the
six sections in order, no AI required, and keep the record.
closedtab check ./docs/aar-fall-portal.md
Scores a record on whether it captured what is worth keeping: the intent, the reasoning, who decided, how it held up, the deviations, and the risks carried forward.
closedtab reconcile --testimony ./aar.md --trace ./trace.jsonl --out ./diff.json
Lines a record up against a machine trace of what the agent actually did. Intent and Action go in; the Deviation falls out: where they agree, where the record claims more than the trace shows, where the trace shows work the record omits, and any direct contradiction.
closedtab new --type bugfix (also feature, adr, handoff) writes lighter
task docs for the same human-agent team, each to its own prefix (aar-*,
adr-*, handoff-*). The Agent Action Record is the default and the point.
{ "mcpServers": { "closedtab": { "command": "closedtab-mcp" } } }That gives the agent four tools: list_templates, new_doc, check, and
reconcile. The agent finishes a segment, writes the record of what it just did,
checks it, and saves it alongside the change.
import { renderAar, TEMPLATES, checkDoc, reconcileText } from "closedtab";npm publish # runs prepublishOnly: clean, build, test, ships dist/
npm test # vitest
npm run build # tsc to dist/
Local and deterministic: it needs only Node. The record format comes from the
Agent After-Action Review skill (github.com/quarterback/AAR), here under MIT.
More on the thinking in WHY.md.