Agent Flight Recorder

Debug, replay, and audit every AI agent run.

Agent Flight Recorder is a local observability tool for AI coding agents and MCP-powered workflows. It records what the agent was asked to do, what tools it called, what files changed, what commands ran, what errors happened, and what the final result looked like.

Think of it as:

Chrome DevTools for AI agents

Why This Exists

AI coding agents can make useful changes quickly, but the path they take is often hard to inspect after the fact.

Developers need answers to questions like:

What prompt started this run?
Which agent and model were used?
What files did the agent read or edit?
What commands did it run?
Did tests fail before they passed?
Did the agent retry or loop?
What did the final diff contain?
How long did the run take?
What did the run cost?

Agent Flight Recorder turns an agent session into a local timeline.

Install

npm install -g @agentopssec/agent-flight-recorder

Or run it without installing:

npx -y @agentopssec/agent-flight-recorder list

Update

agent-flight update          # check the registry, prompt before installing
agent-flight update --yes    # update without prompting

Primary Workflow

Agent Flight Recorder starts by wrapping an agent command:

agent-flight run -- codex "fix the failing tests"

The workflow should do three things well:

Record the agent command and timeline.
Capture files, commands, errors, and diffs.
Produce a local report that can be inspected later.

CLI

agent-flight run "codex fix the failing tests"
agent-flight run -- codex "fix the failing tests"
agent-flight run -- gemini "refactor this component"
agent-flight inspect latest
agent-flight inspect run_001
agent-flight replay run_001
agent-flight export run_001 --html
agent-flight list
agent-flight diff run_001
agent-flight update [--yes]

Standalone and Stack Use

Agent Flight Recorder runs on its own as a command wrapper:

agent-flight run -- codex "fix the failing tests"
agent-flight inspect latest

When used with the full AgentOpsSec stack, its local run logs can feed Agent Review and Agent Cost Lens without either tool importing Agent Flight Recorder code:

agent-review --from-agent-flight latest
agent-cost month

What Agent Flight Recorder Captures

Agent Flight Recorder captures a timeline of an agent run, including:

User prompt
Agent command
Model and provider metadata when available
Token usage when available
Estimated cost
Latency
Tool calls observed in agent output or provided through structured metadata
MCP server calls observed in agent output or provided through structured metadata
Shell commands
File reads
File writes
Git diffs
Test runs
Errors
Retries
Final response
Final code diff

Agent Flight Recorder includes lightweight adapters for common CLI names such as Codex, Claude, Gemini, and OpenCode. Exact tool-call and token capture depends on what the wrapped CLI prints or provides through environment metadata.

Example Timeline

Agent Flight Recorder Run by github.com/AgentOpsSec

Run: fix-failing-tests-0425

1. Prompt received
2. Model call: 12.8s, 18,400 tokens
3. Tool call: filesystem.read package.json
4. Tool call: shell.exec npm test
5. Error detected: TypeScript compile failure
6. File edit: src/lib/parser.ts
7. File edit: package.json
8. Tool call: shell.exec npm install
9. Tool call: shell.exec npm test
10. Tests passed
11. Final diff: 6 files changed
12. Estimated cost: $0.42

Run Log Shape

{
  "tool": {
    "name": "Agent Flight Recorder",
    "by": "github.com/AgentOpsSec",
    "repository": "github.com/AgentOpsSec/agent-flight-recorder"
  },
  "runId": "run_001",
  "startedAt": "2026-04-25T10:00:00Z",
  "endedAt": "2026-04-25T10:04:12Z",
  "agent": "codex",
  "model": "unknown",
  "prompt": "fix the failing tests",
  "events": [
    {
      "type": "shell.exec",
      "command": "npm test",
      "exitCode": 1,
      "durationMs": 12944
    },
    {
      "type": "file.change",
      "path": "src/lib/parser.ts",
      "changeType": "modified"
    }
  ],
  "gitDiffSummary": {
    "filesChanged": 3,
    "insertions": 44,
    "deletions": 18
  },
  "estimatedCost": 0.18
}

Local Reports

Reports are designed for both terminal inspection and automation:

Local JSON run logs
Terminal run summaries
Final diff summaries
HTML exports
Run list and latest-run inspection
Cost and latency summaries where available

Design Principles

Local-first
Open-source
No telemetry by default
Complete local run history
Useful terminal summaries
Exportable reports
Replayable context
Practical cost visibility

Initial Release Scope

The initial release includes command wrapping, run timeline capture, git diff capture, local reports, HTML export, and cost estimation.

1.0: Command Wrapper

Wrap local agent commands
Capture the requested prompt and command
Record start and end timestamps
Detect the current repository
Store each run under a local run ID
Print a clear terminal summary when the run finishes

1.0: Timeline and Diff Capture

Capture shell commands when available
Capture file changes before and after the run
Capture git diff summaries
Record errors and non-zero exits
Track test command execution
Store structured JSON run logs

1.0: Inspection and Export

Inspect the latest run
List previous runs
View final diffs for a run
Export a run as HTML
Estimate token and provider cost where possible
Compare basic run metadata across sessions

Output

Reports use plain-language status words rather than raw exit codes:

ok — the step ran successfully (green).
failed (exit N) — the step exited non-zero (red); the original code is preserved.
skipped (reason) — the step was not applicable (dim).

Severity colors follow the AgentOpsSec palette (safe = green, warning = amber, risk = red). The palette honors NO_COLOR and FORCE_COLOR, and JSON / CSV output stays plain.

Repo: https://github.com/AgentOpsSec/agent-flight-recorder
npm: https://www.npmjs.com/package/@agentopssec/agent-flight-recorder
AgentOpsSec stack: https://github.com/AgentOpsSec/stack
Website: https://AgentOpsSec.com

Author

Created and developed by Aunt Gladys Nephew.

Website: https://auntgladysnephew.com
GitHub: https://github.com/auntgladysnephew
X: https://x.com/AGNonX

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
bin		bin
src		src
test		test
.gitignore		.gitignore
.npmignore		.npmignore
CHANGELOG.md		CHANGELOG.md
LICENSE		LICENSE
README.md		README.md
package.json		package.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Agent Flight Recorder

Why This Exists

Install

Update

Primary Workflow

CLI

Standalone and Stack Use

What Agent Flight Recorder Captures

Example Timeline

Run Log Shape

Local Reports

Design Principles

Initial Release Scope

1.0: Command Wrapper

1.0: Timeline and Diff Capture

1.0: Inspection and Export

Output

Author

About

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Agent Flight Recorder

Why This Exists

Install

Update

Primary Workflow

CLI

Standalone and Stack Use

What Agent Flight Recorder Captures

Example Timeline

Run Log Shape

Local Reports

Design Principles

Initial Release Scope

1.0: Command Wrapper

1.0: Timeline and Diff Capture

1.0: Inspection and Export

Output

Author

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Contributors

Uh oh!

Languages