Observability and audit trails for AI coding agents.
AgentTrace is a local-first CLI that records what an AI coding agent attempted, what changed, what tests ran, what review happened, and what evidence supports the final result. It is designed for teams using tools such as Codex, Claude Code, Cursor, Copilot, and similar coding agents.
The MVP creates structured audit trails in a local Git workspace:
- task description
- git branch and commit at run start
- git status and diff snapshots
- test/build command evidence
- review notes
- linked GitHub pull request metadata
- EvalOps benchmark and regression evidence
- Markdown report suitable for a PR description
- local HTML dashboard across recorded runs
python -m pip install -e .You can also run without installation:
python -m agenttrace --helpInitialize AgentTrace in a Git repository:
agenttrace initStart a run:
agenttrace start "add input validation to the CLI" --tool Codex --model gpt-5Capture current Git evidence:
agenttrace snapshotRun and record a test/build command:
agenttrace add-test "python -m unittest"Record manual test evidence without executing a command:
agenttrace add-test "manual QA in staging" --no-execute --note "Reviewer confirmed happy path."Append review notes:
agenttrace add-review review-notes.mdLink the active run to a GitHub pull request:
agenttrace pr 123 --title "Add input validation" --base main --head feature/input-validationRecord evaluation evidence:
agenttrace eval benchmark-42 --score 94.5 --regression passed --note "No benchmark regression."Generate the final report:
agenttrace reportGenerate a local dashboard:
agenttrace dashboard.agenttrace/
dashboard.html
config.json
runs/
20260518-120000-add-input-validation/
run.json
diff.patch
status.txt
tests.md
review.md
report.md
pr-description.md
- Evidence is local to the repository and not uploaded anywhere.
- Test commands execute with the current user's shell environment.
- The report truncates very large diffs for readability.
- GitHub PR integration records local metadata only; it does not call the GitHub API.
- EvalOps integration records local benchmark metadata only; it does not call an external eval service.
- The dashboard is a static local HTML file, not a hosted web service.
Run the test suite:
$env:PYTHONPATH='src'; python -m unittest discover -s testsSee docs/workflow.md for a complete example workflow.
If you are new to AgentTrace, start with the beginner guides in docs/user-guides/README.md.