AgentTrace

Observability and audit trails for AI coding agents.

AgentTrace is a local-first CLI that records what an AI coding agent attempted, what changed, what tests ran, what review happened, and what evidence supports the final result. It is designed for teams using tools such as Codex, Claude Code, Cursor, Copilot, and similar coding agents.

MVP Scope

The MVP creates structured audit trails in a local Git workspace:

task description
git branch and commit at run start
git status and diff snapshots
test/build command evidence
review notes
linked GitHub pull request metadata
EvalOps benchmark and regression evidence
Markdown report suitable for a PR description
local HTML dashboard across recorded runs
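The Git items in this list correspond to a handful of standard Git commands. The following is a minimal sketch of how that evidence could be gathered, not AgentTrace's actual implementation: the function names and dictionary keys are hypothetical, and only the Git commands themselves are standard.

```python
import subprocess
from typing import Callable, Dict, List

# Standard Git commands. The keys are hypothetical names chosen for this
# sketch; they are not AgentTrace's actual run.json fields.
GIT_QUERIES: Dict[str, List[str]] = {
    "branch": ["git", "rev-parse", "--abbrev-ref", "HEAD"],
    "commit": ["git", "rev-parse", "HEAD"],
    "status": ["git", "status", "--porcelain"],
    "diff": ["git", "diff"],
}


def run_git(argv: List[str]) -> str:
    """Run one Git command in the current directory and return its stdout."""
    result = subprocess.run(argv, capture_output=True, text=True, check=True)
    return result.stdout


def capture_git_evidence(run: Callable[[List[str]], str] = run_git) -> Dict[str, str]:
    """Collect branch, commit, status, and diff text into a plain dict."""
    evidence = {name: run(argv) for name, argv in GIT_QUERIES.items()}
    # Branch and commit are single-line values; drop the trailing newline.
    evidence["branch"] = evidence["branch"].strip()
    evidence["commit"] = evidence["commit"].strip()
    return evidence
```

The `run` parameter exists only so the function can be exercised without a real repository; calling `capture_git_evidence()` with no arguments inside a Git workspace uses the real `git` binary.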

Install for Development

python -m pip install -e .

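You can also run without installation. With the package under src/, this likely requires src on PYTHONPATH, as in the test command under Development: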

python -m agenttrace --help

Commands

Initialize AgentTrace in a Git repository:

agenttrace init

Start a run:

agenttrace start "add input validation to the CLI" --tool Codex --model gpt-5

Capture current Git evidence:

agenttrace snapshot

Run and record a test/build command:

agenttrace add-test "python -m unittest"

Record manual test evidence without executing a command:

agenttrace add-test "manual QA in staging" --no-execute --note "Reviewer confirmed happy path."

Append review notes:

agenttrace add-review review-notes.md

Link the active run to a GitHub pull request:

agenttrace pr 123 --title "Add input validation" --base main --head feature/input-validation

Record evaluation evidence:

agenttrace eval benchmark-42 --score 94.5 --regression passed --note "No benchmark regression."

Generate the final report:

agenttrace report

Generate a local dashboard:

agenttrace dashboard
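The commands above can be chained from a script. The sketch below uses only subcommands and flags shown in this README; the helper names `build_workflow` and `run_workflow` are hypothetical. It assumes `agenttrace init` has already been run in the repository and that `agenttrace` exits with a non-zero code on failure, which this README does not state.

```python
import subprocess
from typing import List


def build_workflow(task: str, test_command: str) -> List[List[str]]:
    """Return the argv lists for a basic start-to-report sequence."""
    return [
        ["agenttrace", "start", task, "--tool", "Codex", "--model", "gpt-5"],
        ["agenttrace", "snapshot"],
        ["agenttrace", "add-test", test_command],
        ["agenttrace", "report"],
    ]


def run_workflow(steps: List[List[str]]) -> None:
    """Run each step in order, stopping at the first failing command."""
    for argv in steps:
        # check=True raises CalledProcessError on a non-zero exit code
        # (assumes agenttrace signals failure that way).
        subprocess.run(argv, check=True)
```

For example, `run_workflow(build_workflow("add input validation to the CLI", "python -m unittest"))` would run the four steps in sequence. Passing each command as an argument list avoids shell quoting problems in the task description.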

Output Structure

.agenttrace/
  dashboard.html
  config.json
  runs/
    20260518-120000-add-input-validation/
      run.json
      diff.patch
      status.txt
      tests.md
      review.md
      report.md
      pr-description.md
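Because the output is a plain directory tree, it can be inspected with ordinary file tools. The following sketch lists which of the files shown above are absent from each run directory. The function name is hypothetical, and it is an assumption that every run is expected to contain every file; some may only appear after the corresponding command has been run.

```python
from pathlib import Path
from typing import Dict, List

# File names taken from the layout shown in this README.
EXPECTED_FILES = [
    "run.json", "diff.patch", "status.txt", "tests.md",
    "review.md", "report.md", "pr-description.md",
]


def missing_artifacts(repo_root: Path) -> Dict[str, List[str]]:
    """Map each run directory name to the expected files it lacks."""
    runs_dir = repo_root / ".agenttrace" / "runs"
    if not runs_dir.is_dir():
        return {}
    report: Dict[str, List[str]] = {}
    for run_dir in sorted(p for p in runs_dir.iterdir() if p.is_dir()):
        report[run_dir.name] = [
            name for name in EXPECTED_FILES if not (run_dir / name).is_file()
        ]
    return report
```

Calling `missing_artifacts(Path("."))` from the repository root returns an empty dict when no runs have been recorded.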

MVP Limitations

Evidence is local to the repository and not uploaded anywhere.
Test commands execute with the current user's shell environment.
The report truncates very large diffs for readability.
GitHub PR integration records local metadata only; it does not call the GitHub API.
EvalOps integration records local benchmark metadata only; it does not call an external eval service.
The dashboard is a static local HTML file, not a hosted web service.
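To illustrate the diff truncation limitation, here is one simple way a report generator could shorten a large diff. This README does not describe how AgentTrace actually truncates, so the line limit, the omission marker, and the function name are all assumptions made for this sketch.

```python
def truncate_diff(diff_text: str, max_lines: int = 200) -> str:
    """Keep the first max_lines lines and note how many were omitted."""
    lines = diff_text.splitlines()
    if len(lines) <= max_lines:
        # Small diffs pass through unchanged.
        return diff_text
    omitted = len(lines) - max_lines
    kept = "\n".join(lines[:max_lines])
    # Hypothetical marker format; the real report may word this differently.
    return f"{kept}\n... ({omitted} more lines omitted)"
```

Truncating by line count rather than by bytes keeps each remaining diff line intact, at the cost of a less predictable output size.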

Development

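Run the test suite (PowerShell):

$env:PYTHONPATH='src'; python -m unittest discover -s tests

On a POSIX shell such as bash or zsh, the equivalent is:

PYTHONPATH=src python -m unittest discover -s tests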

See docs/workflow.md for a complete example workflow.

Beginner User Guides

If you are new to AgentTrace, start with the beginner guides in docs/user-guides/README.md.
