GitHub - luoyuctl/agenttrace: TUI observability for AI coding agents: trace cost, tokens, tool failures, latency, anomalies, health, diffs, and CI gates across Claude Code, Codex CLI, Gemini CLI, Aider, Cursor exports, and more.

💸 Stop burning cash and hours on invisible AI agent waste

What is agenttrace?

AI coding agents (Claude Code, Gemini CLI, Codex CLI) burn tokens in loops, retry failures silently, and leave you with a surprise bill. You're wasting money on dead tokens and time on broken sessions — and you can't even see where.

agenttrace finds the waste in both — so you stop paying for nothing and start shipping faster.

Site: https://luoyuctl.github.io/agenttrace/

Sample HTML report: https://luoyuctl.github.io/agenttrace/demo-report.html

Featured in: Awesome Codex CLI, Awesome Gemini CLI, Charm in the Wild, and Awesome Claude Code and Skills.

The GIF follows the shortest first-run path: demo data -> critical sessions -> detail -> diagnostics -> overview.

Sharing agenttrace? See docs/demo-playbook.md for the GIF recording script, launch storyline, and posting copy.

Why it exists

AI agents now behave like tiny build systems: they plan, call tools, retry, hang, and spend money while doing it. Most teams only see the final output, not the session health, token burn, tool failure rate, or whether the agent got stuck. agenttrace gives that missing operational view in the terminal.

Where it fits

agenttrace is not a hosted tracing backend or another chat client. It is a local inspection tool for the logs your coding agents already leave on disk.

If you need...	Use agenttrace for...
Local-first privacy	Inspect sessions without uploading prompts, code, or tool logs
Fast terminal triage	Open a TUI, sort bad sessions, and jump into detail/diagnostics
Cross-agent comparison	Compare Claude Code, Codex CLI, Gemini CLI, Qwen Code, Cline, Aider, Cursor exports, Hermes, OpenCode, Oh My Pi, Kimi, and more
Cost and token evidence	See cost, token usage, cache usage, retries, loops, latency, and health in one place
Workflow improvement	Mine local sessions for repeated tool failures, hanging gaps, and costly loops before updating prompts, skills, or project instructions
CI guardrails	Export JSON/Markdown/HTML and fail builds on low health or high tool failure rates

What it catches

Pain	What agenttrace shows
Surprise token bills	input/output/cache tokens, estimated cost, top token-heavy agents
Silent tool loops	repeated tool calls, retry loops, long gaps, hanging sessions
Slow agents	P50/P95/P99 latency, per-tool latency ranking, timeout-like gaps
Quality regressions	health score, anomaly types, shallow reasoning, redacted thinking
Hard-to-compare tools	session diff across Claude Code, Codex CLI, Gemini CLI, Qwen Code, Cline, Aider, Cursor exports, Oh My Pi, and more
Brittle prompts or skills	`tool_fail_rate`, anomaly mix, and expensive sessions that point to the workflow note, skill, or command wrapper to fix next
CI blind spots	JSON reports and health gates for average health, critical sessions, and tool failure rate

✨ Features

Feature	Description
🚀 Single Binary	7.5 MB — `curl -sL ... \| sh` install, no runtime deps
🖥️ Bubble Tea TUI	Modern terminal UI: Overview → Session List → Detail → Diagnostics → Diff
⚡ Persistent Cache	Incremental session cache avoids a full disk parse on every startup
🩺 Doctor Mode	`--doctor` checks detected agent dirs, cache health, and next steps
⌨️ Command Mode	`:health <80`, `:cost >0.1`, `:sort cost desc`, `:anomalies`
🔍 Multi-Format Auto-Detect	Claude Code / Codex CLI / Gemini CLI / Qwen Code / Cline / Aider / Cursor exports / Hermes / OpenCode / OpenClaw / Oh My Pi / Kimi / Copilot-style logs
💸 Cost & Time Waste	How much 💰 you burned + ⏱️ time lost to loops, retries, failures
🚨 6 Anomaly Types	Hanging, tool failures, latency spikes, shallow thinking, redaction, zero-tool sessions
📊 Multi-Session Comparison	Compare across sessions and tools in one table
💯 Health Score	0-100 composite with visual bar and emoji
🤖 Machine Readable	JSON output for CI/CD and automation
🌐 Shareable Reports	Self-contained HTML overview reports for CI artifacts, issues, and demos

🚀 Quick Start

Not sure which install path to use? Start with the one-liner for a quick local trial. Use Homebrew when you want managed upgrades, Go install when you already live in the Go toolchain, and manual build when you want to inspect or change the source first.

One-liner install

Recommended for first-time Linux/macOS users who want to try the TUI quickly:

# Linux / macOS
curl -sL https://raw.githubusercontent.com/luoyuctl/agenttrace/master/install.sh | sh

Windows users can use the PowerShell installer:

# Windows (PowerShell)
iwr -useb https://raw.githubusercontent.com/luoyuctl/agenttrace/master/install.ps1 | iex

Homebrew (macOS / Linux)

Recommended when you prefer package-manager upgrades:

brew install luoyuctl/tap/agenttrace

Go install

Recommended when your $GOBIN or $GOPATH/bin is already on PATH:

go install github.com/luoyuctl/agenttrace/cmd/agenttrace@latest

60-second value check

After installing, run the shortest path before wiring agenttrace into a real workflow:

# See the TUI without needing local agent logs
agenttrace --demo

# Confirm which local session directories and cache state agenttrace can see
agenttrace --doctor

# Preview machine-readable evidence for CI, dashboards, or PR notes
agenttrace --demo --overview -f json

If those outputs show the pain you care about, try the real local scan:

agenttrace --overview \
  --fail-under-health 80 \
  --fail-on-critical \
  --max-tool-fail-rate 15

Codex plugin

This repo includes a Codex plugin manifest and skill so Codex can use agenttrace to audit local AI agent session logs:

.codex-plugin/plugin.json
skills/agenttrace-session-audit/SKILL.md

npm

The npm wrapper is prepared in npm/, but the public package is not published yet. Use the one-liner, Homebrew, or manual build for now.

Manual build

git clone https://github.com/luoyuctl/agenttrace.git
cd agenttrace
go build -ldflags="-s -w" -o agenttrace ./cmd/agenttrace/
sudo mv agenttrace /usr/local/bin/

Usage

# Launch TUI dashboard (default, no flags)
agenttrace

# Try the TUI with built-in sample sessions
agenttrace --demo

# Diagnose local session discovery and cache status
agenttrace --doctor

# Import a Cursor workspace export
agenttrace cursor-export.json

# Analyze latest session
agenttrace --latest

# Compare all sessions
agenttrace --compare -d ~/.hermes/sessions

# JSON output (CI/CD)
agenttrace --latest -f json

# Global fleet overview as JSON
agenttrace --overview -f json -o agenttrace-overview.json

# Markdown report for PR comments or CI artifacts
agenttrace --overview -f markdown -o agenttrace-overview.md

# Self-contained HTML report for sharing
agenttrace --overview -f html -o agenttrace-overview.html

# CI health gate
agenttrace --overview --fail-under-health 80 --fail-on-critical --max-tool-fail-rate 15

# Demo JSON for screenshots, CI examples, or first-time evaluation
agenttrace --demo --overview -f json

# Doctor JSON for support tickets or CI setup checks
agenttrace --doctor -f json

# List all model pricings (900+ from LiteLLM when cached)
agenttrace --list-models

# Update pricing from LiteLLM community database
agenttrace --update-pricing

# Update + list in one go
agenttrace --update-pricing --list-models

# Specify session language for cost estimation
agenttrace --latest --lang zh    # Chinese (supports zh, en)

Cursor Import

Cursor keeps local composer/chat state in SQLite state.vscdb files. Export the relevant JSON keys once, then point agenttrace at the exported file:

db="$HOME/Library/Application Support/Cursor/User/workspaceStorage/<workspace-id>/state.vscdb"
sqlite3 "$db" "select json_group_object(key, json(value)) from ItemTable where key in ('aiService.prompts','aiService.generations','composer.composerData');" > cursor-export.json
agenttrace cursor-export.json

See docs/cursor-import.md for details.

TUI Navigation

Key	Action
`↑↓` / `jk`	Navigate sessions
`Enter`	View session detail
`Tab`	Switch view: Overview → List → Detail → Diagnostics → Diff
`0`-`4`	Jump directly to a view
`h` / `c` / `t` / `e` / `a` / `n`	Sort by health / cost / turns / failures / anomalies / name
`f` / `s` / `/`	Filter by health / source / text
`:`	Command mode
`?`	Open the keyboard shortcut map
`d` / `w`	Open diff / diagnostics
`ctrl+r`	Force reload and rebuild local cache
`q` / `Esc`	Quit / Back

📊 Sample Output

━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
  AGENTTRACE v0.3.48 — AI Agent Session Performance Report
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━

💰 TOKEN COST
────────────────────────────────────────
  Input:             1,342  tokens
  Output:            3,947  tokens
  ────────────────────────────────────
  Total tokens:      5,289
  Est. cost:    $     0.0632  (model: claude-sonnet-4)

📊 ACTIVITY
────────────────────────────────────────
  Messages:    2 user  |  42 turns
  Tool calls:  70
  Success:     91% (64/70)

⏱️  LATENCY
────────────────────────────────────────
  min:     12.3s
  median:  457.9s
  p95:     720.1s
  max:     901.0s
  avg:     358.4s
  Duration: 15.4m

🧠 THINKING / COT
────────────────────────────────────────
  Blocks: 20
  Avg:    392 chars
  Total:  7,840 chars
  Quality: 🔴 shallow

🚨 ANOMALIES
────────────────────────────────────────
  🔴 [HIGH] hanging: 1 gap(s) >60s, max=901s
  🟡 [MEDIUM] shallow_thinking: avg reasoning = 392 chars

💯 HEALTH SCORE
────────────────────────────────────────
  🟢  90/100  [██████████████████░░]

🎯 Anomaly Detection

Type	Trigger	Severity
🔴 Hanging	Event gap > 60s	high/medium
🔴 Tool Failures	Failure rate > 20%	high
🔴 Latency Spikes	p95 latency > 120s	low/medium
🟡 Shallow Thinking	Avg reasoning < 500 chars	high/medium
🟡 Redaction	Redacted thinking blocks	medium
🟡 No Tools	3+ turns with zero tool calls	medium

📈 Multi-Session Comparison

===============================================================
  AGENTTRACE — Multi-Session Comparison (12 sessions)
===============================================================
Session                   Turns  Tools   Succ     Cost  Health
---------------------------------------------------------------
20260501_103809_71476f6d     42     70    91%  $0.0632   90/100
20260501_084515_a1b2c3d4     18     25    96%  $0.0315   95/100
20260430_192030_e5f6g7h8     65    110    78%  $0.1240   65/100 ⚠️
===============================================================

💡 Use Cases

CI/CD Gate — fail builds when agent sessions degrade below health threshold
Cost Audit — find which sessions are burning tokens uselessly
Tool Benchmarking — compare Claude Code vs Gemini CLI objectively
Quality Monitoring — detect when your agent starts hallucinating or hanging
Team Insights — track agent performance across developers

🗺️ Roadmap

Done

Installer paths: curl script, prebuilt binaries, Homebrew, and prepared npm wrapper
Release safety: GitHub Actions, release pipeline, and CI health gates
Session intelligence: historical trends, cost audit, health gates, and shareable reports
Parser coverage: Claude Code, Codex CLI, Gemini CLI, Qwen Code, Cline, Aider, Cursor exports, Hermes, OpenCode, OpenClaw, Oh My Pi, Kimi, and Copilot-style logs

Next

Publish npm package
Web dashboard for report exploration
VS Code extension

See CI Integration for a ready-to-copy GitHub Actions health gate.

📣 Launch Kit

Planning to share or collect feedback? See docs/launch-kit.md for positioning, launch posts, short social copy, target communities, and demo checklist.

Feedback and workflow examples are welcome in GitHub Discussions.

🧩 Add a Parser

Want agenttrace to support another coding agent? Start with docs/parser-guide.md. A good parser PR usually includes:

a tiny redacted fixture or synthetic sample
format detection in DetectFormat
role, timestamp, model, token usage, tool call, and tool error extraction
tests for successful parsing and malformed input

🏗️ Architecture

.
├── cmd/agenttrace/main.go      # CLI entry: flags, TUI/CLI dispatch
└── internal/
    ├── engine/                 # parsers, pricing, anomalies, reports, cache
    ├── index/                  # incremental local session index
    ├── i18n/                   # bilingual UI/report strings
    └── tui/                    # Bubble Tea TUI views, command mode, tests

🤝 Contributing

See CONTRIBUTING.md for contribution flow, validation commands, parser expectations, privacy guidance, and the code of conduct.

git clone https://github.com/luoyuctl/agenttrace.git
cd agenttrace
go test ./...              # verify behavior and rendering constraints
go build -o agenttrace ./cmd/agenttrace/
./agenttrace --latest      # smoke test
./agenttrace --doctor      # verify local discovery and cache status

📄 License

_{Built with ❤️ for the AI engineering community}

Name		Name	Last commit message	Last commit date
Latest commit History 256 Commits
.codex-plugin		.codex-plugin
.github		.github
assets		assets
cmd/agenttrace		cmd/agenttrace
docs		docs
examples/github-actions		examples/github-actions
homebrew		homebrew
internal		internal
npm		npm
scripts		scripts
site		site
skills/agenttrace-session-audit		skills/agenttrace-session-audit
testdata		testdata
.codexignore		.codexignore
.gitignore		.gitignore
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
PRIVACY.md		PRIVACY.md
README.md		README.md
SECURITY.md		SECURITY.md
go.mod		go.mod
go.sum		go.sum
install.ps1		install.ps1
install.sh		install.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

💸 Stop burning cash and hours on invisible AI agent waste

What is agenttrace?

Why it exists

Where it fits

What it catches

✨ Features

🚀 Quick Start

One-liner install

Homebrew (macOS / Linux)

Go install

60-second value check

Codex plugin

npm

Manual build

Usage

Cursor Import

TUI Navigation

📊 Sample Output

🎯 Anomaly Detection

📈 Multi-Session Comparison

💡 Use Cases

🗺️ Roadmap

📣 Launch Kit

🧩 Add a Parser

🏗️ Architecture

🤝 Contributing

📄 License

About

Uh oh!

Releases 50

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

💸 Stop burning cash and hours on invisible AI agent waste

What is agenttrace?

Why it exists

Where it fits

What it catches

✨ Features

🚀 Quick Start

One-liner install

Homebrew (macOS / Linux)

Go install

60-second value check

Codex plugin

npm

Manual build

Usage

Cursor Import

TUI Navigation

📊 Sample Output

🎯 Anomaly Detection

📈 Multi-Session Comparison

💡 Use Cases

🗺️ Roadmap

📣 Launch Kit

🧩 Add a Parser

🏗️ Architecture

🤝 Contributing

📄 License

About

Topics

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 50

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages