luoyuctl/agenttrace

agenttrace — find where your AI agents waste money & time

💸 Stop burning cash and hours on invisible AI agent waste


What is agenttrace?

AI coding agents (Claude Code, Gemini CLI, Codex CLI) burn tokens in loops, retry failures silently, and leave you with a surprise bill. You're wasting money on dead tokens and time on broken sessions — and you can't even see where.

agenttrace finds both the wasted money and the wasted time, so you stop paying for nothing and start shipping faster.

Site: https://luoyuctl.github.io/agenttrace/

Sample HTML report: https://luoyuctl.github.io/agenttrace/demo-report.html

Featured in: Awesome Codex CLI, Awesome Gemini CLI, Charm in the Wild, and Awesome Claude Code and Skills.

[GIF: agenttrace TUI demo]

The GIF follows the shortest first-run path: demo data -> critical sessions -> detail -> diagnostics -> overview.

Sharing agenttrace? See docs/demo-playbook.md for the GIF recording script, launch storyline, and posting copy.

Why it exists

AI agents now behave like tiny build systems: they plan, call tools, retry, hang, and spend money while doing it. Most teams only see the final output, not the session health, token burn, tool failure rate, or whether the agent got stuck. agenttrace gives that missing operational view in the terminal.

Where it fits

agenttrace is not a hosted tracing backend or another chat client. It is a local inspection tool for the logs your coding agents already leave on disk.

| If you need... | Use agenttrace for... |
| --- | --- |
| Local-first privacy | Inspect sessions without uploading prompts, code, or tool logs |
| Fast terminal triage | Open a TUI, sort bad sessions, and jump into detail/diagnostics |
| Cross-agent comparison | Compare Claude Code, Codex CLI, Gemini CLI, Qwen Code, Cline, Aider, Cursor exports, Hermes, OpenCode, Oh My Pi, Kimi, and more |
| Cost and token evidence | See cost, token usage, cache usage, retries, loops, latency, and health in one place |
| Workflow improvement | Mine local sessions for repeated tool failures, hanging gaps, and costly loops before updating prompts, skills, or project instructions |
| CI guardrails | Export JSON/Markdown/HTML and fail builds on low health or high tool failure rates |

What it catches

| Pain | What agenttrace shows |
| --- | --- |
| Surprise token bills | input/output/cache tokens, estimated cost, top token-heavy agents |
| Silent tool loops | repeated tool calls, retry loops, long gaps, hanging sessions |
| Slow agents | P50/P95/P99 latency, per-tool latency ranking, timeout-like gaps |
| Quality regressions | health score, anomaly types, shallow reasoning, redacted thinking |
| Hard-to-compare tools | session diff across Claude Code, Codex CLI, Gemini CLI, Qwen Code, Cline, Aider, Cursor exports, Oh My Pi, and more |
| Brittle prompts or skills | tool_fail_rate, anomaly mix, and expensive sessions that point to the workflow note, skill, or command wrapper to fix next |
| CI blind spots | JSON reports and health gates for average health, critical sessions, and tool failure rate |
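
For readers unfamiliar with the P50/P95/P99 latency figures listed above, here is a minimal Go sketch of a nearest-rank percentile over a list of latencies. The `percentile` function and the sample values are hypothetical, and nearest-rank is only one common definition; agenttrace may compute its percentiles differently.

```go
package main

import (
	"fmt"
	"sort"
)

// percentile returns the nearest-rank percentile p (1..100) of samples,
// or 0 for an empty slice. Nearest-rank is an assumption for illustration.
func percentile(samples []float64, p int) float64 {
	if len(samples) == 0 {
		return 0
	}
	// Sort a copy so the caller's slice keeps its order.
	sorted := append([]float64(nil), samples...)
	sort.Float64s(sorted)
	// Integer ceiling of p/100 * n avoids floating-point rounding surprises.
	rank := (p*len(sorted) + 99) / 100
	if rank < 1 {
		rank = 1
	}
	if rank > len(sorted) {
		rank = len(sorted)
	}
	return sorted[rank-1]
}

func main() {
	// Hypothetical per-call latencies in seconds.
	latencies := []float64{1.2, 0.8, 2.5, 0.9, 30.0, 1.1, 1.4, 0.7, 1.0, 95.0}
	for _, p := range []int{50, 95, 99} {
		fmt.Printf("p%d = %.1fs\n", p, percentile(latencies, p))
	}
}
```

A single slow call barely moves the median but dominates P95 and P99, which is why tail percentiles are the useful signal for hanging or timeout-like sessions.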

✨ Features

| Feature | Description |
| --- | --- |
| 🚀 Single Binary | 7.5 MB, curl -sL ... \| sh install, no runtime deps |
| 🖥️ Bubble Tea TUI | Modern terminal UI: Overview → Session List → Detail → Diagnostics → Diff |
| Persistent Cache | Incremental session cache avoids a full disk parse on every startup |
| 🩺 Doctor Mode | --doctor checks detected agent dirs, cache health, and next steps |
| ⌨️ Command Mode | :health <80, :cost >0.1, :sort cost desc, :anomalies |
| 🔍 Multi-Format Auto-Detect | Claude Code / Codex CLI / Gemini CLI / Qwen Code / Cline / Aider / Cursor exports / Hermes / OpenCode / OpenClaw / Oh My Pi / Kimi / Copilot-style logs |
| 💸 Cost & Time Waste | How much 💰 you burned + ⏱️ time lost to loops, retries, failures |
| 🚨 6 Anomaly Types | Hanging, tool failures, latency spikes, shallow thinking, redaction, zero-tool sessions |
| 📊 Multi-Session Comparison | Compare across sessions and tools in one table |
| 💯 Health Score | 0-100 composite with visual bar and emoji |
| 🤖 Machine Readable | JSON output for CI/CD and automation |
| 🌐 Shareable Reports | Self-contained HTML overview reports for CI artifacts, issues, and demos |
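
To make the Command Mode examples such as `:health <80` and `:cost >0.1` concrete, the following Go sketch parses a threshold command into a field, an operator, and a number. The `filter` type, `parseFilter`, and `match` are hypothetical names for illustration; this is not agenttrace's command-mode implementation, and it deliberately ignores non-threshold commands such as `:sort` and `:anomalies`.

```go
package main

import (
	"fmt"
	"strconv"
	"strings"
)

// filter is a hypothetical parsed form of a command such as ":health <80".
type filter struct {
	Field string // e.g. "health" or "cost"
	Op    byte   // '<' or '>'
	Value float64
}

// parseFilter parses ":<field> <op><number>". Illustrative only.
func parseFilter(cmd string) (filter, error) {
	parts := strings.Fields(strings.TrimPrefix(strings.TrimSpace(cmd), ":"))
	if len(parts) != 2 || len(parts[1]) < 2 {
		return filter{}, fmt.Errorf("want \":field <op><number>\", got %q", cmd)
	}
	op := parts[1][0]
	if op != '<' && op != '>' {
		return filter{}, fmt.Errorf("unsupported operator %q", string(op))
	}
	v, err := strconv.ParseFloat(parts[1][1:], 64)
	if err != nil {
		return filter{}, fmt.Errorf("bad number in %q: %w", cmd, err)
	}
	return filter{Field: parts[0], Op: op, Value: v}, nil
}

// match reports whether x satisfies the filter's comparison.
func (f filter) match(x float64) bool {
	if f.Op == '<' {
		return x < f.Value
	}
	return x > f.Value
}

func main() {
	for _, cmd := range []string{":health <80", ":cost >0.1"} {
		f, err := parseFilter(cmd)
		if err != nil {
			fmt.Println("error:", err)
			continue
		}
		fmt.Printf("%s -> field=%s op=%c value=%g\n", cmd, f.Field, f.Op, f.Value)
	}
}
```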

🚀 Quick Start

Not sure which install path to use? Start with the one-liner for a quick local trial. Use Homebrew when you want managed upgrades, Go install when you already live in the Go toolchain, and manual build when you want to inspect or change the source first.

One-liner install

Recommended for first-time Linux/macOS users who want to try the TUI quickly:

# Linux / macOS
curl -sL https://raw.githubusercontent.com/luoyuctl/agenttrace/master/install.sh | sh

Windows users can use the PowerShell installer:

# Windows (PowerShell)
iwr -useb https://raw.githubusercontent.com/luoyuctl/agenttrace/master/install.ps1 | iex

Homebrew (macOS / Linux)

Recommended when you prefer package-manager upgrades:

brew install luoyuctl/tap/agenttrace

Go install

Recommended when your $GOBIN or $GOPATH/bin is already on PATH:

go install github.com/luoyuctl/agenttrace/cmd/agenttrace@latest

60-second value check

After installing, run the shortest path before wiring agenttrace into a real workflow:

# See the TUI without needing local agent logs
agenttrace --demo

# Confirm which local session directories and cache state agenttrace can see
agenttrace --doctor

# Preview machine-readable evidence for CI, dashboards, or PR notes
agenttrace --demo --overview -f json

If those outputs show the pain you care about, try the real local scan:

agenttrace --overview \
  --fail-under-health 80 \
  --fail-on-critical \
  --max-tool-fail-rate 15
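
The three gate flags above map onto three checks: average health, critical sessions, and tool failure rate. The Go sketch below shows that logic in isolation. The `overview` struct, its field names, and the strict `<` and `>` comparisons are assumptions for illustration, not agenttrace's JSON schema or its exact boundary behavior.

```go
package main

import "fmt"

// overview holds the three aggregate numbers the gate flags refer to.
// Field names are hypothetical, not agenttrace's JSON schema.
type overview struct {
	AvgHealth        float64 // 0-100
	CriticalSessions int
	ToolFailRatePct  float64 // percent of tool calls that failed
}

// gate lists every violated threshold; an empty result means the gate passes.
func gate(o overview, minHealth, maxToolFailPct float64, failOnCritical bool) []string {
	var violations []string
	if o.AvgHealth < minHealth {
		violations = append(violations,
			fmt.Sprintf("average health %.1f is below %.0f", o.AvgHealth, minHealth))
	}
	if failOnCritical && o.CriticalSessions > 0 {
		violations = append(violations,
			fmt.Sprintf("%d critical session(s)", o.CriticalSessions))
	}
	if o.ToolFailRatePct > maxToolFailPct {
		violations = append(violations,
			fmt.Sprintf("tool failure rate %.1f%% exceeds %.0f%%", o.ToolFailRatePct, maxToolFailPct))
	}
	return violations
}

func main() {
	// Hypothetical aggregate values; thresholds mirror the flags above.
	o := overview{AvgHealth: 83.3, CriticalSessions: 1, ToolFailRatePct: 12.0}
	violations := gate(o, 80, 15, true)
	if len(violations) == 0 {
		fmt.Println("gate passed")
		return
	}
	for _, v := range violations {
		fmt.Println("FAIL:", v) // a CI wrapper would exit non-zero here
	}
}
```

Reporting every violation rather than stopping at the first one makes a failed build easier to act on.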

Codex plugin

This repo includes a Codex plugin manifest and skill so Codex can use agenttrace to audit local AI agent session logs:

  • .codex-plugin/plugin.json
  • skills/agenttrace-session-audit/SKILL.md

npm

The npm wrapper is prepared in npm/, but the public package is not published yet. Use the one-liner, Homebrew, Go install, or manual build for now.

Manual build

git clone https://github.com/luoyuctl/agenttrace.git
cd agenttrace
go build -ldflags="-s -w" -o agenttrace ./cmd/agenttrace/
sudo mv agenttrace /usr/local/bin/

Usage

# Launch TUI dashboard (default, no flags)
agenttrace

# Try the TUI with built-in sample sessions
agenttrace --demo

# Diagnose local session discovery and cache status
agenttrace --doctor

# Import a Cursor workspace export
agenttrace cursor-export.json

# Analyze latest session
agenttrace --latest

# Compare all sessions
agenttrace --compare -d ~/.hermes/sessions

# JSON output (CI/CD)
agenttrace --latest -f json

# Global fleet overview as JSON
agenttrace --overview -f json -o agenttrace-overview.json

# Markdown report for PR comments or CI artifacts
agenttrace --overview -f markdown -o agenttrace-overview.md

# Self-contained HTML report for sharing
agenttrace --overview -f html -o agenttrace-overview.html

# CI health gate
agenttrace --overview --fail-under-health 80 --fail-on-critical --max-tool-fail-rate 15

# Demo JSON for screenshots, CI examples, or first-time evaluation
agenttrace --demo --overview -f json

# Doctor JSON for support tickets or CI setup checks
agenttrace --doctor -f json

# List pricing for all known models (900+ from LiteLLM when cached)
agenttrace --list-models

# Update pricing from LiteLLM community database
agenttrace --update-pricing

# Update + list in one go
agenttrace --update-pricing --list-models

# Specify session language for cost estimation
agenttrace --latest --lang zh    # Chinese (supports zh, en)

Cursor Import

Cursor keeps local composer/chat state in SQLite state.vscdb files. Export the relevant JSON keys once, then point agenttrace at the exported file:

db="$HOME/Library/Application Support/Cursor/User/workspaceStorage/<workspace-id>/state.vscdb"
sqlite3 "$db" "select json_group_object(key, json(value)) from ItemTable where key in ('aiService.prompts','aiService.generations','composer.composerData');" > cursor-export.json
agenttrace cursor-export.json

See docs/cursor-import.md for details.

TUI Navigation

| Key | Action |
| --- | --- |
| ↑↓ / jk | Navigate sessions |
| Enter | View session detail |
| Tab | Switch view: Overview → List → Detail → Diagnostics → Diff |
| 0-4 | Jump directly to a view |
| h / c / t / e / a / n | Sort by health / cost / turns / failures / anomalies / name |
| f / s / / | Filter by health / source / text |
| : | Command mode |
| ? | Open the keyboard shortcut map |
| d / w | Open diff / diagnostics |
| ctrl+r | Force reload and rebuild local cache |
| q / Esc | Quit / Back |

📊 Sample Output

━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
  AGENTTRACE v0.3.48 — AI Agent Session Performance Report
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━

💰 TOKEN COST
────────────────────────────────────────
  Input:             1,342  tokens
  Output:            3,947  tokens
  ────────────────────────────────────
  Total tokens:      5,289
  Est. cost:    $     0.0632  (model: claude-sonnet-4)

📊 ACTIVITY
────────────────────────────────────────
  Messages:    2 user  |  42 turns
  Tool calls:  70
  Success:     91% (64/70)

⏱️  LATENCY
────────────────────────────────────────
  min:     12.3s
  median:  457.9s
  p95:     720.1s
  max:     901.0s
  avg:     358.4s
  Duration: 15.4m

🧠 THINKING / COT
────────────────────────────────────────
  Blocks: 20
  Avg:    392 chars
  Total:  7,840 chars
  Quality: 🔴 shallow

🚨 ANOMALIES
────────────────────────────────────────
  🔴 [HIGH] hanging: 1 gap(s) >60s, max=901s
  🟡 [MEDIUM] shallow_thinking: avg reasoning = 392 chars

💯 HEALTH SCORE
────────────────────────────────────────
  🟢  90/100  [██████████████████░░]
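
The arithmetic behind the sample report can be cross-checked with a short Go sketch. The per-million-token prices below ($3 input, $15 output for claude-sonnet-4) are an assumption, not values stated in the report, and `estimateCost` and `healthBar` are hypothetical helpers rather than agenttrace functions. The 20-cell bar width is read off the sample output.

```go
package main

import (
	"fmt"
	"strings"
)

// Assumed USD prices per million tokens for claude-sonnet-4.
// Real pricing data (for example from LiteLLM) may differ.
const (
	inputPricePerM  = 3.0
	outputPricePerM = 15.0
)

// estimateCost returns the USD cost for the given token counts.
func estimateCost(inputTokens, outputTokens int) float64 {
	return float64(inputTokens)*inputPricePerM/1e6 +
		float64(outputTokens)*outputPricePerM/1e6
}

// healthBar renders a score clamped to 0..100 as a bar of width cells.
func healthBar(score, width int) string {
	if score < 0 {
		score = 0
	}
	if score > 100 {
		score = 100
	}
	filled := score * width / 100
	return "[" + strings.Repeat("█", filled) + strings.Repeat("░", width-filled) + "]"
}

func main() {
	// Token counts, tool-call counts, and score are taken from the sample report.
	fmt.Printf("Est. cost: $%.4f\n", estimateCost(1342, 3947))
	fmt.Printf("Success:   %.0f%% (%d/%d)\n", 100*64.0/70.0, 64, 70)
	fmt.Printf("Health:    %d/100 %s\n", 90, healthBar(90, 20))
}
```

Under these assumed prices the estimate comes to about $0.0632, which matches the figure in the sample report; output tokens account for most of it.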

🎯 Anomaly Detection

| Type | Trigger | Severity |
| --- | --- | --- |
| 🔴 Hanging | Event gap > 60s | high/medium |
| 🔴 Tool Failures | Failure rate > 20% | high |
| 🔴 Latency Spikes | p95 latency > 120s | low/medium |
| 🟡 Shallow Thinking | Avg reasoning < 500 chars | high/medium |
| 🟡 Redaction | Redacted thinking blocks | medium |
| 🟡 No Tools | 3+ turns with zero tool calls | medium |
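
The triggers in the table can be read as simple threshold rules. The Go sketch below encodes them under stated assumptions: `sessionStats` and its fields are hypothetical, only `hanging` and `shallow_thinking` are labels that appear in the sample report (the other labels are invented here), "No Tools" is read as a session with at least three turns and no tool calls at all, and severity is left out because the table does not say how high/medium/low is chosen.

```go
package main

import "fmt"

// sessionStats is a hypothetical per-session summary; field names are illustrative.
type sessionStats struct {
	MaxGapSec         float64 // longest gap between consecutive events
	ToolCalls         int
	ToolFailures      int
	P95LatencySec     float64
	AvgReasoningChars float64
	ThinkingBlocks    int
	RedactedBlocks    int
	Turns             int
}

// detectAnomalies applies the table's triggers and returns matching labels.
func detectAnomalies(s sessionStats) []string {
	var out []string
	if s.MaxGapSec > 60 {
		out = append(out, "hanging")
	}
	if s.ToolCalls > 0 && float64(s.ToolFailures)/float64(s.ToolCalls) > 0.20 {
		out = append(out, "tool_failures")
	}
	if s.P95LatencySec > 120 {
		out = append(out, "latency_spikes")
	}
	// Only judge reasoning depth when the session has thinking blocks at all.
	if s.ThinkingBlocks > 0 && s.AvgReasoningChars < 500 {
		out = append(out, "shallow_thinking")
	}
	if s.RedactedBlocks > 0 {
		out = append(out, "redaction")
	}
	if s.Turns >= 3 && s.ToolCalls == 0 {
		out = append(out, "no_tools")
	}
	return out
}

func main() {
	// Hypothetical session: one long gap and a 30% tool failure rate.
	s := sessionStats{
		MaxGapSec: 75, ToolCalls: 10, ToolFailures: 3,
		P95LatencySec: 40, AvgReasoningChars: 800, ThinkingBlocks: 5, Turns: 12,
	}
	fmt.Println(detectAnomalies(s))
}
```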

📈 Multi-Session Comparison

===============================================================
  AGENTTRACE — Multi-Session Comparison (12 sessions)
===============================================================
Session                   Turns  Tools   Succ     Cost  Health
---------------------------------------------------------------
20260501_103809_71476f6d     42     70    91%  $0.0632   90/100
20260501_084515_a1b2c3d4     18     25    96%  $0.0315   95/100
20260430_192030_e5f6g7h8     65    110    78%  $0.1240   65/100 ⚠️
===============================================================

💡 Use Cases

  • CI/CD Gate: fail builds when agent sessions degrade below a health threshold
  • Cost Audit: find which sessions are burning tokens uselessly
  • Tool Benchmarking: compare Claude Code vs Gemini CLI on the same metrics
  • Quality Monitoring: detect when your agent starts hanging, failing tool calls, or reasoning shallowly
  • Team Insights: track agent performance across developers

🗺️ Roadmap

Done

  • Installer paths: curl script, prebuilt binaries, Homebrew, and prepared npm wrapper
  • Release safety: GitHub Actions, release pipeline, and CI health gates
  • Session intelligence: historical trends, cost audit, health gates, and shareable reports
  • Parser coverage: Claude Code, Codex CLI, Gemini CLI, Qwen Code, Cline, Aider, Cursor exports, Hermes, OpenCode, OpenClaw, Oh My Pi, Kimi, and Copilot-style logs

Next

  • Publish npm package
  • Web dashboard for report exploration
  • VS Code extension

See CI Integration for a ready-to-copy GitHub Actions health gate.


📣 Launch Kit

Planning to share or collect feedback? See docs/launch-kit.md for positioning, launch posts, short social copy, target communities, and demo checklist.

Feedback and workflow examples are welcome in GitHub Discussions.


🧩 Add a Parser

Want agenttrace to support another coding agent? Start with docs/parser-guide.md. A good parser PR usually includes:

  • a tiny redacted fixture or synthetic sample
  • format detection in DetectFormat
  • role, timestamp, model, token usage, tool call, and tool error extraction
  • tests for successful parsing and malformed input
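
As a rough illustration of the format-detection and malformed-input points above, here is a Go sketch that recognizes a Cursor export by the three ItemTable keys named in the Cursor Import section. `looksLikeCursorExport` is a hypothetical helper; the real `DetectFormat` in agenttrace may have a different signature and use different signals, so treat docs/parser-guide.md as the authority.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// cursorKeys are the ItemTable keys named in the Cursor Import section.
var cursorKeys = []string{
	"aiService.prompts",
	"aiService.generations",
	"composer.composerData",
}

// looksLikeCursorExport reports whether data is a JSON object containing at
// least one Cursor export key. Hypothetical helper, not the real DetectFormat.
func looksLikeCursorExport(data []byte) bool {
	var top map[string]json.RawMessage
	if err := json.Unmarshal(data, &top); err != nil {
		return false // malformed JSON, or valid JSON that is not an object
	}
	for _, k := range cursorKeys {
		if _, ok := top[k]; ok {
			return true
		}
	}
	return false
}

func main() {
	samples := []string{
		`{"aiService.prompts": [], "composer.composerData": {}}`, // synthetic export
		`{"messages": []}`,        // some other format
		`{"aiService.prompts": [`, // truncated, malformed input
	}
	for _, s := range samples {
		fmt.Printf("cursor=%-5v %s\n", looksLikeCursorExport([]byte(s)), s)
	}
}
```

Decoding into `json.RawMessage` values keeps detection cheap, since nested payloads are not parsed until a parser actually needs them.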

🏗️ Architecture

.
├── cmd/agenttrace/main.go      # CLI entry: flags, TUI/CLI dispatch
└── internal/
    ├── engine/                 # parsers, pricing, anomalies, reports, cache
    ├── index/                  # incremental local session index
    ├── i18n/                   # bilingual UI/report strings
    └── tui/                    # Bubble Tea TUI views, command mode, tests

🤝 Contributing

See CONTRIBUTING.md for contribution flow, validation commands, parser expectations, privacy guidance, and the code of conduct.

git clone https://github.com/luoyuctl/agenttrace.git
cd agenttrace
go test ./...              # verify behavior and rendering constraints
go build -o agenttrace ./cmd/agenttrace/
./agenttrace --latest      # smoke test
./agenttrace --doctor      # verify local discovery and cache status

📄 License

MIT © 2025 agenttrace contributors


Built with ❤️ for the AI engineering community