AltimateAI · anandgupta42 · Mar 16, 2026 · Mar 13, 2026
diff --git a/.gitignore b/.gitignore
@@ -27,5 +27,6 @@ target
 # Local dev files
 opencode-dev
 logs/
+docs/site/
 *.bun-build
 tsconfig.tsbuildinfo
diff --git a/bun.lock b/bun.lock
diff --git a/docs/docs/configure/tracing.md b/docs/docs/configure/tracing.md
@@ -0,0 +1,341 @@
+# Tracing
+
+Altimate Code captures detailed traces of every headless session — LLM generations, tool calls, token usage, cost, and timing — and saves them locally as JSON files. Traces are invaluable for debugging agent behavior, optimizing cost, and understanding how the agent solves problems.
+
+Tracing is **enabled by default** and requires no configuration. Traces are stored locally and never leave your machine unless you configure a remote exporter.
+
+## Quick Start
+
+```bash
+# Run a prompt — trace is saved automatically
+altimate-code run "optimize my most expensive queries"
+# → Trace saved: ~/.local/share/altimate-code/traces/abc123.json
+
+# List recent traces
+altimate-code trace list
+
+# View a trace in the browser
+altimate-code trace view abc123
+```
+
+## What's Captured
+
+Each trace records the full agent session:
+
+| Data | Description |
+|------|-------------|
+| **Generations** | Each LLM call with model, provider, finish reason, and variant |
+| **Token usage** | Input, output, reasoning, cache read, and cache write tokens per generation |
+| **Cost** | Per-generation and total session cost in USD |
+| **Tool calls** | Every tool invocation with input, output, duration, and status |
+| **Timing** | Start/end timestamps for every span (session, generation, tool) |
+| **Errors** | Error messages and status on failed tool calls or generations |
+| **Metadata** | Model, provider, agent, prompt, user ID, environment, tags |
+
+### Data Engineering Attributes
+
+When using SQL and dbt tools, traces automatically capture domain-specific data:
+
+| Category | Examples |
+|----------|----------|
+| **Warehouse** | Bytes scanned/billed, execution time, queue time, partitions pruned, cache hits, query ID, estimated cost |
+| **SQL** | Query text, dialect, validation results, lineage (input/output tables), schema changes |
+| **dbt** | Command, model status, materialization, rows affected, compiled SQL, test results, Jinja errors |
+| **Data Quality** | Row counts, null percentages, freshness, anomaly detection |
+| **Cost Attribution** | LLM cost + warehouse compute cost + storage delta = total cost, per user/team/project |
+
+These attributes are purely optional — traces are valid without them. They're populated automatically by tools that have access to warehouse metadata.
+
+## Configuration
+
+Add a `tracing` block to your config file (`~/.config/altimate-code/altimate-code.json`, or a project-level `altimate-code.json`):
+
+```json
+{
+  "tracing": {
+    "enabled": true,
+    "dir": "~/.local/share/altimate-code/traces/",
+    "maxFiles": 100,
+    "exporters": []
+  }
+}
+```
+
+| Option | Type | Default | Description |
+|--------|------|---------|-------------|
+| `enabled` | `boolean` | `true` | Enable or disable tracing |
+| `dir` | `string` | `~/.local/share/altimate-code/traces/` | Custom directory for trace files |
+| `maxFiles` | `number` | `100` | Max trace files to keep (oldest pruned automatically). Set to `0` for unlimited |
+| `exporters` | `array` | `[]` | Remote HTTP exporters (see below) |
+
+### Disabling Tracing
+
+```json
+{
+  "tracing": {
+    "enabled": false
+  }
+}
+```
+
+Or per-run with the `--no-trace` flag:
+
+```bash
+altimate-code run --no-trace "quick question"
+```
+
+## Viewing Traces
+
+### List Traces
+
+```bash
+altimate-code trace list
+```
+
+Prints a table of recent traces with session ID, age, duration, total tokens, cost, tool-call count, status, and prompt.
+
+```
+SESSION              WHEN       DURATION   TOKENS     COST       TOOLS  STATUS     PROMPT
+abc123def456         2m ago     45.2s      12,500     $0.0150    8      ok         optimize my most expensive queries
+xyz789abc012         1h ago     12.8s      3,200      $0.0040    3      ok         explain this model
+err456def789         3h ago     5.1s       1,800      $0.0020    2      error      run dbt tests
+```
+
+Options:
+
+| Flag | Description |
+|------|-------------|
+| `-n`, `--limit` | Number of traces to show (default: 20) |
+
+### View a Trace
+
+```bash
+altimate-code trace view <session-id>
+```
+
+Starts a local web server and opens an interactive trace viewer in your browser. The viewer shows:
+
+- **Summary cards** — duration, token breakdown (input/output/reasoning/cache), cost, generations, tool calls, status
+- **Timeline** — horizontal bars for each span, color-coded by type (generation, tool, error)
+- **Detail panel** — click any span to see its model info, token counts, finish reason, input/output, and domain-specific attributes (warehouse metrics, dbt results, etc.)
+
+Options:
+
+| Flag | Description |
+|------|-------------|
+| `--port` | Port for the viewer server (default: random) |
+| `--live` | Auto-refresh every 2s for in-progress sessions |
+
+Partial session ID matching is supported — `altimate-code trace view abc` matches `abc123def456`.
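
A minimal sketch of how partial ID resolution could behave, assuming matching is by prefix and that an ambiguous prefix is rejected. The doc only says partial matching is supported, so both behaviors and the `resolveSessionId` helper are assumptions for illustration.

```typescript
// Hypothetical helper: resolve a partial session ID against known trace IDs.
// Assumes prefix matching; the real CLI may match differently.
function resolveSessionId(partial: string, sessionIds: string[]): string {
  const matches = sessionIds.filter((id) => id.startsWith(partial));
  if (matches.length === 0) {
    throw new Error(`No trace matches "${partial}"`);
  }
  if (matches.length > 1) {
    throw new Error(`"${partial}" is ambiguous: ${matches.join(", ")}`);
  }
  return matches[0];
}

// Session IDs taken from the `trace list` example output.
const known = ["abc123def456", "xyz789abc012", "err456def789"];
console.log(resolveSessionId("abc", known)); // abc123def456
```

Under substring matching, `abc` would also hit `xyz789abc012`, which is why this sketch assumes prefixes.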
+
+### Live Viewing (In-Progress Sessions)
+
+Traces are written incrementally — after every tool call and generation, a snapshot is flushed to disk. This means you can view a trace while the session is still running:
+
+```bash
+# In terminal 1: run a long task
+altimate-code run "refactor the entire pipeline"
+
+# In terminal 2: watch the trace live
+altimate-code trace view <session-id> --live
+```
+
+The `--live` flag adds a green "LIVE" indicator and polls for updates every 2 seconds. The page auto-refreshes when new spans appear.
+
+### From the TUI
+
+Type `/trace` in the TUI to open the trace viewer for the current session in your browser. The viewer launches in live mode automatically, so you can watch spans appear as the agent works.
+
+## Remote Exporters
+
+Traces can be sent to remote backends via HTTP POST. Each exporter receives the full trace JSON on session completion.
+
+```json
+{
+  "tracing": {
+    "exporters": [
+      {
+        "name": "my-backend",
+        "endpoint": "https://api.example.com/v1/traces",
+        "headers": {
+          "Authorization": "Bearer <token>"
+        }
+      }
+    ]
+  }
+}
+```
+
+| Field | Type | Description |
+|-------|------|-------------|
+| `name` | `string` | Identifier for this exporter (used in logs) |
+| `endpoint` | `string` | HTTP endpoint to POST trace JSON to |
+| `headers` | `object` | Custom headers (e.g., auth tokens) |
+
+**How it works:**
+
+- All exporters run concurrently with the local file write via `Promise.allSettled`
+- A failing exporter never blocks local file storage or other exporters
+- If the server responds with `{ "url": "..." }`, the URL is displayed to the user
+- Exporters have a 10-second timeout
+- All export operations are best-effort — they never crash the CLI
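
The behavior listed above can be sketched roughly as follows. This is an illustration under assumptions, not the project's code: `Exporter`, `withTimeout`, and `exportTrace` are hypothetical names, and the exporters here are plain async functions rather than real HTTP calls.

```typescript
// Sketch only: all names in this block are illustrative.
type Exporter = { name: string; send: (traceJson: string) => Promise<void> };

// Reject if `work` has not settled within `ms` milliseconds.
function withTimeout<T>(work: Promise<T>, ms: number): Promise<T> {
  return new Promise<T>((resolve, reject) => {
    const timer = setTimeout(() => reject(new Error(`timed out after ${ms}ms`)), ms);
    work.then(
      (value) => { clearTimeout(timer); resolve(value); },
      (err) => { clearTimeout(timer); reject(err); },
    );
  });
}

async function exportTrace(
  traceJson: string,
  writeLocal: (json: string) => Promise<void>,
  exporters: Exporter[],
  timeoutMs = 10_000, // the documented 10-second timeout
): Promise<{ name: string; ok: boolean }[]> {
  const names = ["local", ...exporters.map((e) => e.name)];
  const tasks = [
    writeLocal(traceJson),
    ...exporters.map((e) => withTimeout(e.send(traceJson), timeoutMs)),
  ];
  // allSettled never rejects, so one failure cannot block the other tasks.
  const settled = await Promise.allSettled(tasks);
  return settled.map((result, i) => ({ name: names[i], ok: result.status === "fulfilled" }));
}

exportTrace("{}", async () => {}, [
  { name: "good", send: async () => {} },
  { name: "bad", send: async () => { throw new Error("503"); } },
]).then((results) => console.log(results));
```

The failing `bad` exporter is reported with `ok: false` while the local write and the `good` exporter still succeed.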
+
+## Trace File Format
+
+Traces are stored as JSON files in the traces directory. The schema is versioned for forward compatibility.
+
+```json
+{
+  "version": 2,
+  "traceId": "019cf4e2-...",
+  "sessionId": "session-abc123",
+  "startedAt": "2026-03-15T10:00:00.000Z",
+  "endedAt": "2026-03-15T10:00:45.200Z",
+  "metadata": {
+    "model": "anthropic/claude-sonnet-4-20250514",
+    "providerId": "anthropic",
+    "agent": "builder",
+    "variant": "high",
+    "prompt": "optimize my most expensive queries",
+    "userId": "user@example.com",
+    "environment": "production",
+    "version": "2.0.0",
+    "tags": ["benchmark", "nightly"]
+  },
+  "spans": [
+    {
+      "spanId": "...",
+      "parentSpanId": null,
+      "name": "session-abc123",
+      "kind": "session",
+      "startTime": 1773568800000,
+      "endTime": 1773568845200,
+      "status": "ok"
+    },
+    {
+      "spanId": "...",
+      "parentSpanId": "<session-span-id>",
+      "name": "generation-1",
+      "kind": "generation",
+      "startTime": 1773568800100,
+      "endTime": 1773568803500,
+      "status": "ok",
+      "model": {
+        "modelId": "anthropic/claude-sonnet-4-20250514",
+        "providerId": "anthropic"
+      },
+      "finishReason": "stop",
+      "cost": 0.005,
+      "tokens": {
+        "input": 1500,
+        "output": 300,
+        "reasoning": 100,
+        "cacheRead": 200,
+        "cacheWrite": 50,
+        "total": 2150
+      }
+    },
+    {
+      "spanId": "...",
+      "parentSpanId": "<generation-span-id>",
+      "name": "sql_execute",
+      "kind": "tool",
+      "startTime": 1773568801000,
+      "endTime": 1773568803000,
+      "status": "ok",
+      "tool": { "callId": "call-1", "durationMs": 2000 },
+      "input": { "query": "SELECT ..." },
+      "output": "10 rows returned",
+      "attributes": {
+        "de.warehouse.system": "snowflake",
+        "de.warehouse.bytes_scanned": 45000000,
+        "de.warehouse.estimated_cost_usd": 0.0012,
+        "de.sql.validation.valid": true
+      }
+    }
+  ],
+  "summary": {
+    "totalTokens": 2150,
+    "totalCost": 0.005,
+    "totalToolCalls": 1,
+    "totalGenerations": 1,
+    "duration": 45200,
+    "status": "completed",
+    "tokens": {
+      "input": 1500,
+      "output": 300,
+      "reasoning": 100,
+      "cacheRead": 200,
+      "cacheWrite": 50
+    }
+  }
+}
+```
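
As a rough illustration of how the `summary` block relates to `spans` in the example above, the sketch below recomputes the totals from span data. The field names come from the example; the `summarize` helper and the assumption that totals are plain sums over generation spans are hypothetical.

```typescript
// Field names follow the example trace; `summarize` itself is a hypothetical helper.
interface Span {
  kind: "session" | "generation" | "tool";
  cost?: number;
  tokens?: { total: number };
}

function summarize(spans: Span[]) {
  const generations = spans.filter((s) => s.kind === "generation");
  return {
    totalGenerations: generations.length,
    totalToolCalls: spans.filter((s) => s.kind === "tool").length,
    // Assumption: session totals are sums over generation spans.
    totalTokens: generations.reduce((sum, s) => sum + (s.tokens?.total ?? 0), 0),
    totalCost: generations.reduce((sum, s) => sum + (s.cost ?? 0), 0),
  };
}

// The three spans from the example, reduced to the fields used here.
const spans: Span[] = [
  { kind: "session" },
  { kind: "generation", cost: 0.005, tokens: { total: 2150 } },
  { kind: "tool" },
];
console.log(summarize(spans));
```

For the example trace this yields 1 generation, 1 tool call, 2150 tokens, and a cost of 0.005, matching its `summary` block.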
+
+### Span Types
+
+| Kind | Description | Key Fields |
+|------|-------------|------------|
+| `session` | Root span for the entire session | `input` (prompt), `output` (summary) |
+| `generation` | One LLM call (step-start to step-finish) | `model`, `finishReason`, `tokens`, `cost` |
+| `tool` | A tool invocation | `tool.callId`, `tool.durationMs`, `input`, `output` |
+
+### Domain Attribute Namespaces
+
+All domain-specific attributes use the `de.*` prefix and are stored in the `attributes` map on tool spans:
+
+| Prefix | Domain |
+|--------|--------|
+| `de.warehouse.*` | Warehouse metrics (bytes, credits, partitions, timing) |
+| `de.sql.*` | SQL quality (validation, lineage, schema changes) |
+| `de.dbt.*` | dbt operations (model status, tests, Jinja, DAG) |
+| `de.quality.*` | Data quality (row counts, freshness, anomalies) |
+| `de.cost.*` | Cost attribution (LLM + warehouse + storage) |
+
+## Crash Recovery
+
+Traces are designed to survive process crashes:
+
+1. **Immediate snapshot** — A trace file is written as soon as `startTrace()` is called, before any LLM interaction. Even if the process crashes immediately, a minimal trace file exists.
+
+2. **Incremental snapshots** — After every tool call and generation completion, the trace file is updated atomically (write to temp file, then rename). The file on disk always contains a valid, complete JSON document.
+
+3. **Crash handlers** — The `run` command registers `SIGINT`/`SIGTERM`/`beforeExit` handlers that flush the trace synchronously with a `"crashed"` status.
+
+4. **Status indicators** — Trace status tells you exactly what happened:
+
+| Status | Meaning |
+|--------|---------|
+| `completed` | Session finished normally |
+| `error` | Session finished with an error |
+| `running` | Session is still in progress (visible in live mode) |
+| `crashed` | Process was interrupted before the session completed |
+
+Crashed traces contain all data up to the last successful snapshot. You can view them normally with `altimate-code trace view`.
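
The write-to-temp-then-rename step from item 2 can be sketched like this. It is a minimal illustration using Node's `fs` module; `writeSnapshot` and the temp-file naming are assumptions, not the project's actual implementation.

```typescript
import { mkdtempSync, readFileSync, renameSync, writeFileSync } from "node:fs";
import { tmpdir } from "node:os";
import { join } from "node:path";

// Hypothetical helper illustrating an atomic snapshot write.
function writeSnapshot(filePath: string, trace: unknown): void {
  // Write the full document to a sibling temp file first.
  const tmpPath = `${filePath}.${process.pid}.tmp`;
  writeFileSync(tmpPath, JSON.stringify(trace, null, 2));
  // Renaming within one directory swaps the file in a single step on POSIX
  // filesystems, so readers see the old or the new document, never a partial one.
  renameSync(tmpPath, filePath);
}

const dir = mkdtempSync(join(tmpdir(), "trace-demo-"));
const file = join(dir, "abc123.json");
writeSnapshot(file, { sessionId: "abc123", summary: { status: "running" } });
writeSnapshot(file, { sessionId: "abc123", summary: { status: "completed" } });
console.log(JSON.parse(readFileSync(file, "utf8")).summary.status); // completed
```

If the process dies between the two calls, the file on disk still holds the complete `running` snapshot.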
+
+## Historical Traces
+
+All traces are stored in the traces directory and persist across sessions. Use `trace list` to browse history:
+
+```bash
+# Show the last 50 traces
+altimate-code trace list -n 50
+
+# View any historical trace
+altimate-code trace view <session-id>
+```
+
+Traces are automatically pruned when `maxFiles` is exceeded (default: 100). The oldest traces are removed first. Set `maxFiles: 0` for unlimited retention.
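
A small sketch of the pruning rule described above, assuming trace age is judged by file modification time. The `TraceFile` shape and the `filesToPrune` helper are hypothetical.

```typescript
// Hypothetical helper: choose which trace files to delete under a maxFiles cap.
interface TraceFile { name: string; modifiedMs: number }

function filesToPrune(files: TraceFile[], maxFiles: number): string[] {
  // 0 means unlimited retention, so nothing is pruned.
  if (maxFiles === 0 || files.length <= maxFiles) return [];
  const oldestFirst = [...files].sort((a, b) => a.modifiedMs - b.modifiedMs);
  return oldestFirst.slice(0, files.length - maxFiles).map((f) => f.name);
}

const files: TraceFile[] = [
  { name: "c.json", modifiedMs: 3000 },
  { name: "a.json", modifiedMs: 1000 },
  { name: "b.json", modifiedMs: 2000 },
];
console.log(filesToPrune(files, 2)); // [ 'a.json' ]
console.log(filesToPrune(files, 0)); // []
```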
+
+## Privacy
+
+Traces are stored **locally only** by default. They contain:
+
+- The prompt you sent
+- Tool inputs and outputs (SQL queries, file contents, command results)
+- Model responses
+
+If you configure remote exporters, trace data is sent to those endpoints. No trace data is included in the anonymous telemetry described in [Telemetry](telemetry.md).
+
+!!! warning "Sensitive Data"
+    Traces may contain SQL queries, file paths, and command outputs from your session. If you share trace files or configure remote exporters, be aware that this data will be included.
diff --git a/docs/docs/usage/cli.md b/docs/docs/usage/cli.md
@@ -33,6 +33,7 @@ altimate --agent analyst
 | `export` | Export session data |
 | `import` | Import session data |
 | `session` | Session management |
+| `trace` | List and view session traces |
 | `github` | GitHub integration |
 | `pr` | Pull request tools |
 | `upgrade` | Upgrade to latest version |
@@ -106,4 +107,19 @@ altimate run --model anthropic/claude-sonnet-4-6 "optimize my warehouse"
 
 # Print logs for debugging
 altimate --print-logs --log-level DEBUG run "test query"
+
+# Disable tracing for a single run
+altimate run --no-trace "quick question"
+```
+
+## Tracing
+
+Every `run` command automatically saves a trace file with the full session details — generations, tool calls, tokens, cost, and timing. See [Tracing](../configure/tracing.md) for configuration options.
+
+```bash
+# List recent traces
+altimate trace list
+
+# View a trace in the browser
+altimate trace view <session-id>
 ```
diff --git a/docs/mkdocs.yml b/docs/mkdocs.yml
@@ -98,6 +98,7 @@ nav:
       - Appearance:
           - Themes: configure/themes.md
           - Keybinds: configure/keybinds.md
+      - Tracing: configure/tracing.md
       - Telemetry: configure/telemetry.md
       - Integrations:
           - LSP Servers: configure/lsp.md