ccaudit — Claude Code Token Usage Explorer

ccaudit is a terminal UI for exploring how Claude Code spends your token budget. It reads the JSONL session logs that Claude Code writes to ~/.claude/projects/ and breaks down token usage by session, exchange, and content category.

Getting Started

With uv (recommended)

uv run main.py

uv reads requirements.txt and manages the environment automatically.

With venv

# Create and activate a virtual environment
python3 -m venv .venv
source .venv/bin/activate

# Install dependencies
pip install -r requirements.txt

# Run (reads all projects by default)
python main.py

Command-Line Flags

Flag	Default	Meaning
`-a`, `--all`	Yes	Read all Claude projects from `~/.claude/projects/`
`-d PATH`, `--dir PATH`	—	Show only the project that corresponds to the code directory at `PATH`. Looks up the matching project in `~/.claude/projects/` by slug; does not read JSONL files from `PATH` itself.

-a and -d are mutually exclusive. If neither is given, --all is the default.

Example — view only the current project:

python main.py -d .
python main.py -d ~/code/myproject

How Sessions and Exchanges Are Stored

Claude Code records every API call to disk as a JSONL file. Each line is one raw API message — a user message, an assistant message, or a system event.

Storage Layout

~/.claude/projects/
  <project-slug>/
    <session-id>.jsonl
    <session-id>.jsonl
    ...

Project slug: a filesystem-safe encoding of the working directory path. Forward slashes are replaced with hyphens, with a leading hyphen. Example: /Users/alan/code/myproject → -Users-alan-code-myproject.
Session ID: a UUID identifying one continuous Claude Code session. The JSONL filename stem is the session ID.
JSONL: one JSON object per line; blank lines and malformed lines are skipped.

What Is an Exchange?

An exchange (this project's term) is one complete human-to-assistant interaction: from a human-typed message through all intermediate tool round-trips, up to but not including the next human-typed message.

The Anthropic API uses "turn" to mean a single message from one role (one user or one assistant message). An exchange in ccaudit spans multiple API turns whenever Claude calls tools.

A single exchange can contain several API message pairs:

user_A   — human message                ← exchange 1 starts here
asst_A   — tool_use (e.g. Read file)
user_B   — tool_result
asst_B   — tool_use (e.g. Write file)
user_C   — tool_result
asst_C   — final text response          ← exchange 1 ends here
user_D   — next human message           ← exchange 2 starts here
asst_D   — final text response          ← exchange 2 ends here

Exchange boundaries are detected by content inspection, not by pointer following:

A user message opens a new exchange if it contains at least one text block that is not injected context (skills or system reminders).
A user message is an intermediate tool-result message if every block has type: tool_result. These belong to the current open exchange and do not start a new one.

Token usage for an exchange is the sum across all assistant messages in that exchange, not just the final one.

Top-Level Message Envelope

Every line in the JSONL file is a JSON object with this shape:

{
  "type": "user" | "assistant" | "system",
  "message": { ... },
  "timestamp": "2026-01-15T10:30:00.000Z",
  "uuid": "a63bf130-920c-46be-a7c5-a9dc2d435487",
  "parentUuid": "d9f76f52-366c-4d12-932d-7afdceaafe44",
  "requestId": "req_01XyzAbc...",
  "subtype": "compact_boundary",
  "content": "...",
  "compactMetadata": { ... }
}

Field	Present when	Meaning
`type`	Always	`"user"`, `"assistant"`, or `"system"`. Determines the shape of the rest of the object.
`message`	`type` is `"user"` or `"assistant"`	The actual API-level message object (see below).
`timestamp`	Always	ISO-8601 datetime string indicating when this message was written. Timezone is typically UTC (`Z`).
`uuid`	Always	Unique identifier for this message record.
`parentUuid`	Always (except first message)	The `uuid` of the immediately preceding message. Forms a linked list that can reconstruct conversation order. `null` or absent on the first message in a session. Does not encode exchange boundaries — distinguishing human messages from intermediate tool-result messages requires inspecting content, not following the chain.
`requestId`	`type` is `"assistant"`	The Anthropic API request ID for this assistant message. Useful for correlating with API logs or billing.
`subtype`	`type` is `"system"`	Currently only `"compact_boundary"` is observed.
`content`	`type` is `"system"`	The compacted context summary text (only present on compact boundary events).
`compactMetadata`	`type` is `"system"`	Metadata about the compaction event (see below).

User Message

When type == "user", the message object is:

{
  "role": "user",
  "content": "string" | [ ...content blocks... ]
}

Field	Meaning
`role`	Always `"user"`.
`content`	The message content. Can be a plain string (rare) or an array of content blocks (typical). When Claude Code is active, the content array is structured: injected context (skills, system reminders, tool results) appears first, followed by the human-typed message as the last text block.

Content Block Types (User)

Text block — either injected context or the human's actual message:

{ "type": "text", "text": "..." }

Tool result block — the output of a tool that the assistant called:

{
  "type": "tool_result",
  "tool_use_id": "toolu_...",
  "content": "string" | [ { "type": "text", "text": "..." }, ... ]
}

The content inside a tool result can itself be a string or a list of text blocks.

User Message Content Order

Claude Code builds user messages in this order (concatenated into the content array):

Skills — loaded skill files, each preceded by Base directory: /path/to/skills/...
System reminders — <system-reminder>...</system-reminder> blocks injected by hooks, MCP servers, or Claude Code internals
Tool results — tool_result blocks from the previous assistant message's tool calls
Human text — the actual message the user typed (always the last plain text block)

Extracting the human's text means walking backwards through the content array to find the last text block that is not injected context.

Assistant Message

When type == "assistant", the message object is:

{
  "role": "assistant",
  "content": "string" | [ ...content blocks... ],
  "usage": {
    "input_tokens": 1234,
    "cache_read_input_tokens": 5678,
    "cache_creation_input_tokens": 910,
    "cache_creation": {
      "ephemeral_5m_input_tokens": 500,
      "ephemeral_1h_input_tokens": 410
    },
    "output_tokens": 456
  }
}

Field	Meaning
`role`	Always `"assistant"`.
`content`	The assistant's response: a plain string (rare) or a list of text and tool-use blocks.
`usage`	Token accounting for this API call. Messages without `usage` are streaming artifacts and are skipped by the loader.

Usage Fields

Field	Meaning
`input_tokens`	Fresh input tokens — tokens that were not served from cache. Billed at the standard input rate.
`cache_read_input_tokens`	Cache hit tokens — prompt tokens served from the prompt cache. Billed at ~10% of the fresh input rate. High values mean Claude reused cached context from a prior exchange.
`cache_creation_input_tokens`	Cache write tokens — tokens added to the prompt cache this call. Billed at ~125% of the fresh input rate; they will be cheap to reuse in future exchanges.
`cache_creation.ephemeral_5m_input_tokens`	Subset of cache writes that use a 5-minute TTL cache slot.
`cache_creation.ephemeral_1h_input_tokens`	Subset of cache writes that use a 1-hour TTL cache slot.
`output_tokens`	Output tokens — tokens in Claude's response. Billed at the output rate.

Total prompt size for an exchange ≈ input_tokens + cache_read_input_tokens + cache_creation_input_tokens (summed across all assistant messages in the exchange). Only input_tokens + cache_creation_input_tokens are freshly processed; cache hits are served without reprocessing.

Content Block Types (Assistant)

Text block — Claude's written response:

{ "type": "text", "text": "..." }

Tool use block — a tool call Claude is making:

{
  "type": "tool_use",
  "id": "toolu_01abc...",
  "name": "Read",
  "input": { "file_path": "/path/to/file" }
}

Field	Meaning
`id`	Unique identifier for this tool call. Matched against `tool_result.tool_use_id` in the next user message.
`name`	The tool name (e.g. `"Read"`, `"Write"`, `"Bash"`, `"Agent"`, `"Grep"`, `"Glob"`, or `"mcp__<server>__<tool>"` for MCP tools).
`input`	Tool-specific parameters as a dict.

System Message — Compact Boundary

When type == "system" and subtype == "compact_boundary", Claude Code has compressed the conversation history:

{
  "type": "system",
  "subtype": "compact_boundary",
  "timestamp": "2026-01-15T10:45:00.000Z",
  "content": "Summary of prior conversation...",
  "compactMetadata": {
    "trigger": "auto" | "manual",
    "preTokens": 95000
  }
}

Field	Meaning
`content`	The compressed summary that replaces the prior conversation history.
`compactMetadata.trigger`	`"auto"` if triggered automatically (context approaching limit); `"manual"` if the user ran `/compact`.
`compactMetadata.preTokens`	Token count immediately before compaction.

The first exchange after a compact boundary is tagged after_compact = true in the parsed model. Its cache_read_input_tokens reflects the compressed context being cached, not the original system prompt. In the TUI, these exchanges are marked with a ⚡ prefix.

Token Categories

ccaudit classifies each exchange's fresh token budget across six categories by inspecting content blocks structurally, then attributing tokens proportionally to character counts.

Category	Source blocks	Examples
Skills	`text` blocks (user) whose text starts with `Base directory: .../skills/`	Superpowers skill files loaded at the start of a message
Tools	`tool_use` blocks (assistant) for built-in tools; matching `tool_result` blocks (user)	`Read`, `Write`, `Bash`, `Glob`, `Grep`, `WebFetch`, etc.
MCP	`tool_use` blocks (assistant) whose name starts with `mcp__`; matching `tool_result` blocks (user)	`mcp__github__...`, `mcp__slack__...`
Agents	`tool_use` blocks (assistant) with `name == "Agent"`	Subagent dispatch via the `Agent` tool
Messages	Remaining `text` blocks — human-typed message and Claude's written response	The actual conversation content
Other	Unclassified content	Injected context blocks that don't match any pattern above

How attribution works: The fresh token budget for an exchange (input_tokens + cache_creation_input_tokens, summed across all assistant messages in the exchange) is distributed across categories in proportion to their share of total characters in the exchange's content. This is an approximation — character count is not the same as token count, and content that tokenizes densely (code, JSON) may be under-attributed relative to prose.

MCP tool results are routed to the MCP category (not Tools) by looking up the tool_use_id in the preceding assistant message to recover the original tool name.

Parsed Data Model

The loader produces these Python dataclasses (defined in parser/models.py):

ExchangeStats

One complete human-to-assistant exchange, including all intermediate tool round-trips.

Field	Type	Source
`exchange_number`	`int`	1-based counter within the session
`timestamp`	`str`	`timestamp` of the final assistant message in the exchange
`input_tokens`	`int`	Sum of `usage.input_tokens` across all assistant messages in the exchange
`cache_read_tokens`	`int`	Sum of `usage.cache_read_input_tokens`
`cache_create_tokens`	`int`	Sum of `usage.cache_creation_input_tokens`
`cache_create_5m_tokens`	`int`	Sum of `usage.cache_creation.ephemeral_5m_input_tokens`
`cache_create_1h_tokens`	`int`	Sum of `usage.cache_creation.ephemeral_1h_input_tokens`
`output_tokens`	`int`	Sum of `usage.output_tokens`
`category_breakdown`	`CategoryBreakdown`	Per-category token estimates
`after_compact`	`bool`	`True` if the human message immediately followed a compact boundary
`user_text`	`str`	Last text block of user content that isn't injected context (≤800 chars)
`assistant_text`	`str`	Text blocks from the final assistant message joined (≤800 chars)
`files_read`	`list[str]`	Paths from `Read` calls; `Glob:pattern` for Glob; `Grep:'pattern' in path` for Grep
`tool_calls`	`list[tuple[str, dict]]`	`(tool_name, input_dict)` for every `tool_use` block across all assistant messages
`raw_user`	`dict`	Full JSONL envelope of the opening human user message
`raw_assistants`	`list[dict]`	Full JSONL envelopes of all assistant messages in the exchange (intermediates + final)
`jsonl_path`	`str`	Absolute path to the source JSONL file

SessionStats

One Claude Code session (one JSONL file).

Field	Meaning
`session_id`	UUID from the filename stem
`display_name`	First 8 characters of the session ID
`first_timestamp`	Timestamp of the first message in the file
`exchanges`	Ordered list of `ExchangeStats`

ProjectStats

One project directory under ~/.claude/projects/.

Field	Meaning
`project_slug`	Raw directory name (e.g. `-Users-alan-code-myproject`)
`display_name`	Human-readable name: the portion of the slug after the 3rd hyphen-separated path component
`sessions`	List of `SessionStats`
`loaded`	`False` until `load_project()` is called (lazy loading)
`load_error`	Set to an error string if loading fails; `None` otherwise

Schema Sources

The fields documented here come from two distinct sources:

Anthropic Messages API — officially documented at docs.anthropic.com:

usage.input_tokens, cache_read_input_tokens, cache_creation_input_tokens, output_tokens
usage.cache_creation.ephemeral_5m_input_tokens, ephemeral_1h_input_tokens (prompt caching with TTL)
role, content, and all content block types (text, tool_use, tool_result)

Claude Code private format — not in Anthropic's API docs; written by Claude Code when it persists sessions to disk:

Envelope fields: uuid, parentUuid, requestId, timestamp
System message fields: subtype, compactMetadata
requestId corresponds to the x-request-id response header from the API, recorded by Claude Code for traceability.

The JSONL format as a whole is Claude Code's own storage format and is not officially documented by Anthropic.

Name		Name	Last commit message	Last commit date
Latest commit History 52 Commits
docs/superpowers		docs/superpowers
parser		parser
tests		tests
tui		tui
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
main.py		main.py
pytest.ini		pytest.ini
requirements.txt		requirements.txt
screenshot.png		screenshot.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ccaudit — Claude Code Token Usage Explorer

Getting Started

With uv (recommended)

With venv

Command-Line Flags

How Sessions and Exchanges Are Stored

Storage Layout

What Is an Exchange?

Top-Level Message Envelope

User Message

Content Block Types (User)

User Message Content Order

Assistant Message

Usage Fields

Content Block Types (Assistant)

System Message — Compact Boundary

Token Categories

Parsed Data Model

ExchangeStats

SessionStats

ProjectStats

Schema Sources

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ccaudit — Claude Code Token Usage Explorer

Getting Started

With uv (recommended)

With venv

Command-Line Flags

How Sessions and Exchanges Are Stored

Storage Layout

What Is an Exchange?

Top-Level Message Envelope

User Message

Content Block Types (User)

User Message Content Order

Assistant Message

Usage Fields

Content Block Types (Assistant)

System Message — Compact Boundary

Token Categories

Parsed Data Model

ExchangeStats

SessionStats

ProjectStats

Schema Sources

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages