codemap

Code map your repo so AI coding agents find the right files on the first try.

Codemap indexes every file in your repository with a one-line summary, then uses a cheap, fast model to pick only the files relevant to a task. Instead of your agent burning 10+ tool calls exploring the codebase, it gets the exact files it needs in one shot.

The idea is simple — code map with a cheap model, auto-context with a cheap model, then code with a big model. Higher precision on the input, higher precision on the output.

How it works

                    codemap build                    codemap select
                    (once, cached)                   (per task, fast)
                         |                                |
    Your repo -----> Per-file index -----> Cheap model picks files -----> Agent gets
    2688 files       summary, types,       "381 candidates ->            focused context
                     functions, imports     5 files (27 KB)"             30-80k tokens

codemap build indexes every file with a one-line summary using a cheap model (Haiku/Flash). The index is cached incrementally via mtime + BLAKE3, so only changed files get re-indexed.

codemap select takes a task description and sends the summaries (not the source) to a cheap model, which picks the 5-10 files that matter. It then returns their full source code.

Your agent gets exactly the files it needs. No grep. No glob. No wrong turns.

Install

go install github.com/jonnonz1/codemap/cmd/codemap@latest

Quick start

cd your-project

# Interactive setup — picks provider, model, API key
codemap init

# Index your repo (first run takes a few minutes, after that it's seconds)
codemap build

# Register MCP server with Claude Code
claude mcp add codemap -- codemap mcp

That's it. Start a Claude Code session and it'll discover codemap_select as a native tool.

Setup

codemap init walks you through it:

Select LLM provider for file summaries:
  1) anthropic  (Claude — recommended)
  2) openai     (GPT)
  3) google     (Gemini)
  4) mock       (no LLM, placeholder summaries)

Provider [1]: 1
Model [claude-haiku-4-5-20251001]:
API key (stored in .codemap.yaml): sk-ant-...

This creates:

  • .codemap.yaml — config with your API key (gitignored)
  • A CLAUDE.md section telling Claude Code to use codemap tools
  • A SessionStart hook that injects code map status at session start
  • An example task file in tasks/

Claude Code integration (MCP)

Codemap runs as an MCP server that Claude Code calls natively:

# Register once
claude mcp add codemap -- codemap mcp

Claude gets three tools:

  • codemap_select — given a task, returns full source of the most relevant files
  • codemap_status — check if the index is fresh or stale
  • codemap_build — trigger an incremental rebuild

When you give Claude a task, it calls codemap_select first, gets focused context, and starts coding. No exploration phase.

CLI commands

codemap init                    # Interactive project setup
codemap build                   # Index repo (incremental, cached)
codemap render                  # Render code map as markdown
codemap select --task task.md   # Select files for a task (CLI mode)
codemap context                 # Show what gets injected at session start
codemap doctor                  # Check cache health
codemap statistics              # View usage metrics
codemap statistics --eval       # Evaluate selection accuracy vs git

Measuring it

Codemap tracks real metrics — what actually happened, not estimates:

$ codemap statistics --eval

Build Performance
  Total builds:        12
  Files indexed:       2688
  Avg cache hit rate:  94%

Context Selection
  Total selections:    8
  Avg files selected:  6.2
  Avg context saved:   97%

Selection Accuracy (vs actual git changes)
  Evaluations:         5
  Avg precision:       65%  (of selected files, how many were actually needed)
  Avg recall:          82%  (of changed files, how many were pre-selected)

Exploration Overhead
  Total Read calls:    48
  Extra reads:         7   (files NOT in codemap selection)
  Overhead:            15%
  Verdict:             codemap is providing good coverage

  • Hit rate / recall — did codemap predict the files you actually changed?
  • Precision — did it include junk files you didn't need?
  • Exploration overhead — did Claude need to search beyond what codemap gave it?
  • Context saved — how much was the candidate pool compressed?

All computed from observed data (git diff, tool call logs). No counterfactuals.
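Precision and recall fall straight out of the two file sets, computed exactly as the parenthetical definitions above describe. This is a generic sketch, not codemap's implementation:

```go
package main

import "fmt"

// precisionRecall compares what was selected against what git says
// actually changed. Precision is measured over the selected set,
// recall over the changed set.
func precisionRecall(selected, changed []string) (precision, recall float64) {
	changedSet := make(map[string]bool)
	for _, f := range changed {
		changedSet[f] = true
	}
	hits := 0
	for _, f := range selected {
		if changedSet[f] {
			hits++
		}
	}
	if len(selected) > 0 {
		precision = float64(hits) / float64(len(selected))
	}
	if len(changed) > 0 {
		recall = float64(hits) / float64(len(changed))
	}
	return
}

func main() {
	p, r := precisionRecall(
		[]string{"a.go", "b.go", "c.go", "d.go"}, // codemap selected
		[]string{"a.go", "b.go", "e.go"},         // git actually changed
	)
	fmt.Printf("precision %.2f recall %.2f\n", p, r) // precision 0.50 recall 0.67
}
```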

Task files

For CLI-based selection (without MCP), write a task file:

---
context_globs:
  - src/invoices/**
  - tests/invoices/**
knowledge_globs:
  - src/types/**
max_files: 10
---

Add soft-delete support to invoices. Preserve existing patterns. Update tests.
Then run the selection and read the result:

codemap select --task task.md
cat .claude/cache/selected-context.md
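A minimal sketch of splitting such a task file into YAML front matter and body, stdlib only with no YAML parsing; codemap's actual parser may be stricter about delimiters:

```go
package main

import (
	"fmt"
	"strings"
)

// splitTaskFile separates the YAML front matter (between "---"
// delimiters) from the free-text task body. A file with no front
// matter is treated as all body.
func splitTaskFile(raw string) (frontMatter, body string) {
	const delim = "---"
	trimmed := strings.TrimLeft(raw, "\n")
	if !strings.HasPrefix(trimmed, delim) {
		return "", raw // no front matter: the whole file is the task
	}
	rest := strings.TrimPrefix(trimmed, delim)
	if i := strings.Index(rest, "\n"+delim); i >= 0 {
		frontMatter = strings.TrimSpace(rest[:i])
		body = strings.TrimSpace(rest[i+len("\n"+delim):])
		return
	}
	return "", raw // unterminated front matter: fall back to all body
}

func main() {
	fm, body := splitTaskFile("---\nmax_files: 10\n---\nAdd soft-delete support.")
	fmt.Println(fm)
	fmt.Println(body)
}
```

The front-matter globs scope the candidate pool before the cheap model ever runs, so selection quality depends less on the model and more on how well the globs bound the search.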

How indexing works

Each file in the code map gets:

  • summary — one-sentence description (from LLM)
  • when_to_use — when a developer would need this file (from LLM)
  • public_types — exported type names (from parser)
  • public_functions — exported function names (from parser)
  • imports — dependencies (from parser)
  • keywords — domain terms (from LLM)

Deterministic facts come from parsers (currently Go via go/ast). Semantic fields come from the cheap LLM. The index is cached as JSON and only rebuilt for files that actually changed (mtime + BLAKE3 hash).

Providers

Provider    Model                        Rough cost for 2700 files
Anthropic   claude-haiku-4-5-20251001    ~$2-3
OpenAI      gpt-4o-mini                  ~$1-2
Google      gemini-2.0-flash             ~$0.50
Mock        (none)                       Free

The mock provider works without any API key — handy for testing the workflow before committing to a provider.

Configuration

.codemap.yaml (gitignored, contains API key):

llm:
  provider: anthropic
  model: claude-haiku-4-5-20251001
  api_key: sk-ant-...
  workers: 32        # concurrent API calls during build
  rate_limit: 50     # max requests per minute
cache_dir: .claude/cache

Requirements

  • Go 1.22+
  • An LLM API key (or use mock mode)
  • Claude Code (for MCP integration) or any agent that reads markdown

Licence

MIT
