fat-docs

Local semantic search over a project's docs/ folder, exposed as an MCP server. Install once per machine, query from any MCP client (Claude Code, Codex, Cursor, ...) across every project you own.

What it does

fat-docs indexes the markdown files in your repo's docs/ directory, generates embeddings with a local model (no API calls, nothing leaves your machine), and serves two MCP tools:

search_docs — semantic search with two-stage ranking (doc-level prefilter, then chunk-level ranking inside the top docs). Returns stitched excerpts around each hit. Supports path_filter, tags, and groups as pre-filters.
index_docs — builds or rebuilds the index for the current project.

It ships alongside a groupings skill for the major coding agents (Claude Code, Codex, Cursor) that helps you generate docs/groupings.md — a taxonomy your agents use when writing doc front matter and maintaining docs/guide.md.

Why a background service

The embedding model is ~275MB and takes a few seconds to load. Spawning a fresh MCP process per query would reload it every time. So fat-docs runs as a single long-lived service on 127.0.0.1:18800, loaded once, multi-tenanting every docs folder on your machine. Each MCP request sends an X-Docs-Dir header pointing at the docs folder it wants.

The service is installed via:

macOS: launchd user agent (~/Library/LaunchAgents/com.fat-docs.server.plist)
Linux: systemd --user unit (~/.config/systemd/user/fat-docs.service)
Windows: Task Scheduler task at logon (\fat-docs)

No admin privileges. Runs at login, restarts on crash.

Quick start

# In your project
npx fat-docs init

Interactive setup walks you through:

Installing the background service (first time only; subsequent projects reuse it)
Scaffolding the canonical docs/ structure (product/, technical/patterns/, technical/runbooks/ with overview files) — existing files are never overwritten
Writing the .mcp.json entry pointing at the service
Adding docs/.fat-docs/ to .gitignore
Appending a "docs search is available" block to CLAUDE.md and/or AGENTS.md (each opt-in)
Installing the groupings skill for Claude Code, Codex, and Cursor (each opt-in)

Then:

# In your agent: invoke the `groupings` skill to generate docs/groupings.md
# Then:
npx fat-docs index

Restart your MCP client and search_docs is available.

Canonical docs structure

fat-docs is opinionated about a base layout every docs/ folder should have. On init, the following files are scaffolded (existing files are never overwritten):

docs/
├── product/
│   └── overview.md           # product-level docs: what the thing does, who it's for
└── technical/
    ├── overview.md
    ├── patterns/
    │   └── overview.md       # code conventions, patterns, and rules
    └── runbooks/
        └── overview.md       # operational how-tos (local dev, deploy, debug, onboarding)

Each overview.md describes its folder's purpose and serves as a mini-index of the docs within. Agents maintain these as docs are added (per the memory-file instructions).

Why opinionated? Generic engineering tooling — the /dev command in your agent, onboarding automation, agent-driven code review — benefits from a predictable place to find patterns and runbooks. Anyone landing in the repo should know where to look.

You're free to add other top-level folders (architecture/, business/, whatever) as needed; fat-docs only cares about the scaffolded ones.

How the pieces fit together

┌───────────────────────────────────────────────────────────────┐
│  Your project                                                 │
│                                                               │
│  docs/                                                        │
│    outline.md                 ← maintained by your agent      │
│    guide.md                   ← maintained by your agent      │
│    groupings.md               ← taxonomy (from skill)         │
│    product/overview.md        ← scaffolded by `init`          │
│    technical/overview.md      ← scaffolded by `init`          │
│    technical/patterns/        ← scaffolded by `init`          │
│    technical/runbooks/        ← scaffolded by `init`          │
│    .fat-docs/docs.db          ← search index (gitignored)     │
│                                                               │
│  .mcp.json                    ← points at the service         │
│  CLAUDE.md / AGENTS.md        ← "you can use search_docs"     │
│  .claude/skills/groupings/    ← skill for Claude Code         │
│  .codex/skills/groupings/     ← skill for Codex               │
│  .cursor/rules/groupings.mdc  ← skill for Cursor              │
└───────────────────────────────────────────────────────────────┘
                        │
                        │ HTTP (X-Docs-Dir header)
                        ▼
┌───────────────────────────────────────────────────────────────┐
│  fat-docs background service (one per machine)                │
│  - loaded embedding model                                     │
│  - serves every project concurrently                          │
│  - logs to ~/Library/Logs/fat-docs.{log,err} (macOS)          │
└───────────────────────────────────────────────────────────────┘

When your agent edits docs, the instructions in CLAUDE.md / AGENTS.md tell it to keep outline.md and guide.md up to date and to tag new docs with groups: (keys from groupings.md). That's how the system stays coherent without you having to maintain it.

CLI

fat-docs init [--yes]                # interactive setup for this project (--yes: accept all defaults)
fat-docs install                     # install + start the service (done automatically by init)
fat-docs uninstall                   # stop + remove the service
fat-docs start                       # start the service
fat-docs stop                        # stop the service (without uninstalling)
fat-docs restart                     # restart the service
fat-docs status                      # is it running? which port? uptime?
fat-docs logs [--follow]             # tail the logs
fat-docs index [--force]             # build/rebuild the index for the current repo
fat-docs search <query> [--limit N] [--path-filter PATH] [--tags a,b] [--groups a,b]
fat-docs serve                       # run the server in the foreground (debugging)

Non-interactive environments (CI, piped stdin) must pass --yes to accept all defaults; otherwise init errors out.

`.mcp.json` shape

{
  "mcpServers": {
    "docs": {
      "type": "http",
      "url": "http://127.0.0.1:18800/mcp",
      "headers": {
        "X-Docs-Dir": "/absolute/path/to/project/docs"
      }
    }
  }
}

Front matter

All fields optional. Used by search_docs as filters and by your agent when routing.

---
title: Agent Lifecycle
summary: How agents are created, paused, and retired.
groups: [agents, governance]
tags: [lifecycle, state-machine]
hidden: false
---

groups — keys drawn from docs/groupings.md. Taxonomy-based; used by guide.md routing and the search_docs groups filter.
tags — free-form strings; used by the search_docs tags filter.
hidden: true — exclude from indexing.

`.docignore`

Optional docs/.docignore, gitignore-style rules:

drafts/**
internal/*.md
!internal/public-stuff.md

File locations

Per-project (inside your repo):

docs/ — your markdown
docs/.fat-docs/docs.db — SQLite index (init offers to add docs/.fat-docs/ to .gitignore)
.mcp.json — MCP client config
CLAUDE.md / AGENTS.md — memory-file blocks that teach agents the system (each opt-in at init time)
.claude/skills/groupings/SKILL.md, .codex/skills/groupings/SKILL.md, .cursor/rules/groupings.mdc — groupings skill for each agent (each opt-in at init time)

Per-machine:

~/.fat-docs/model-cache/ — downloaded embedding model (~275MB, shared across projects)
~/.fat-docs/server.port — port the service is listening on
Logs:
- macOS: ~/Library/Logs/fat-docs.{log,err}
- Linux: ~/.local/state/fat-docs/fat-docs.{log,err}
- Windows: %LOCALAPPDATA%\fat-docs\logs\fat-docs.{log,err}

License

MIT

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

fat-docs

What it does

Why a background service

Quick start

Canonical docs structure

How the pieces fit together

CLI

`.mcp.json` shape

Front matter

`.docignore`

File locations

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
.claude/skills/groupings		.claude/skills/groupings
.codex/skills/groupings		.codex/skills/groupings
.cursor/rules		.cursor/rules
docs		docs
src		src
.gitignore		.gitignore
.mcp.json		.mcp.json
AGENTS.md		AGENTS.md
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
README.md		README.md
package-lock.json		package-lock.json
package.json		package.json

Folders and files

Latest commit

History

Repository files navigation

fat-docs

What it does

Why a background service

Quick start

Canonical docs structure

How the pieces fit together

CLI

.mcp.json shape

Front matter

.docignore

File locations

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

`.mcp.json` shape

`.docignore`

Packages