BrainPalace

last_validated	2026-05-30

BrainPalace

Local-first RAG for code & docs, with long-term memory for AI agents. BM25, vector, GraphRAG, and hybrid search over your codebase and documentation — plus session memory, a temporal knowledge graph, and git/LSP-aware indexing. Use it from the CLI, over MCP, or as a Claude Code plugin. Runs fully local on Ollama, or with cloud LLMs.

Install

Pick the path that matches how you'll use BrainPalace. All paths share the same prerequisites and end with the same brainpalace CLI on your PATH.

Prerequisites

Need	Why
Python 3.10+	CLI + server runtime
`pipx`	Isolated install for the CLI (`apt install pipx` or `brew install pipx`, then `pipx ensurepath`)
`git`	The installer fetches BrainPalace from GitHub
One provider — cloud key or Ollama	Cloud: `OPENAI_API_KEY`, `ANTHROPIC_API_KEY`, `COHERE_API_KEY`, `GEMINI_API_KEY`, `GROK_API_KEY`. Local-only: a running Ollama with an embedding model pulled
Claude Code CLI	Only for the Claude Code plugin path below

Install as a Claude Code plugin

Richest UX — 30 slash commands, 3 agents, 2 skills. The setup wizard inside Claude Code installs the CLI + server, configures provider keys, initialises the project, starts the server, and runs the first index — all from one slash command.

Install the plugin:

claude plugins install github:bxw91/brainpalace

Run the setup wizard in Claude Code:
```
/brainpalace-setup
```

Full Claude Code reference: docs/PLUGIN_GUIDE.md.

Install as a CLI or MCP server

Use this if you want brainpalace as a command-line tool, or if you want to connect an MCP-capable editor (Cursor, VS Code Copilot, Cline, Continue, Kilo Code, Zed). One command does both — it will ask which MCP client to wire (or "none" for CLI-only) along the way. Nothing runs until you answer.

curl -sSL https://raw.githubusercontent.com/bxw91/brainpalace/stable/scripts/setup.sh | bash

That's it.

Got more projects to index later? The binary is installed once per machine; every new project needs three commands (brainpalace init --start --watch auto, then brainpalace index .). Full how-to: docs/INSTALL.md → Adding more projects.

Need more control?

CI / no-TTY, step-by-step manual install, low-level install.sh flags, install for other AI runtimes (Codex, OpenCode, Gemini CLI), Windows / WSL2 notes — all in docs/INSTALL.md.

What is BrainPalace

BrainPalace indexes your codebase and documentation, then exposes the resulting search over multiple interfaces so any AI assistant — or you yourself — can answer questions against it. On top of plain retrieval it keeps long-term memory: curated facts, captured coding-session summaries and decisions, and a temporal knowledge graph that tracks how those decisions supersede each other over time. Local-first by default (Ollama), with optional cloud providers for embeddings and summarisation.

Component	What it does
Server (`brainpalace-rag`)	FastAPI backend — indexing pipeline, BM25 + vector + GraphRAG stores, REST API
CLI (`brainpalace-cli`)	Click-based command-line client; primary interface for automation, mono-repos, and standalone use
MCP server (`brainpalace mcp`)	Opt-in stdio shim for non-Claude-Code AI clients (VS Code / Copilot, Cursor, Kilo Code, Cline, Continue, Zed)
Claude Code plugin	30 slash commands, 3 agents, 2 skills for Claude Code users

Features

Hybrid search — BM25 + vector + GraphRAG, fused per query (hybrid, multi) or selectable per call (bm25, vector, graph).
Session intelligence — capture AI-coding sessions into curated memory (remember/recall, markdown source-of-truth), searchable summaries + decisions, and a typed knowledge graph (Decision / Error / File / …). Cross-session linking supersedes stale decisions and promotes durable ones into memory. See SESSION_INDEXING.
Persistent graph backend — opt-in store_type: sqlite with temporal validity (per-edge validity windows, invalidate, timeline); scales past the in-memory default. See GRAPHRAG_GUIDE.
Time-decay ranking — newer chunks rank higher (configurable half-life).
Git-history indexing — commit messages + diff stats as a searchable source, bridging why ↔ what. See GIT_HISTORY.
LSP cross-references (opt-in) — typed calls/extends/implements symbol graph from a real language server. See LSP_INTEGRATION.
AST-aware code chunking — tree-sitter for Python, TypeScript, JavaScript, Java, Kotlin, C, C++, C#, Go, Rust, Swift.
LLM code summaries — optional AI-generated per-chunk descriptions to lift semantic recall on code.
GraphRAG — entity + relationship extraction. Dependency-aware queries: "what calls X", "modules importing Y", "classes extending Z".
Cross-encoder reranking — opt-in two-stage retrieval for higher precision on the top-k.
.gitignore-aware indexing + watching — every project .gitignore (nested files, negation patterns) honoured at index and watch time.
File watcher — per-folder, debounced, post-enqueue cooldown. Default OFF, opt-in per folder (auto).
Multi-instance — one server per project, automatic port allocation, .brainpalace/runtime.json discovery. Helpers: whoami, status --all, stop --url.
URL auto-discovery — CLI walks up from CWD to the owning server. Works correctly in mono-repos.
Incremental indexing — manifest + SHA-256; only changed files re-embed; chunk eviction tracks deletes.
Embedding cache — TTL 3600 s, hit-rate tracked. Cuts provider cost on reindex.
Pluggable providers — embeddings (OpenAI · Cohere · Ollama), summarisation (Anthropic · OpenAI · Gemini · Grok · Ollama). Fully local mode via Ollama for both.

Search Modes

Mode	Best For	Example Query
`HYBRID`	General questions (default)	"How does caching work?"
`VECTOR`	Conceptual understanding	"Explain the architecture"
`BM25`	Exact terms, error codes	"NullPointerException", "getUserById"
`GRAPH`	Relationships, dependencies	"What classes use AuthService?"
`MULTI`	Comprehensive search (all modes via RRF)	"Everything about data validation"

Pluggable Providers

Embedding Providers

Provider	Models	Local
OpenAI	text-embedding-3-large, text-embedding-3-small	No
Cohere	embed-english-v3.0, embed-multilingual-v3.0	No
Ollama	nomic-embed-text, mxbai-embed-large	Yes

Summarisation Providers

Provider	Models	Local
Anthropic	claude-haiku-4-5-20251001, claude-sonnet-4-5-20250514	No
OpenAI	gpt-5, gpt-5-mini	No
Gemini	gemini-3-flash, gemini-3-pro	No
Grok	grok-4, grok-4-fast	No
Ollama	llama4:scout, mistral-small3.2, qwen3-coder	Yes

Fully Local Mode

Run completely offline with Ollama for both embeddings and summarisation:

/brainpalace-providers
# Pick Ollama for both

Or via the CLI: docs/PROVIDER_CONFIGURATION.md.

Claude Code Plugin

The plugin ships 30 slash commands, 3 agents, and 2 skills. Full reference: docs/PLUGIN_GUIDE.md.

Category	Commands
Search	`/brainpalace-search`, `/brainpalace-semantic`, `/brainpalace-vector`, `/brainpalace-keyword`, `/brainpalace-bm25`, `/brainpalace-hybrid`, `/brainpalace-graph`, `/brainpalace-multi`
Server	`/brainpalace-start`, `/brainpalace-stop`, `/brainpalace-status`, `/brainpalace-list`
Index	`/brainpalace-index`, `/brainpalace-folders`, `/brainpalace-inject`, `/brainpalace-reset`, `/brainpalace-types`
Setup	`/brainpalace-setup`, `/brainpalace-install`, `/brainpalace-init`, `/brainpalace-verify`, `/brainpalace-providers`

Agent	Role
Search Assistant	Multi-step search across modes; synthesises answers with citations
Research Assistant	Deep exploration with follow-up queries
Setup Assistant	Guided installation and troubleshooting

Skill	Purpose
`using-brainpalace`	Search-mode selection, query optimisation, API knowledge
`configuring-brainpalace`	Installation, provider configuration, troubleshooting

Project Structure

brainpalace/
├── brainpalace-plugin/                     # Claude Code plugin
│   ├── commands/                            # 30 slash commands
│   ├── agents/                              # 3 intelligent agents
│   ├── skills/                              # 2 context skills
│   └── templates/                           # mcp-config-claude-code.json + sessionstart hook
├── brainpalace-server/                     # FastAPI backend (REST API)
├── brainpalace-cli/                        # CLI + Python SDK + MCP shim
│   └── brainpalace_cli/
│       ├── commands/                        # CLI subcommands incl. `mcp`
│       ├── mcp_server/                      # Opt-in MCP stdio shim
│       └── client/                          # Python SDK
└── docs/                                    # User + developer docs

Documentation

Getting Started

Install (alternative paths) — manual / CI / other AI runtimes / low-level flags
Quick Start — first-run walkthrough
MCP Setup — per-client config for non-Claude-Code AI clients
Plugin Guide — full Claude Code plugin reference
User Guide — CLI usage and feature reference

Reference

API Reference — REST API documentation
Configuration — config.yaml options
Provider Configuration — embedding + summarisation provider setup
Changelog — per-version notes

Architecture

Architecture Overview — components, data flow
GraphRAG Guide — knowledge-graph features
Code Indexing — AST-aware chunking
Deployment — local + production deployment
Developer Guide — monorepo layout, sub-modules, contributing

Development

git clone https://github.com/bxw91/brainpalace.git
cd brainpalace
task install
task before-push      # full quality gate — mandatory before merge

Full setup and contribution workflow: docs/DEVELOPERS_GUIDE.md.

Technology Stack

Server: FastAPI + Uvicorn
Vector Store: ChromaDB (HNSW, cosine similarity)
BM25 Index: LlamaIndex BM25Retriever
Graph Store: LlamaIndex SimplePropertyGraphStore (JSON) or SQLite (persistent, temporal)
Embeddings: OpenAI · Cohere · Ollama
Summarisation: Anthropic · OpenAI · Gemini · Grok · Ollama
AST Parsing: tree-sitter (10+ languages)
CLI: Click + Rich
MCP: Anthropic mcp SDK (stdio transport)
Build: Poetry

Contributing

PRs land on stable. Before pushing, task before-push must pass. See docs/DEVELOPERS_GUIDE.md for monorepo layout, test commands, and release discipline.

License

MIT — see LICENSE and NOTICE for details.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
.claude-plugin		.claude-plugin
.github		.github
brainpalace-cli		brainpalace-cli
brainpalace-plugin		brainpalace-plugin
brainpalace-server		brainpalace-server
docs		docs
e2e-cli		e2e-cli
e2e		e2e
playgrounds		playgrounds
scripts		scripts
tests/validation		tests/validation
.env.example		.env.example
.gitignore		.gitignore
LICENSE		LICENSE
NOTICE		NOTICE
README.md		README.md
Taskfile.yml		Taskfile.yml
config.yaml.example		config.yaml.example

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

BrainPalace

Install

Prerequisites

Install as a Claude Code plugin

Install as a CLI or MCP server

Need more control?

What is BrainPalace

Features

Search Modes

Pluggable Providers

Embedding Providers

Summarisation Providers

Fully Local Mode

Claude Code Plugin

Project Structure

Documentation

Getting Started

Reference

Architecture

Development

Technology Stack

Contributing

License

Links

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

BrainPalace

Install

Prerequisites

Install as a Claude Code plugin

Install as a CLI or MCP server

Need more control?

What is BrainPalace

Features

Search Modes

Pluggable Providers

Embedding Providers

Summarisation Providers

Fully Local Mode

Claude Code Plugin

Project Structure

Documentation

Getting Started

Reference

Architecture

Development

Technology Stack

Contributing

License

Links

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages