CodeBuddy

Local-first MCP memory and automatic project-context engine for multi-agent coding. Curated facts, summaries, and plans are written as readable Markdown under .codebuddy/ and indexed in PostgreSQL with pgvector. On top of that, CodeBuddy prepares the smallest useful context for an agent before it edits, captures durable knowledge after a turn, measures the token savings, and warns before risky changes — without the user manually juggling remember/recall.

What It Ships

codebuddy is a single npm package with four surfaces:

TypeScript SDK helpers
MCP server over stdio (memory, plan, risk, map, and the v2.0 context tools)
CLI for setup, indexing, context, savings, suggestions, review, and health
LangGraph helpers

See the 2.0 migration notes if you are upgrading from 0.1.x, and degraded-mode behavior for how the tools fail soft.

Requirements

Node 20+
PostgreSQL 16+
pgvector
Hugging Face token

Optional peer dependency:

@langchain/langgraph: ^0.2.0 || ^0.3.0

Quick Start

Install once globally, then let the interactive wizard wire everything:

npm install -g @ayushkumar320/codebuddy
codebuddy init

codebuddy init prompts for your Hugging Face token, validates your Postgres URL, offers to start the bundled pgvector/pgvector:pg16 container if Postgres isn't reachable, applies migrations, and optionally registers itself with Claude Desktop.

To wire any additional project into Claude Desktop with its own namespace:

cd ~/Projects/your-project
codebuddy claude install --namespace your-project

Then restart Claude Desktop (⌘Q + reopen) and the new namespace's tools are live.

Team setup

CodeBuddy is designed so multiple developers or agents can share project knowledge without sharing local secrets.

codebuddy init and codebuddy use create this project scaffold:

.codebuddy/
├── .gitignore          # keeps local secrets, caches, and locks out of git
├── config.json         # local-only Postgres/token settings
├── memory/
│   ├── facts/          # reviewable Markdown facts
│   └── summaries/      # reviewable Markdown summaries
└── plans/              # reviewable Markdown work plans

Commit .codebuddy/memory/**/*.md and .codebuddy/plans/*.md when you want the team to share agent memory and plans. Do not commit .codebuddy/config.json; each teammate runs codebuddy use or codebuddy init locally with their own token and database URL.

Use one namespace per project, for example:

codebuddy use my-project

That keeps each project isolated while still allowing explicit sharing through the share tool when you want one namespace to hand facts to another.

Manual setup (if you prefer)

# 1. Start Postgres (pgvector image bundled)
codebuddy postgres up

# 2. Apply migrations
codebuddy migrate

# 3. Run the MCP stdio server
codebuddy serve

Useful commands

codebuddy doctor              # DB, pgvector, vector index, HF model health
codebuddy claude list         # see all codebuddy entries in Claude Desktop
codebuddy claude remove <key> # detach a namespace from Claude Desktop
codebuddy postgres down       # stop the bundled DB (volume preserved)
codebuddy stats               # storage + usage stats
codebuddy inspect <namespace> # contents + per-agent breakdown
codebuddy list-facts          # paginate facts
codebuddy migrate to-files    # preview export of existing DB memories
codebuddy migrate to-files --write
codebuddy reindex --full      # rebuild facts + summaries from Markdown
codebuddy risk assess --git   # assess risk for current git changes
codebuddy map build           # summarize the lightweight import graph
codebuddy index               # build the background file index
codebuddy context bootstrap   # preview start-of-turn project context
codebuddy savings             # measured token savings from summaries
codebuddy suggest --git       # read-only, evidence-backed suggestions
codebuddy memory review       # curate auto-captured memory
codebuddy rules install --client claude
codebuddy prune --older-than 30d

Automatic context engine (v2.0)

The v2.0 goal is to make project context automatic: CodeBuddy prepares compact context before an edit, captures durable knowledge after a turn, and proves the token savings — all local-first and inspectable.

Background indexing

codebuddy index          # content hashes + deterministic file summaries
codebuddy index --full   # re-summarize every file
codebuddy watch          # incrementally re-index on change (Ctrl+C to stop)

The index is a rebuildable cache at .codebuddy/cache/index.json (gitignored) — never a source of truth. It respects .gitignore and skips .codebuddy/.

Context at the right moment

codebuddy context bootstrap                       # identity, plan, policy, incidents, architecture
codebuddy context preview "fix OAuth callback"    # before-edit context for the change set
codebuddy context preview --paths src/auth/oauth.ts
codebuddy context explain --git                   # why each piece was included + token savings

Through MCP, agents call context_bootstrap, context_before_edit, and context_after_turn directly (see the tools table below).

Token savings demo

Prove the reduction end-to-end:

codebuddy index                       # summarize the repo
codebuddy savings                     # e.g. "saved N tokens (95% smaller)"
codebuddy context explain --paths src/foo.ts   # per-stage accounting for one change set

Savings are measured against a real baseline — the cost of reading the represented files' raw source — and are clamped so they are never fabricated.

Automatic memory capture with review

context_after_turn (MCP) turns a turn summary into classified candidates. Confident facts, decisions, and incidents auto-save as Markdown; notes, low-confidence, and sensitive items queue for review; chatter is dropped.

codebuddy memory review           # list pending candidates
codebuddy memory review show <id>
codebuddy memory review approve <id>   # promote to a durable fact
codebuddy memory review reject <id>

Read-only suggestions

codebuddy suggest --git                       # or --paths a,b or --plan <id>
codebuddy suggest --paths src/auth/oauth.ts --json

Evidence-backed, severity-ranked findings composed from risk, plan divergence, architecture blast radius, incident history, missing tests, and stale policies. Suggestions never edit code.

Client workflow templates

codebuddy rules show --client claude          # preview the workflow rules
codebuddy rules install --client claude       # managed block in CLAUDE.md
codebuddy rules install --client codex        # managed block in AGENTS.md

Install writes an idempotent block between codebuddy:workflow markers, so re-running updates it in place without touching your other content.

File-backed memory

Facts and summaries written through normal CLI/MCP runtimes are stored in:

.codebuddy/
└── memory/
    ├── facts/
    │   └── fact_<id>.md
    └── summaries/
        └── sum_<id>.md

The Markdown files contain versioned YAML front matter followed by the memory text. PostgreSQL still stores the current searchable records and embeddings, but the files can reconstruct facts and summaries after database loss.

Interactions, shares, audit records, model diagnostics, and generated embeddings remain PostgreSQL-only in this release.

Upgrade existing memories

Preview the migration without writing files:

codebuddy migrate to-files

Write missing fact and summary files:

codebuddy migrate to-files --write

The operation is non-destructive and idempotent. It does not truncate database tables or overwrite existing files. If a file has the same ID but different content, the command reports a conflict with database and file checksums.

Review the output and the generated files before committing them.

Restore after database loss

Start an empty PostgreSQL database, apply migrations, then rebuild:

codebuddy postgres up
codebuddy migrate
codebuddy reindex --full

Reindex restores fact and summary rows and creates pending embeddings. The embedding worker regenerates vectors afterward.

Incident memory

Facts can optionally be tagged as incidents. This lets CodeBuddy remember past breakages and connect them to files:

category: incident
paths:
  - src/auth/oauth.ts
severity: high
introducedBy: abc123
resolvedBy: null

Normal facts still work without these fields. Incident facts are useful for risk checks: if a future plan touches src/auth/oauth.ts, CodeBuddy can find the past incident and warn the agent before it repeats the same mistake.

Risk and architecture map

CodeBuddy can assess change risk from structured memory, project policy, and recent Git churn:

codebuddy risk assess --git
codebuddy risk assess --paths src/auth/oauth.ts,src/api/login.ts
codebuddy risk assess --plan pln_abc123 --explain
codebuddy risk assess --json

Optional project policy lives in .codebuddy/policies.yaml:

rules:
  - id: auth-sensitive
    pattern: "src/auth/**"
    message: "Auth changes need careful review."
    weight: 70

The lightweight architecture map scans TypeScript/JavaScript imports on demand:

codebuddy map build
codebuddy map deps src/cli/index.ts
codebuddy map dependents src/core/codebuddy.ts

Rollback

The migration does not delete old database records. To roll back before a database loss, keep using the same database and remove or ignore the generated .codebuddy/memory/ directory. Back up the directory before deleting it.

After database loss, Markdown restores facts and summaries only. It cannot restore raw interactions, share relationships, audit logs, or diagnostics.

Configuration

Preferred auth source:

HF_TOKEN

Supported config file:

.codebuddy/config.json

Rules:

codebuddy init creates the config file with 0600 permissions
codebuddy doctor warns if permissions are broader than 0600
the Hugging Face token must never be logged and is redacted in errors
CodeBuddy uses Hugging Face only; additional providers are out of scope
the core context, index, savings, and suggestion paths run with no LLM at all

SDK Example

Use createRuntime for production wiring. It creates the Postgres repository, initializes CodeBuddy, and returns a close() hook that drains the worker and closes the database client.

import { createRuntime } from "codebuddy";

const runtime = await createRuntime({
  postgresUrl: process.env.DATABASE_URL!,
  provider: {
    type: "huggingface",
    apiKey: process.env.HF_TOKEN,
  },
  namespace: "research-agent",
  tokenBudget: 8000,
});

const remembered = await runtime.memory.remember({
  sessionId: "sess_123",
  content: "Use layer caching for the Docker build.",
  type: "fact",
  idempotencyKey: "msg_123",
  agentId: "agent-a",
});

const context = await runtime.memory.recall({
  sessionId: "sess_123",
  query: "What do we know about deployment?",
  budget: 6000,
  callerModel: "meta-llama/Llama-3.2-3B-Instruct",
  conflictMode: "all",
});

await runtime.memory.share({
  from: "research-agent",
  to: "ops-agent",
  factIds: [remembered.id],
  mode: "reference",
  agentId: "agent-a",
});

await runtime.close();

Notes:

remember defaults to type: "interaction"
explicit interaction, fact, and summary writes are supported
repeated writes with the same idempotency key or same content return deduplicated: true
embeddings are prepared asynchronously; recall falls back to recency while vectors are pending

MCP Tools

Tool	Input	Output
`remember`	`{ sessionId?, content, type?, idempotencyKey?, agentId? }`	`{ id, sessionId, deduplicated }`
`remember_batch`	`{ items: [{ sessionId?, content, type?, idempotencyKey?, agentId? }] }`	`{ items: [{ id, sessionId, deduplicated }] }`
`recall`	`{ sessionId, query, budget?, callerModel?, conflictMode? }`	`{ system, messages, stats }`
`list_facts`	`{ subject?, limit?: number = 50, cursor?: string }`	`{ facts[], nextCursor? }`
`list_namespaces`	`{}`	`{ namespaces: [{ name, factCount, lastActivity }] }`
`forget`	`{ id }`	`{ ok, entityType }`
`share`	`{ to_namespace, factIds, mode?: "reference" \| "snapshot", agentId? }`	`{ shared }`
`plan_current`	`{}`	`{ plan, policy }`
`plan_create`	`{ title, brief, ... }`	`{ plan }`
`plan_amend`	`{ id, patch }`	`{ plan }`
`plan_status`	`{ id, status, ... }`	`{ plan }`
`risk_assess`	`{ planId?, paths?, useGit?, limit? }`	`{ items, stats }`
`map_query`	`{ from?, to?, limit? }`	`ModuleEdge[]`
`map_neighbours`	`{ module, direction? }`	`{ modules, edges }`
`context_bootstrap`	`{}`	`{ project, plan, policy, policyRules, incidents, architecture, tokens }`
`context_before_edit`	`{ task?, paths?, planId?, useGit? }`	`{ targetPaths, plan, policyRules, incidents, risks, neighbours, tokens }`
`context_after_turn`	`{ summary, changedFiles?, planId?, taskType?, agentId? }`	`{ saved, queuedForReview, ignored, reasons }`
`code_suggestions`	`{ paths?, planId?, useGit?, limit? }`	`{ suggestions, stats }`

MCP currently ships over stdio. HTTP transport remains deferred.

LangGraph

Expected node state shape:

type CodeBuddyState = {
  messages: BaseMessage[];
  memory?: PlannedContext;
  sessionId: string;
  agentId?: string;
};

Example:

import { Annotation, StateGraph } from "@langchain/langgraph";
import type { BaseMessage } from "@langchain/core/messages";
import type { PlannedContext } from "codebuddy";
import { CodeBuddyNode, CodeBuddyCheckpointer } from "codebuddy/langgraph";

const State = Annotation.Root({
  messages: Annotation<BaseMessage[]>({
    reducer: (left, right) => left.concat(right),
    default: () => [],
  }),
  memory: Annotation<PlannedContext | undefined>({
    reducer: (_, right) => right,
    default: () => undefined,
  }),
  sessionId: Annotation<string>({ reducer: (_, right) => right }),
  agentId: Annotation<string | undefined>({
    reducer: (_, right) => right,
    default: () => undefined,
  }),
});

const graph = new StateGraph(State)
  .addNode("recall", new CodeBuddyNode({ memory, mode: "recall", cacheTtlSeconds: 30 }))
  .addNode("agent", agentNode)
  .addNode("remember", new CodeBuddyNode({ memory, mode: "remember" }))
  .compile({ checkpointer: new CodeBuddyCheckpointer({ namespace: "research-agent" }) });

Recall caching defaults to 30 seconds and is keyed by (namespace, sessionId, query).

Examples

examples/claude-desktop: MCP stdio config for Claude Desktop
examples/claude-code: project MCP config for Claude Code
examples/langgraph-agent: LangGraph recall/agent/remember loop
examples/multi-agent-handoff: reference and snapshot sharing across namespaces

For detailed Claude Code and Codex setup with Docker commands, see CLAUDE_CODEX.md.

Hugging Face Free-Tier Notes

Hugging Face-hosted models may be cold, gated, rate-limited, or temporarily unavailable. CodeBuddy handles cold starts, retries, queueing, fallback models, and timeout reporting, but free-tier limits still matter.

Use:

codebuddy doctor

Doctor reports database connectivity, pgvector availability, vector/index health, model reachability, gated-model failures, recent calls, estimated daily cap remaining, and config permissions.

Privacy

CodeBuddy stores facts and summaries in plaintext Markdown
conversations, shares, telemetry, and indexed data are stored in PostgreSQL
inspect .codebuddy/memory/ before committing it because facts may contain secrets
for private project memory, add .codebuddy/memory/ to the project’s .gitignore
for selective sharing, ignore a private subdirectory and keep public memory separate
encryption at rest is the deployment owner’s responsibility
automatic PII detection is not yet shipped
sensitivity metadata and stronger redaction workflows are planned

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 54 Commits
docs		docs
examples		examples
src		src
.env.example		.env.example
.gitignore		.gitignore
.rules		.rules
ARCHITECTURE.md		ARCHITECTURE.md
CLAUDE.md		CLAUDE.md
CLAUDE_CODEX.md		CLAUDE_CODEX.md
LICENSE		LICENSE
README.md		README.md
WORKFLOW.md		WORKFLOW.md
biome.json		biome.json
docker-compose.yml		docker-compose.yml
drizzle.config.ts		drizzle.config.ts
package-lock.json		package-lock.json
package.json		package.json
tsconfig.json		tsconfig.json
tsup.config.ts		tsup.config.ts
vitest.config.ts		vitest.config.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CodeBuddy

What It Ships

Requirements

Quick Start

Team setup

Manual setup (if you prefer)

Useful commands

Automatic context engine (v2.0)

Background indexing

Context at the right moment

Token savings demo

Automatic memory capture with review

Read-only suggestions

Client workflow templates

File-backed memory

Upgrade existing memories

Restore after database loss

Incident memory

Risk and architecture map

Rollback

Configuration

SDK Example

MCP Tools

LangGraph

Examples

Hugging Face Free-Tier Notes

Privacy

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

CodeBuddy

What It Ships

Requirements

Quick Start

Team setup

Manual setup (if you prefer)

Useful commands

Automatic context engine (v2.0)

Background indexing

Context at the right moment

Token savings demo

Automatic memory capture with review

Read-only suggestions

Client workflow templates

File-backed memory

Upgrade existing memories

Restore after database loss

Incident memory

Risk and architecture map

Rollback

Configuration

SDK Example

MCP Tools

LangGraph

Examples

Hugging Face Free-Tier Notes

Privacy

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages