Open-source memory and context infrastructure for AI agents
RetainDB is a self-hostable memory layer for AI agents. Give your agents persistent, structured memory that survives across sessions, users, and time — with a single `docker compose up`.
- Stores memories extracted from conversations via LLM — typed, versioned, confidence-scored
- Retrieves context in one call — semantic search + BM25 + rerank, packed into a context string ready for your LLM
- Memory graph — memories relate to each other (`updates`, `contradicts`, `supports`, `derives`)
- Temporal validity — facts have `validFrom`/`validUntil`; stale knowledge gets superseded, not lost
- MCP server — works with Claude Desktop and any MCP host out of the box
- Framework adapters — Vercel AI SDK, LangChain, LangGraph
```sh
git clone https://github.com/retaindb/retaindb
cd retaindb
docker compose up
# Server ready at http://localhost:3000
```

No config needed. No API keys required for local use.
To enable OpenAI embeddings (optional — falls back to a local model otherwise):
```sh
OPENAI_API_KEY=sk-... docker compose up
```

If you don't have Docker, you can run the server directly with Node.js. You'll need Postgres with the pgvector extension installed.
1. Install Postgres + pgvector
   - macOS: `brew install postgresql pgvector`, then `brew services start postgresql`
   - Ubuntu/Debian: `sudo apt install postgresql`, then install pgvector from github.com/pgvector/pgvector
   - Windows: use the official Postgres installer + pgvector from the pgvector releases page
2. Create the database

```sh
psql -U postgres -c "CREATE USER retaindb WITH PASSWORD 'retaindb';"
psql -U postgres -c "CREATE DATABASE retaindb OWNER retaindb;"
psql -U postgres -d retaindb -c "CREATE EXTENSION IF NOT EXISTS vector;"
```

3. Clone and configure
```sh
git clone https://github.com/retaindb/retaindb
cd retaindb
cp .env.example packages/server/.env
# Edit packages/server/.env and set DATABASE_URL if needed
```

4. Install dependencies and run
```sh
npm install -g pnpm
pnpm install
pnpm --filter @retaindb/server run db:push   # apply schema
pnpm dev:server                              # start the server
# Server ready at http://localhost:3000
```

- The OSS server is single-tenant by default. Requests are scoped to a local default org on the server side, so clients do not need to send `X-Organization-Id`.
- If you do set `RETAINDB_API_KEY`, auth becomes a single shared server key for your deployment. This is for self-hosted protection, not multi-tenant cloud isolation.
- The `playwright` connector degrades to the standard web crawler in OSS instead of relying on the cloud browser-agent stack.
```sh
npm install @retaindb/sdk
```

```ts
import { RetainDBContext } from "@retaindb/sdk";

const db = new RetainDBContext({
  apiKey: "", // leave blank if no RETAINDB_API_KEY set
  baseUrl: "http://localhost:3000",
  project: "my-agent",
});
```

```ts
await db.addMemory({
  project: "my-agent",
  content: "User prefers concise answers and hates bullet points",
  memory_type: "preference",
  user_id: "user_123",
});
```

```ts
const { context } = await db.query({
  project: "my-agent",
  query: "What are this user's preferences?",
  user_id: "user_123",
  include_memories: true,
});
// context is a pre-packed string — drop it straight into your system prompt
```

```ts
await db.ingestSession({
  project: "my-agent",
  session_id: "session_abc",
  user_id: "user_123",
  messages: [
    { role: "user", content: "I'm building a SaaS in Next.js" },
    { role: "assistant", content: "Got it! What's the stack?" },
    { role: "user", content: "Next.js, Prisma, Postgres. Deploying on Vercel." },
  ],
});
// Memories like "User is building a Next.js SaaS on Vercel with Prisma + Postgres"
// are extracted and stored automatically.
```

```ts
import { streamText } from "ai";
import { openai } from "@ai-sdk/openai";
import { withRetainDB } from "@retaindb/sdk/ai-sdk";

const { context } = await db.query({ project: "my-agent", query: userMessage, user_id });

const result = await streamText({
  model: openai("gpt-4o"),
  system: `You are a helpful assistant.\n\n${context}`,
  messages,
});
```

```sh
npx @retaindb/mcp
```

Add to `claude_desktop_config.json`:
```json
{
  "mcpServers": {
    "retaindb": {
      "command": "npx",
      "args": ["@retaindb/mcp"],
      "env": {
        "RETAINDB_BASE_URL": "http://localhost:3000"
      }
    }
  }
}
```

Claude will now call `context`, `remember`, `forget`, `compress`, `index`, and `search` automatically.
All endpoints require `Authorization: Bearer <RETAINDB_API_KEY>` if you set one; otherwise the API is open.
| Method | Path | Description |
|---|---|---|
| POST | `/v1/memory` | Store a memory |
| GET | `/v1/memory/profile/:userId` | List memories for a user |
| POST | `/v1/memory/search` | Semantic search over memories |
| POST | `/v1/memory/ingest/session` | Extract + store memories from a conversation |
| PUT | `/v1/memory/:id` | Update a memory |
| DELETE | `/v1/memory/:id` | Delete a memory |
| POST | `/v1/context/query` | Retrieve packed context for an agent turn |
| GET | `/v1/projects` | List projects |
| POST | `/v1/projects/:id/sources` | Connect a knowledge source |
| POST | `/v1/sources/:id/sync` | Trigger a sync |
In OSS, project visibility is server-scoped and single-tenant. The cloud-style org header model is not required for local use.
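For quick experiments without the SDK, the endpoints above can be called directly over HTTP. The sketch below builds a request for `POST /v1/context/query`; the body fields (`project`, `query`, `user_id`) mirror the SDK's `query()` parameters, which is an assumption — verify the exact schema against your server version.

```typescript
type ContextQuery = { project: string; query: string; user_id?: string };

// Build a fetch-ready request for POST /v1/context/query.
function contextRequest(baseUrl: string, body: ContextQuery, apiKey?: string) {
  const headers: Record<string, string> = { "Content-Type": "application/json" };
  // Bearer auth is only required if RETAINDB_API_KEY is set on the server.
  if (apiKey) headers["Authorization"] = `Bearer ${apiKey}`;
  return {
    url: `${baseUrl}/v1/context/query`,
    method: "POST" as const,
    headers,
    body: JSON.stringify(body),
  };
}

const { url, ...init } = contextRequest("http://localhost:3000", {
  project: "my-agent",
  query: "What are this user's preferences?",
  user_id: "user_123",
});
// const res = await fetch(url, init);
// const { context } = await res.json();
console.log(url); // http://localhost:3000/v1/context/query
```
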
RetainDB's memory model is more structured than most:
```ts
type MemoryType =
  | "factual"       // "User's name is John"
  | "preference"    // "User prefers dark mode"
  | "decision"      // "Project standardises on Bun"
  | "constraint"    // "Must deploy on AWS Lambda"
  | "instruction"   // "Always use formal tone"
  | "goal"          // "User wants to learn Python"
  | "event"         // "User attended conf on Jan 15"
  | "correction"    // Supersedes a stale memory
  | "workflow"      // Reusable agent habit
  | ...             // + relationship, opinion, solution, project_state

type RelationType =
  | "updates"       // New fact supersedes old
  | "extends"       // Adds detail to existing memory
  | "contradicts"   // Conflicts detected
  | "supports"      // Provides evidence for
  | "derives"       // Inferred from other memories
```

Each memory has `validFrom`, `validUntil`, `confidence`, `entityMentions`, and a version chain — so the graph stays accurate as your agent learns over time.
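To make temporal validity concrete, here is an illustrative sketch — not the server's actual internals — of how an `updates` relation plus `validFrom`/`validUntil` supersedes a fact without deleting it. The `Memory` shape below is simplified for the example:

```typescript
type Memory = {
  id: string;
  content: string;
  validFrom: string;     // ISO date the fact became true
  validUntil?: string;   // set when a newer version supersedes this one
  supersededBy?: string; // "updates" edge to the replacing memory
};

// Apply an update: close the old memory's validity window instead of deleting it.
function supersede(memories: Memory[], oldId: string, next: Memory): Memory[] {
  return [
    ...memories.map((m) =>
      m.id === oldId
        ? { ...m, validUntil: next.validFrom, supersededBy: next.id }
        : m,
    ),
    next,
  ];
}

// Only memories with an open validity window are "current".
const current = (memories: Memory[]) => memories.filter((m) => !m.validUntil);

let mems: Memory[] = [
  { id: "m1", content: "User deploys on Heroku", validFrom: "2024-01-01" },
];
mems = supersede(mems, "m1", {
  id: "m2",
  content: "User deploys on Vercel",
  validFrom: "2024-06-01",
});
console.log(current(mems).map((m) => m.content)); // [ 'User deploys on Vercel' ]
```

The stale Heroku fact stays queryable (e.g. for audit or "as of" questions) but drops out of current context.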
Index external knowledge into your agent's context:
| Connector | Type |
|---|---|
| GitHub | Repos, code, issues |
| Web / Sitemap | Docs sites, pages |
| Local or remote | |
| Notion | Pages and databases |
| Confluence | Spaces and pages |
| Slack | Channel history |
| Discord | Server history |
| arXiv | Papers |
| npm / PyPI | Package docs |
| HuggingFace | Model cards |
| Plain text | Inline content |
| Env var | Default | Description |
|---|---|---|
| `DATABASE_URL` | — | Postgres connection string (required) |
| `RETAINDB_API_KEY` | unset | Auth key. Unset = open access (fine for localhost). |
| `OPENAI_API_KEY` | unset | Enables OpenAI embeddings (`text-embedding-3-small`). Without it, a local BGE model is used. |
| `EMBEDDING_MODE` | `auto` | One of `auto`, `openai`, `local`, or `hybrid` |
| `EXTRACTION_MODEL` | `gpt-4o-mini` | LLM used to extract memories from conversations |
| `PORT` | `3000` | HTTP port |
| `DISABLE_SCHEDULER` | `false` | Disable background sync scheduler |
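Putting the table together, a typical local `packages/server/.env` might look like this — the values are illustrative, matching the `psql` setup commands above:

```env
DATABASE_URL=postgres://retaindb:retaindb@localhost:5432/retaindb
# RETAINDB_API_KEY=change-me   # optional: require Bearer auth on all endpoints
# OPENAI_API_KEY=sk-...        # optional: use OpenAI embeddings
EMBEDDING_MODE=auto
PORT=3000
DISABLE_SCHEDULER=false
```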
This repo is the self-hosted version. RetainDB Cloud adds:
- Managed Postgres + pgvector — no ops
- Higher-quality embeddings and proprietary reranking
- Additional connectors (email, video transcription, live streams)
- Memory analytics — recall heatmaps, drift detection
- Team access controls and audit logs
- SLA + support
The cloud is not a crippled version of this — it's this plus the infrastructure and scale layer that most teams don't want to operate.
| Package | npm | Description |
|---|---|---|
| `packages/server` | — | Self-hostable API server |
| `packages/sdk` | `@retaindb/sdk` | TypeScript SDK |
| `packages/mcp` | `@retaindb/mcp` | MCP server |
PRs welcome. Open an issue first for anything substantial.
```sh
pnpm install
cp .env.example .env   # edit DATABASE_URL
cd packages/server
pnpm db:push   # apply schema to your local Postgres
pnpm dev       # start the server with hot reload
```

- `packages/sdk` — Apache 2.0
- `packages/mcp` — Apache 2.0
- `packages/server` — Business Source License 1.1. Free to self-host; building a hosted service on top requires a commercial license — reach out.