vibeMemory

A self-hosted local memory stack for MCP-compatible editors (Cursor, VS Code, Windsurf, Zed, etc.).

Stores, searches, and manages semantic memories backed by Qdrant vector storage — accessible via any MCP client or the included browser dashboard.

Architecture

┌─────────────────────┐     MCP (streamable HTTP)      ┌──────────────────┐
│  IDE / MCP Client   │ ──────────────────────────────► │  server.py :8000 │
│  (Cursor, VS Code…) │                                 │  FastMCP tools   │
└─────────────────────┘                                 └────────┬─────────┘
                                                                 │ Qdrant client
┌─────────────────────┐     REST (browser-safe)                  ▼
│  Dashboard :8080    │ ──────────────────────────────► ┌──────────────────┐
│  FastAPI + HTML/JS  │                                 │  Qdrant :6333    │
└─────────────────────┘                                 │  (vector store)  │
                                                        └──────────────────┘

Quick Start (Podman Desktop)

1. Start Qdrant

podman run -d --name qdrant -p 6333:6333 qdrant/qdrant

2. Configure environment

cp .env.example .env
# Edit .env if you need non-default values

3. Start the memory server

uv run python server.py

4. Verify

curl http://localhost:8000/health

Full stack with Podman Compose

podman compose up -d

MCP Client Setup

Copy mcp.json from the project root to your IDE's config location:

Editor	Config location
Cursor	`.cursor/mcp.json`
VS Code (MCP ext)	`.vscode/settings.json` → `"mcp.servers"`
Windsurf	MCP config panel
Zed	`~/.config/zed/settings.json`

The endpoint is always http://localhost:8000/mcp.

MCP Tools

Tool	Description	Required Args	Optional Args
`remember`	Store a memory; merges with an existing one if cosine similarity >= threshold	`text`	`scope`, `tags`, `source`
`recall`	Retrieve memories semantically ranked by similarity to a query	`query`	`scope`, `limit`
`forget`	Delete a memory by UUID	`id`	—
`list_memories`	Browse all memories in a scope (no semantic ranking)	—	`scope`, `limit`

Environment Variables

Variable	Required	Default	Description
`QDRANT_URL`	No	`http://localhost:6333`	URL of the Qdrant instance
`QDRANT_COLLECTION`	No	`memories`	Qdrant collection name
`EMBED_MODEL`	No	`BAAI/bge-small-en-v1.5`	FastEmbed model (384-dim, CPU-friendly)
`SIMILARITY_THRESHOLD`	No	`0.92`	Cosine similarity above which memories are merged
`MAX_TEXT_LEN`	No	`2000`	Characters to store per memory (text is truncated)
`MEMORY_SERVER_URL`	No	`http://localhost:8000`	Dashboard -> memory server URL
`HF_TOKEN`	No	(unset)	Hugging Face token (avoids download rate limits)

Dashboard

Open http://localhost:8080 after starting the stack. Features:

Scope selector — switch between memory namespaces
Semantic search — query memories by meaning
Browse all — list all memories in a scope
Per-memory delete — remove individual entries
Score distribution chart (Chart.js)

Project Structure

vibeMemory/
├── server.py              # FastMCP memory server (port 8000)
├── dashboard/
│   ├── app.py             # FastAPI dashboard shim (port 8080)
│   └── static/
│       ├── index.html     # Dashboard UI
│       └── ui.css         # daisyUI theme
├── Containerfile          # Memory server container image
├── dashboard/Containerfile
├── compose.yaml           # Podman Compose — full stack
├── mcp.json               # IDE-agnostic MCP client config
├── .env.example           # Environment variable reference
├── pyproject.toml         # Python project + dependencies
└── docs/                  # Design docs, specs, implementation plan

Notes

First run: FastEmbed downloads the embedding model (~67 MB). Subsequent starts are fast.
Merge threshold: Lower SIMILARITY_THRESHOLD to merge more aggressively; raise it to preserve distinct memories.
Scope isolation: Each scope value is an independent namespace — memories in "work" are never returned when searching "personal".

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.claude/plans		.claude/plans
.opencode		.opencode
dashboard		dashboard
docs		docs
.DS_Store		.DS_Store
.env.example		.env.example
.gitignore		.gitignore
.python-version		.python-version
Containerfile		Containerfile
README.md		README.md
biome.json		biome.json
compose.yaml		compose.yaml
main.py		main.py
mcp.json		mcp.json
memory.py		memory.py
opencode.json		opencode.json
package-lock.json		package-lock.json
package.json		package.json
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
server.py		server.py
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

vibeMemory

Architecture

Quick Start (Podman Desktop)

1. Start Qdrant

2. Configure environment

3. Start the memory server

4. Verify

Full stack with Podman Compose

MCP Client Setup

MCP Tools

Environment Variables

Dashboard

Project Structure

Notes

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

vibeMemory

Architecture

Quick Start (Podman Desktop)

1. Start Qdrant

2. Configure environment

3. Start the memory server

4. Verify

Full stack with Podman Compose

MCP Client Setup

MCP Tools

Environment Variables

Dashboard

Project Structure

Notes

About

Resources

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages