Engram

Your Personal AI Assistant With Memory

Local-first AI chat that remembers everything, builds knowledge graphs, learns skills autonomously, searches the web, and keeps all your data on your machine.

Screenshots

Chat with Web Search

Chat interface with live web search results

Settings

Sidebar Navigation

Sidebar with conversations, search, notes, budget, and more

Install

Requires Docker only. One command to start.

Quick Start

git clone https://github.com/engramsoftware/engram.git
cd engram
docker compose up

Open http://localhost:8000 in your browser. That is it.

The first build takes a few minutes (downloads Python packages and builds the frontend). After that, starts in seconds.

Stop and Start

docker compose down      # stop
docker compose up -d     # start in background
docker compose logs -f   # view logs

Uninstall

docker compose down -v   # stop and delete all data

Or keep your data and just remove the container: docker compose down (data persists in the Docker volume).

Troubleshooting

Windows: Engram works on Windows. If you see exec /app/docker-entrypoint.sh: no such file or directory, ensure you have the latest code (git pull). The repo includes a fix for Windows line endings; the Dockerfile normalizes scripts during build.

What Gets Installed

When you run docker compose up, Docker builds the image in two stages and downloads everything Engram needs. Here is exactly what happens and why.

Stage 1: Frontend Build (Node.js)

Docker pulls Node.js 20 and builds the React frontend into static HTML/CSS/JS files.

Component	What It Is	Why Engram Needs It
React 18	UI framework	Renders the entire chat interface — conversations, sidebar, settings, notes, budget
TypeScript	Typed JavaScript	Catches bugs at compile time instead of runtime, keeps the frontend maintainable
Tailwind CSS	Utility-first CSS framework	Styles every component without writing custom CSS files — dark theme, responsive layout
Vite	Build tool and dev server	Bundles and optimizes the frontend for production (fast builds, tree-shaking, minification)
Zustand	State management	Manages app state (active conversation, messages, settings) without prop-drilling
React Router	Client-side routing	Handles navigation between chat, settings, and other pages without full page reloads
react-markdown + remark-gfm	Markdown renderer	Renders LLM responses as formatted text with tables, code blocks, bold, links, lists
react-syntax-highlighter	Code highlighting	Syntax-colors code blocks in LLM responses (Python, JavaScript, SQL, etc.)
Lucide React	Icon library	Provides all icons in the UI (send button, settings gear, sidebar icons, etc.)
date-fns	Date formatting	Displays message timestamps, schedule dates, and relative time ("2 hours ago")

After building, the compiled frontend (~2 MB) is copied into the backend container to be served as static files. No separate frontend server needed.

Stage 2: Backend (Python 3.11)

Docker pulls Python 3.11 slim and installs the backend with all its dependencies.

Core Server

Package	What It Is	Why Engram Needs It
FastAPI	Async web framework	Handles all API routes — messages, auth, settings, budget, notes, documents, schedule
uvicorn	ASGI server	Runs FastAPI in production with async I/O support and Server-Sent Events for streaming
python-multipart	Form data parser	Handles file uploads (RAG documents, profile images)
pydantic	Data validation	Validates every API request and response with type-safe models
python-dotenv	Environment variables	Loads configuration from `.env` files

Database and Storage

Package	What It Is	Why Engram Needs It
aiosqlite	Async SQLite driver	Main database for users, conversations, messages, settings, budget, schedule — zero-install, no separate server
ChromaDB	Vector database	Stores and searches embeddings for memories, messages, documents, and negative knowledge — enables semantic "meaning-based" search instead of just keyword matching
neo4j (driver)	Graph database client	Connects to Neo4j Aura (optional) for the knowledge graph — stores entities and relationships extracted from conversations

Search and Retrieval

Package	What It Is	Why Engram Needs It
rank-bm25	BM25 ranking algorithm	Keyword-based search scoring — combined with vector search for hybrid retrieval (catches exact terms vectors might miss)
sentence-transformers	ML model framework	Runs the cross-encoder re-ranker that scores how well a retrieved result actually answers the query (second-pass quality filter)
beautifulsoup4 + markdownify	HTML parser and converter	Fetches and cleans web pages into readable text for the LLM to reference in responses
diskcache	Persistent cache	Caches fetched web page content to avoid re-downloading the same URL repeatedly

AI and NLP

Package	What It Is	Why Engram Needs It
GLiNER	Named entity recognition	Extracts entities (people, places, organizations) from your queries at search time to find matching nodes in the knowledge graph
httpx	Async HTTP client	Communicates with LLM providers (OpenAI, Anthropic, LM Studio, Ollama) — handles streaming responses, retries, and timeouts

Security and Auth

Package	What It Is	Why Engram Needs It
PyJWT + python-jose	JSON Web Tokens	Authenticates API requests — every message and settings change requires a valid JWT token
bcrypt	Password hashing	Hashes user passwords so they are never stored in plain text
cryptography	Encryption library	Encrypts API keys at rest in the database using Fernet (AES-128-CBC) — your OpenAI/Anthropic keys are never stored as plain text

Document Processing

Package	What It Is	Why Engram Needs It
pypdf	PDF parser	Extracts text from uploaded PDF files for RAG (ask questions about your documents)
python-docx	Word document parser	Extracts text from uploaded .docx files for RAG
python-dateutil	Flexible date parser	Parses dates from LLM output in any format — "tomorrow at 3pm", "in 2 hours", "2026-02-15 14:00" — used for scheduled emails and calendar events

MCP Server

Package	What It Is	Why Engram Needs It
mcp	Model Context Protocol SDK	Runs the MCP server with 37 tools that integrates with Windsurf and other MCP-compatible editors — gives your code editor access to Engram's knowledge base, conversation history, and skill playbooks

Pre-Downloaded ML Models

After installing packages, Docker pre-downloads two ML models so your first message is not slow:

Model	Size	What It Does
all-MiniLM-L6-v2 (ONNX)	~23 MB	ChromaDB's default embedding model — converts text into 384-dimensional vectors for semantic search across memories, messages, and documents
ms-marco-MiniLM-L-6-v2 (cross-encoder)	~23 MB	Re-ranks search results by scoring query-passage relevance — makes retrieval more accurate by filtering out false positives from the initial vector search

Both models run locally inside the container. No API calls, no cost, no data leaves your machine.

First-Run Setup

On the very first startup, the entrypoint script:

Generates a secure JWT secret (64-character random hex) so your auth tokens cannot be forged
Generates a secure encryption key (32-character random hex) for encrypting API keys at rest
Creates all data directories under /data for SQLite, ChromaDB, uploads, skills, and logs
Starts the FastAPI server on port 8000, serving both the API and the compiled React frontend

After the first run, subsequent starts skip key generation and boot in seconds.

Features

Multi-Provider LLM Support

Connect to any LLM provider and switch models mid-conversation. Configure providers in the Settings tab.

Provider	Cost	Setup
OpenAI (GPT-4o, o1, etc.)	Per-token	API key from platform.openai.com
Anthropic (Claude 3.5 Sonnet, Haiku, etc.)	Per-token	API key from console.anthropic.com
LM Studio	Free, runs locally	Download from lmstudio.ai, start server
Ollama	Free, runs locally	Download from ollama.com, run `ollama serve`

Autonomous Memory

Engram learns from every conversation automatically. You do not need to save anything manually.

Extracts facts, preferences, decisions, skills, goals, and relationships from your messages
Resolves conflicts when new information contradicts something it already knows
Injects relevant memories into future conversations so the AI has context about you
Tracks negative knowledge so it remembers what failed or what to avoid

Knowledge Graph

Builds a Neo4j knowledge graph from your conversations over time.

LLM-based entity and relationship extraction (people, places, technologies, preferences)
Dynamic relationship labels generated by the LLM (LIVES_IN, PREFERS, WORKS_AT, etc.)
2-hop graph traversal at query time for deep contextual retrieval
Temporal decay so recent knowledge is weighted higher than old information
Community detection groups related entities by topic
Old relationships are marked inactive rather than deleted, preserving history

Web Search

Searches the web via the Brave Search API when you ask.

Brave Search (2000 free queries/month)
Adaptive gate decides IF a query needs web search (no LLM call, pure heuristics)
LLM-powered query reformulation that understands conversation context ("search it" becomes a real query)
Returns numbered search results with titles, URLs, and descriptions

RAG Documents

Upload PDFs, DOCX, TXT, or Markdown files. Engram chunks them, generates embeddings, and retrieves relevant sections when you ask questions about them.

Notes

Create notes manually through the sidebar, or let the AI auto-save important information from conversations. Notes are searchable and injected as context when relevant.

Personas

Custom AI personalities with configurable system prompts. Switch between different personas (assistant, tutor, coder, researcher) depending on what you need.

Skill Voyager (Autonomous Learning)

Voyager-inspired skill learning system that improves over time.

Classifies every query (factual, research, creative, technical, conversational) and matches it to learned skills
Applies skill strategies by injecting them into the LLM's context before generation
Evaluates response quality and updates skill confidence with asymmetric EMA
Self-reflects on failures and evolves skill strategies in-place
Learns from corrections — edits, regenerations, and thumbs-down reduce skill confidence
Retrieval learning — tracks which retrieval sources (memory, graph, web, hybrid) work best per query type
Curriculum engine proposes new skills to fill knowledge gaps
Works across both chat and OpenAI-compatible API (code agents train skills too)
Skill lifecycle: candidate → verified → mastered → deprecated
Toggle on/off per user in Settings → Addins. Fully optional — nothing breaks if disabled.

Plugin System

Extensible plugin architecture with built-in plugins:

Skill Voyager - autonomous skill learning (interceptor + GUI dashboard)
Code Improver - code analysis and adaptive retrieval
Calculator - safe math evaluation using AST parsing (no eval)
Image Generator - image generation support
Auto Translator - automatic message translation
Mood Journal - track emotional patterns over time
Pomodoro - focus timer with session tracking
Dice Roller - random number generation for tabletop games
Word Counter - text statistics and word count
Web Search - search integration plugin

Security and Privacy

Most data stays on your machine. SQLite and ChromaDB store everything locally. The exception is the optional Neo4j Aura knowledge graph, which is a cloud service — if you enable it, entity and relationship data is stored on Neo4j's servers.
Private network only. Blocks all public IP access by default (LAN and VPN only).
API keys encrypted at rest using Fernet symmetric encryption (AES-128-CBC).
Rate limiting on authentication endpoints to prevent brute force attacks.
Security headers (HSTS, X-Frame-Options, CSP) on all responses.
Input validation on all user-facing endpoints.

Architecture

Every message flows through a retrieval-augmented pipeline:

User Message
  |
  +-- [INLET] Parallel context retrieval:
  |     +-- Hybrid search (BM25 + vector via ChromaDB)
  |     +-- Autonomous memories (vector similarity, min confidence 0.3)
  |     +-- Notes (text search, top 3 relevant)
  |     +-- RAG documents (vector search, top 3 chunks)
  |     +-- Knowledge graph (Neo4j entity linking, 2-hop traversal)
  |     +-- Web search (Brave API, if triggered)
  |
  +-- [RRF FUSION] Reciprocal Rank Fusion merges and ranks all sources
  |
  +-- [INTERCEPTORS] Addin hooks (e.g. Skill Voyager classifies query, injects skill strategy)
  |
  +-- [LLM] Stream response via Server-Sent Events
  |
  +-- [INTERCEPTORS] Post-LLM hooks (evaluate response, update skill confidence)
  |
  +-- [OUTLET] Background processing after response:
        +-- Memory extraction and conflict resolution
        +-- Entity and relationship extraction to knowledge graph
        +-- Note and email marker extraction
        +-- Retrieval learning (which sources helped most)

Context is injected into the system prompt with a 6000-token budget. Priority order: warnings, memories, graph context, conversation history, web results.

Tech Stack

Layer	Technology
Backend	Python 3.11, FastAPI, uvicorn
Frontend	React 18, TypeScript, Tailwind CSS, Zustand
Database	SQLite via aiosqlite (zero-install, async)
Vector Store	ChromaDB (embedded, file-based)
Knowledge Graph	Neo4j Aura (optional, free tier available)
Search	Brave Search API
Entity Extraction	GLiNER + spaCy for query-time, LLM-based for outlet
Auth	JWT with bcrypt password hashing
MCP Server	37 tools across 3 SQLite databases

Two API Entry Points

POST /messages - Main chat UI endpoint with streaming
POST /v1/chat/completions - OpenAI-compatible API for code agents and external tools

Both run the same retrieval and outlet pipeline.

Configuration

On first start, Engram auto-generates secure JWT and encryption keys. No manual setup needed.

LLM provider API keys are configured through the Settings tab in the app. They are stored encrypted in the database, not in plain text.

Environment Variables

You can set environment variables in docker-compose.yml or mount a .env file. See backend/.env.example for all options.

Local LLM Providers (LM Studio, Ollama)

If you run LM Studio or Ollama on your host machine, use host.docker.internal as the hostname so the Docker container can reach them:

environment:
  - LMSTUDIO_BASE_URL=http://host.docker.internal:1234/v1
  - OLLAMA_BASE_URL=http://host.docker.internal:11434

Optional Services

These are not required but add extra capabilities:

Service	What It Adds	How to Set Up
Neo4j Aura	Knowledge graph with entity relationships	Free tier at neo4j.com/cloud. Add URI and password in Settings.
Brave Search	Primary web search provider (2000 free queries/month)	Free API key at brave.com. Add in Settings.
LM Studio	Run LLMs locally for free (no API key needed)	Download from lmstudio.ai. Start the local server on port 1234.
Ollama	Run LLMs locally for free (no API key needed)	Download from ollama.com. Run `ollama serve` in a terminal.

Without any optional services, Engram still works with any LLM provider you configure. Add a Brave API key for web search.

Data and Privacy

Most data stays on your machine. The following data is sent externally:

LLM API calls to whichever provider you configure (OpenAI, Anthropic, or local). Your messages and conversation context are sent to generate responses.
Web search queries when you explicitly ask via the Brave Search API.
Knowledge graph data if you enable Neo4j Aura (optional cloud service) — entities and relationships extracted from your conversations are stored on Neo4j's servers.

If you use only local LLM providers (LM Studio, Ollama) and skip Neo4j, nothing leaves your machine except web search queries.

All user data lives in a Docker volume (engram-data) mapped to /data inside the container:

/data/
+-- app.db              # SQLite database (users, conversations, messages, settings)
+-- chroma/             # Vector embeddings (memories, messages, documents)
+-- mcp/                # MCP server databases (knowledge, interactions, reasoning)
+-- learning/           # Skill transfer and adaptive retrieval data
+-- uploads/            # Files you upload for RAG
+-- crawl_cache/        # Search validation cache
+-- logs/               # Service logs

To reset everything: docker compose down -v removes the container and all data.

To back up data: docker cp engram:/data ./backup

Disclaimer

This software is provided "as is" without warranty of any kind. By using Engram, you acknowledge and agree that:

You are solely responsible for all content generated by AI models through this software. The authors have no control over LLM outputs and accept no liability for them.
You are solely responsible for any third-party API costs incurred (OpenAI, Anthropic, Brave Search, Neo4j, etc.). Monitor your own API usage and billing.
You are solely responsible for reviewing and verifying any AI-generated content before acting on it, sharing it, or publishing it.
You are solely responsible for securing your own installation, backing up your data, and understanding what data is sent to external services.
The authors and contributors are not liable for any data loss, security incidents, unauthorized access, or damages of any kind arising from the use of this software.
No medical, legal, or financial advice. AI-generated content is not a substitute for professional advice. Do not rely on this software for decisions affecting health, legal matters, or finances.

By using this software, you agree to indemnify and hold harmless the authors and contributors from any claims, damages, or expenses arising from your use. See LICENSE for full legal terms.

Built With

Engram was built entirely with Windsurf, the AI-powered IDE by Codeium. From backend architecture to frontend components, every line of code was pair-programmed with Windsurf's Cascade agent.

License

Non-Commercial Use Only. See LICENSE for full terms.

Free to use, modify, and share for personal and non-commercial purposes. Commercial use requires a separate license.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
backend		backend
docs		docs
frontend		frontend
.dockerignore		.dockerignore
.gitattributes		.gitattributes
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
docker-compose.yml		docker-compose.yml
docker-entrypoint.sh		docker-entrypoint.sh
mcp_config.example.json		mcp_config.example.json
mcp_server.py		mcp_server.py

License

engramsoftware/engram

Folders and files

Latest commit

History

Repository files navigation

Engram

Your Personal AI Assistant With Memory

Screenshots

Install

Quick Start

Stop and Start

Uninstall

Troubleshooting

What Gets Installed

Stage 1: Frontend Build (Node.js)

Stage 2: Backend (Python 3.11)

Core Server

Database and Storage

Search and Retrieval

AI and NLP

Security and Auth

Document Processing

MCP Server

Pre-Downloaded ML Models

First-Run Setup

Features

Multi-Provider LLM Support

Autonomous Memory

Knowledge Graph

Web Search

RAG Documents

Notes

Personas

Skill Voyager (Autonomous Learning)

Plugin System

Security and Privacy

Architecture

Tech Stack

Two API Entry Points

Configuration

Environment Variables

Local LLM Providers (LM Studio, Ollama)

Optional Services

Data and Privacy

Disclaimer

Built With

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages