mind — brain-like memory for any coding agent

One Python file. Zero dependencies. Zero API keys. Fully offline. Bilingual (EN + AR).

Your coding agent forgets everything between sessions. mind gives it a memory that works the way yours does: a weighted concept graph that recalls by spreading activation (not flat search), forgets by the Ebbinghaus curve (unused memories fade, reinforced ones harden), and reorganizes itself while you sleep through a deterministic dream cycle — no LLM calls, no token bill, every decision explained in a journal you can read.

It plugs into every agent at once: one memory, exported to AGENTS.md (Codex, Cursor, Zed, ...), CLAUDE.md (Claude Code) and GEMINI.md — and adopted automatically by .cursorrules, .windsurfrules, .clinerules and .roo/rules/mind.md in projects that already use those tools.

curl -O https://raw.githubusercontent.com/Da7-Tech/mind/main/mind.py
python3 mind.py init
python3 mind.py remember "the project database is postgres 16"
python3 mind.py recall "which database do we use"
# recall for "which database do we use" — 1 results [0.20 ms]
#   1. [0.033] (direct) the project database is postgres 16
python3 mind.py dream        # between sessions: forget, consolidate, promote

That's the whole install. No server, no vector store, no embedding model, no configuration file.

Measured, not vibes

Every number below comes from python3 bench/bench.py — rerun it yourself (Python 3.14 on Apple M-series — latencies are environment-dependent, rerun for your hardware; 20 bilingual queries with known answers against distractor-filled graphs):

graph size	recall@1	recall@5	median latency	p95
100 nodes	0.95	0.95	~0.5–0.7 ms	~2–4 ms
1,000 nodes	0.95	0.95	~2–3 ms	~11–16 ms

Dream determinism: PASS — identical memory state always produces the identical consolidation plan.

180-day soak (bench/soak.py — the real code driven through an injected clock with a realistic workload: daily/weekly/monthly facts + 357 junk notes

a dream every night): core-fact survival 15/15 across all cadence tiers, junk older than the grace window surviving: 0/256, graph size bounded (~106 nodes), recall on the aged graph 0.37 ms. The soak caught two real calibration bugs before release (facts pruned one day before their first monthly recall; decayed weight vetoing exact matches) — both fixed with regression tests. The one benchmark miss ("what css framework" → tailwind) is an honest limitation documented below, not hidden.

Test suite: 88 tests, stdlib unittest, python3 -m unittest discover -s tests — including regression tests for concurrency (parallel writers must not lose each other's memories), destructive-op gating, and corrupt-graph recovery.

How it works — three layers, like a brain

Layer 1  WORKING MEMORY   .mind/ACTIVE.md  → injected into agent rule files
         the ~200-300 tokens the agent always sees: hottest memories + cortex index

Layer 2  HIPPOCAMPUS      .mind/graph.json → weighted concept graph
         recall = spreading activation (≤3 hops) fused with direct keyword
         matches via Reciprocal Rank Fusion + IDF, re-ranked by offline
         hash embeddings; near-duplicate results are diversified (pattern
         separation); fuzzy fallback finds memories from partial cues
         (pattern completion)

Layer 3  CORTEX           .mind/cortex/*.md → consolidated durable knowledge
         fed by the dreamer when a cluster of related memories recurs

DREAMER  between sessions  python3 mind.py dream [--dry-run]
         light sleep  count + clear session signals (telemetry, reported in the journal)
         deep sleep   Ebbinghaus decay  R = e^(−t/S)  — stability S grows
                      with each confirmed recall; weak unused nodes pruned;
                      weak edges pruned (synaptic pruning)
         REM          cluster related memories → promote recurring themes
                      to cortex; flag contradictions (never auto-delete)

Wrong memory? Reconsolidation is built in:

python3 mind.py correct "database mysql" "the database is postgres 16"
# old text kept in node history; confidence lowered until re-confirmed

Why not just use ___?

	spreading-activation recall	sleep consolidation	works with any agent	zero setup / zero keys	consolidation costs $0 (no LLM)
mem0	✗	✗	✗ (SDK)	✗ OSS self-host or cloud, but needs LLM + embedder keys	✗
Letta (MemGPT)	✗	✓ sleep-time compute	✗ (own server)	✗ full platform	✗ burns tokens
Zep / Graphiti	~ graph traversal	✗	✗	✗ Neo4j + LLM keys	✗
HippoRAG 2	✓ PageRank	✗	✗ (batch RAG lib)	✗ GPU/API	✗
OpenClaw dreams	✗	✓	✗ (OpenClaw only)	✓ inside OpenClaw	✗ burns tokens
claude-mem	✗	✗ compression	~ several agents	✗ Bun + worker + Chroma	✗
mind	✓	✓ deterministic	✓ AGENTS/CLAUDE/GEMINI	✓ one file	✓

Honest note: Brain Memory is the closest project in spirit (files + decay + sleep phases) — credit where due. mind differs in being a single copy-able file, bilingual EN/AR at the tokenizer level, fully deterministic in consolidation, and shipping with a reproducible benchmark instead of claims.

Commands

command	what it does
`init`	create `.mind/` + export agent files
`remember "text"`	add a memory node
`link "a" "b" [rel]`	connect two memories (weighted edge)
`recall "question"`	spreading-activation recall (prints memory ids)
`confirm <id> [...]`	reinforce memories that actually answered you
`correct "old" "new"`	reconsolidate a wrong memory (history kept)
`dream [--dry-run]`	run the sleep cycle; journal in `.mind/dreams/`
`export`	regenerate agent rule files
`status`	health report

Reinforcement is explicit: recall is a pure read (repeated queries can't skew weights); when a recalled memory actually answers the question, the agent runs confirm <id> — that hardens the memory (+2 weeks stability) and restrengthens its edges. The exported agent instructions teach this loop, and every dream weakens all edges slightly (synaptic homeostasis), so connections that never earn a confirmation decay and prune away.

Safety properties

Atomic, durable, symlink-refusing writes everywhere — O_NOFOLLOW + fsync-before-rename (survives power loss), the lock file itself is opened symlink-safe, and every internal write also rejects a symlinked parent directory so nothing can escape the .mind/ boundary
Never silently destroys data: corrupt graphs are quarantined, not erased; memories pruned by decay are archived to .mind/archive.md — and if the archive cannot be written, nothing is pruned at all; user content in AGENTS.md/CLAUDE.md is preserved outside guard markers
dream --dry-run previews the full plan without touching disk
File-locked saves — safe under concurrent agent processes
Memory files are plain JSON + Markdown: git diff them, sync them, read them

Honest limitations

Recall is lexical + graph-structural. Cross-domain synonymy with no corpus evidence (e.g. "css" → a memory that only says "tailwind") is missed — that's the benchmark's one failing query. True embeddings would fix it at the cost of the zero-dependency promise; a pluggable backend is on the roadmap.
Arabic stemming is light (prefix/suffix + broken-plural seed), not a full morphological analyzer.
Optimized for personal/project agent memory (10²–10³ nodes), not enterprise RAG over millions of documents — use a real graph DB for that.
Tokens shorter than 3 characters (db, ai, os) are not indexed — write them out once ("database", "openai") and the co-occurrence index bridges the rest.
A fact recalled fewer than twice and untouched for longer than the 45-day grace window decays out of the graph (into the archive). Facts you need less often than ~every six weeks should live in cortex notes, not the hippocampus — that's the brain deal: use it or archive it.

Using with Hermes, Claude Code, Codex, Gemini CLI...

mind init writes the working memory into AGENTS.md, CLAUDE.md and GEMINI.md with guard markers, preserving your existing content. If the project already has .cursorrules, .windsurfrules, .clinerules or a .roo/ directory, those rule files are kept in sync too — adopted, never imposed on projects that don't use them. Any agent that reads those files gets the memory and the instructions to use it — nothing else to configure. A ready-made Hermes skill lives in SKILL.md.

Development

python3 -m unittest discover -s tests   # 88 tests
python3 bench/bench.py                  # reproduce the numbers above

Design rationale: docs/DESIGN.md · Arabic README: README.ar.md · License: MIT

Contributing

Issues and PRs welcome — the roadmap issues are scoped and ready to pick up. Ground rules: keep mind.py a single stdlib-only file, every change needs a test, and claims need measurements (bench/bench.py must stay green). Questions → Discussions.

If mind remembers something useful for you, a ⭐ helps other agents' humans find it.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
.github/workflows		.github/workflows
assets		assets
bench		bench
docs		docs
tests		tests
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.ar.md		README.ar.md
README.md		README.md
SECURITY.md		SECURITY.md
SKILL.md		SKILL.md
mind.py		mind.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

mind — brain-like memory for any coding agent

Measured, not vibes

How it works — three layers, like a brain

Why not just use ___?

Commands

Safety properties

Honest limitations

Using with Hermes, Claude Code, Codex, Gemini CLI...

Development

Contributing

About

Uh oh!

Releases 6

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

mind — brain-like memory for any coding agent

Measured, not vibes

How it works — three layers, like a brain

Why not just use ___?

Commands

Safety properties

Honest limitations

Using with Hermes, Claude Code, Codex, Gemini CLI...

Development

Contributing

About

Topics

Resources

License

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 6

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages