agent-memory

Zero-config, traceable, MCP-native long-term memory for agents.

agent-memory targets a gap in the current memory stack: a local-first engine that works with pip install, runs on pure SQLite, and makes memory evolution explainable instead of opaque.

Install from PyPI with pip install agent-memory-engine.

Current packaged release: 0.1.1.

Documentation

English docs index: docs/README.md
中文文档索引: docs/zh-CN/README.md
Delivery tutorial: docs/project-delivery-and-tutorial.md
MCP guide: docs/mcp-integration.md
Release guide: docs/release-and-pypi.md
Benchmark report: docs/benchmark-results.md

Why this exists

Mem0 proves demand, but pulls in heavier infra such as Neo4j or Qdrant.
Local agents and personal copilots need a memory layer that is easy to embed, debug, export, and ship.
The project spans a broad technical surface area: storage, retrieval, ranking, decay, provenance, conflict handling, MCP, and evaluation.

Current Status

SQLite backend with WAL, FTS5, audit log, evolution log, entity index, and causal parent links
Schema indexes for type, layer, recency, trust, source, relation, and audit hot paths
Python SDK via MemoryClient
Rule-based intent router with Reciprocal Rank Fusion
Adaptive forgetting utilities with dual-threshold layer transitions
Heuristic conflict detection with contradiction edges and trust-score adjustment
Optional LLM-backed conflict adjudication for top semantic candidates
Governance helpers for health reports, audit reads, and JSONL export/import
Optional MCP server and REST API adapters with dependency-friendly fallbacks
sqlite-vec integration with safe fallback to Python cosine scan when unavailable
Deterministic local fallback embeddings for testability and zero-friction startup
LLM-first conversation extraction with heuristic fallback
Trace graph reports with ancestors, descendants, relations, and evolution history
Idempotent maintenance cycle for decay, promotion/demotion, conflict upkeep, and consolidation
Benchmark helpers and LOCOMO-Lite style starter data

Quickstart

pip install agent-memory-engine

agent-memory store "User prefers SQLite for local-first agents." --source-id demo
agent-memory search "Why SQLite?"
agent-memory health

For development:

pip install -e '.[dev]'
.venv/bin/python -m pytest -q

from agent_memory import MemoryClient

client = MemoryClient()
item = client.add(
    "The user prefers SQLite for local-first agent projects.",
    source_id="demo-session",
)

results = client.search("What database does the user prefer?")
print(results[0].item.content)

trace = client.trace_graph(item.id)
print(trace.descendants)

health = client.health()
print(health.suggestions)

Architecture

graph TD
    A["Python SDK / CLI"] --> B["MemoryClient"]
    C["MCP Server"] --> B
    D["REST API"] --> B
    B --> E["Intent Router"]
    B --> F["Conflict Detector"]
    B --> G["Trust Scorer"]
    B --> H["Forgetting Engine"]
    E --> I["SQLite Backend"]
    F --> I
    G --> I
    H --> I
    I --> J[("SQLite + FTS5 + sqlite-vec")]

Core components

src/agent_memory/client.py — high-level SDK entry point
src/agent_memory/storage/sqlite_backend.py — SQLite persistence, FTS, vector fallback, trace queries
src/agent_memory/controller/router.py — intent-aware retrieval routing and RRF fusion
src/agent_memory/controller/forgetting.py — Ebbinghaus-inspired adaptive forgetting
src/agent_memory/controller/conflict.py — contradiction detection and conflict records
src/agent_memory/controller/consolidation.py — overlap grouping and merge-draft generation
src/agent_memory/controller/trust.py — multi-factor trust scoring
src/agent_memory/governance/health.py — stale/orphan/conflict monitoring
src/agent_memory/interfaces/mcp_server.py — eight MCP tools
src/agent_memory/extraction/pipeline.py — conversation-to-memory extraction
benchmarks/ — storage/retrieval microbenchmarks and synthetic eval seeds

Design choices

SQLite + WAL keeps deployment zero-config while fitting agent workloads: many reads, occasional writes.
Rule routing over LLM routing keeps routing latency predictable and testable.
RRF instead of score averaging avoids calibration problems across lexical, entity, and semantic retrieval.
sqlite-vec plus fallback gives C/SQL vector search when available while keeping the package runnable everywhere.
Soft delete preserves provenance and causal trace integrity.
Hash fallback embeddings make the package runnable even before a local embedding model is available.
Unique relation edges keep maintenance idempotent and health metrics stable.

Project layout

agent-memory/
├── docs/plans/
├── examples/
├── src/agent_memory/
│   ├── controller/
│   ├── embedding/
│   ├── extraction/
│   └── storage/
└── tests/

Benchmarks

Synthetic LOCOMO-Lite run on the bundled starter dataset (30 dialogues / 150 questions):

Metric	`agent-memory`	Semantic-only baseline
Overall hit rate	50.0%	23.3%
Factual recall	53.3%	6.7%
Temporal recall	36.7%	3.3%
Causal recall	53.3%	6.7%
p95 retrieval latency	16.64ms	11.50ms

Full report: docs/benchmark-results.md
Re-run locally: python benchmarks/locomo_lite/evaluate.py

MCP Usage

Install MCP support and launch the stdio server:

pip install -e .[mcp]
python -m agent_memory.interfaces.mcp_server

Claude Desktop configuration:

{
  "mcpServers": {
    "agent-memory": {
      "command": "python",
      "args": ["-m", "agent_memory.interfaces.mcp_server"],
      "env": {
        "AGENT_MEMORY_DB_PATH": "/absolute/path/to/default.db"
      }
    }
  }
}

Typical tools:

memory_store — store a memory with provenance
memory_search — run intent-aware retrieval
memory_trace — inspect causal ancestry and evolution
memory_health — inspect stale/conflict/orphan metrics

More details: docs/mcp-integration.md

Demos

python examples/demo_cross_session.py --db /tmp/agent-memory-demo.db
python examples/interactive_chat.py --db chat_memory.db --provider none
python examples/mcp_server.py

Release Notes

Changelog: CHANGELOG.md
benchmarks/locomo_lite/latest_results.json is regenerated by the evaluation script
docs/screenshots/ is reserved for verified MCP client screenshots
Delivery record and full tutorial: docs/project-delivery-and-tutorial.md
Release and PyPI guide: docs/release-and-pypi.md
Expansion and optimization review: docs/plans/2026-03-24-agent-memory-expansion-review.md

Dev Notes

Run all tests with .venv/bin/python -m pytest -q
Use the built-in CLI with agent-memory --help
sqlite-vec is installed as a package dependency; if the extension cannot be loaded at runtime, vector search safely falls back to Python cosine scan
Try microbenchmarks with python benchmarks/bench_storage.py and python benchmarks/bench_retrieval.py
Try the demo runner with python examples/benchmark_runner.py

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.github/workflows		.github/workflows
benchmarks		benchmarks
docs		docs
examples		examples
src/agent_memory		src/agent_memory
tests		tests
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
LICENSE		LICENSE
README.md		README.md
README.zh-CN.md		README.zh-CN.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

agent-memory

Documentation

Why this exists

Current Status

Quickstart

Architecture

Core components

Design choices

Project layout

Benchmarks

MCP Usage

Demos

Release Notes

Dev Notes

About

Uh oh!

Releases 2

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

agent-memory

Documentation

Why this exists

Current Status

Quickstart

Architecture

Core components

Design choices

Project layout

Benchmarks

MCP Usage

Demos

Release Notes

Dev Notes

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 2

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages