Short-term working memory + episodic recall for agents — without a vector database.
In neuroscience, an engram is the physical trace a memory leaves in the brain — the biological substrate of a specific experience. When you remember something, you're activating an engram.
Most agents have no engrams. They wake up blank every session, re-ask questions they've already answered, and forget what the user told them 10 minutes ago.
This library fixes that.
```mermaid
flowchart TD
    A[Agent Action / Observation] --> B["ShortTermMemory<br>sliding window, fast dict"]
    B -->|capacity exceeded (compress)| C
    B -->|manual promote| C
    C["EpisodicMemory<br>timestamped events + tags + importance"]
    C -->|recall query + tags| D[Relevant Episodes]
    B -->|to_messages| E["OpenAI-compatible<br>messages list"]
    D --> F["context snapshot<br>injected into prompt"]
    E --> F
```
Two tiers. No vector database. No external services. No configuration.
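The two-tier idea can be sketched in plain Python. This is an illustrative model only, not the library's actual internals; the `Entry` dataclass, field names, and ranking rule here are assumptions:

```python
import time
from collections import deque
from dataclasses import dataclass, field

@dataclass
class Entry:
    content: str
    tags: list = field(default_factory=list)
    importance: float = 0.5
    ts: float = field(default_factory=time.time)

class TwoTierMemory:
    def __init__(self, short_term_size=10):
        # Tier 1: bounded sliding window — deque drops the oldest on overflow
        self.short_term = deque(maxlen=short_term_size)
        # Tier 2: unbounded episodic store of tagged, scored entries
        self.episodic = []

    def observe(self, content):
        self.short_term.append(Entry(content))

    def remember(self, content, tags=(), importance=0.5):
        self.episodic.append(Entry(content, list(tags), importance))

    def recall(self, tags=(), limit=10):
        # Filter by tag overlap, rank by importance — no embeddings needed
        hits = [e for e in self.episodic if set(tags) & set(e.tags)]
        return sorted(hits, key=lambda e: e.importance, reverse=True)[:limit]

m = TwoTierMemory(short_term_size=2)
m.observe("a"); m.observe("b"); m.observe("c")   # "a" evicted
m.remember("prefers diagrams", tags=["preference"], importance=0.9)
print([e.content for e in m.short_term])          # → ['b', 'c']
print(m.recall(tags=["preference"])[0].content)   # → prefers diagrams
```

The point of the sketch: both tiers are ordinary in-process data structures, which is why lookups stay at dictionary speed.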
```shell
git clone https://github.com/darshjme/engram
cd engram && pip install -e .
```

```python
from agent_memory import AgentMemory

memory = AgentMemory(short_term_size=10)

# Session 1 — record everything
memory.observe("User asked about quantum computing")
memory.act("Called search_tool('quantum computing basics')")
memory.observe("Found: superposition, entanglement, qubits")
memory.remember(
    "User prefers visual explanations with diagrams",
    tags=["preference"],
    importance=0.9,
)

# Session 2 — full context restored
hits = memory.recall(tags=["preference"])
# → [MemoryEntry: "User prefers visual explanations..."]

ctx = memory.context()
# {
#   "short_term": [...recent observations...],
#   "episodic": [...important remembered facts...]
# }

# Inject into your LLM prompt
messages = [
    {"role": "system", "content": f"Context: {ctx}"},
    {"role": "user", "content": user_input},
]
```

```mermaid
sequenceDiagram
    participant U as User
    participant A as Agent
    participant M as engram
    U->>A: "Help me with quantum computing"
    A->>M: observe("user asked about QC")
    A->>M: act("searched Wikipedia")
    A->>M: remember("user prefers diagrams", importance=0.9)
    A-->>U: [answers with diagrams]
    Note over U,M: Session ends
    U->>A: "Continue from where we left off"
    A->>M: recall(tags=["preference"])
    M-->>A: ["user prefers visual explanations"]
    A->>M: context()
    M-->>A: {short_term: [...], episodic: [...]}
    A-->>U: [picks up with diagram-first approach]
```
| | engram | Vector DB (Qdrant/Pinecone) |
|---|---|---|
| Best for | Session state, recent context, preferences | Semantic search across 100k+ documents |
| Latency | Microseconds (dict lookup) | 10–100 ms (network + index) |
| Setup | Zero — `pip install -e .` | Database server, embedding model, indexing pipeline |
| Persistence | In-process (extend to add a file/DB backend) | Native |
| Token overhead | ~200 tokens for the context snapshot | ~100 tokens per retrieved chunk |
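Since persistence is in-process, one way to extend it is serializing episodic entries to disk between sessions. A minimal JSON-file sketch — the helper names and entry layout are hypothetical, not part of the library:

```python
import json
from pathlib import Path

def save_episodic(entries, path):
    # Each entry is a plain dict: {"content", "tags", "importance", "ts"}
    Path(path).write_text(json.dumps(entries))

def load_episodic(path):
    # Missing file means a fresh store, not an error
    p = Path(path)
    return json.loads(p.read_text()) if p.exists() else []

# Round-trip: what was remembered in session 1 is available in session 2
session1 = [{"content": "prefers diagrams", "tags": ["preference"],
             "importance": 0.9, "ts": 0}]
save_episodic(session1, "episodic.json")
session2 = load_episodic("episodic.json")
print(session2[0]["content"])  # → prefers diagrams
```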
**Rule of thumb:** if your agent needs to remember what happened 5 minutes ago, use engram. If it needs to search a knowledge base, use a vector DB.
Methods on the short-term buffer (`ShortTermMemory`):

| Method | Returns | Description |
|---|---|---|
| `.add(role, content, **meta)` | `MemoryEntry` | Append an entry (evicts the oldest at capacity) |
| `.get_recent(n=5)` | `list[MemoryEntry]` | The n most recent entries |
| `.to_messages()` | `list[dict]` | OpenAI-style `{role, content}` list |
| `.clear()` | `None` | Wipe the buffer |
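The `.to_messages()` conversion is simple enough to sketch. Assuming entries carry a `role` and `content` (the dict layout below is an illustration, not the library's `MemoryEntry` type):

```python
from collections import deque

def to_messages(entries):
    # Convert buffered entries into the OpenAI-style chat format,
    # oldest first, so the model sees events in order
    return [{"role": e["role"], "content": e["content"]} for e in entries]

buf = deque(maxlen=3)
buf.append({"role": "user", "content": "explain qubits"})
buf.append({"role": "assistant", "content": "a qubit is..."})
print(to_messages(buf))
# → [{'role': 'user', 'content': 'explain qubits'},
#    {'role': 'assistant', 'content': 'a qubit is...'}]
```

Because the buffer is bounded, the output is always a prompt-sized slice of recent history, never the whole session.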
Methods on `AgentMemory`:

| Method | Description |
|---|---|
| `.observe(content, **meta)` | Add an observation to short-term memory |
| `.act(content, **meta)` | Add an action to short-term memory |
| `.remember(content, tags, importance)` | Promote a fact to the episodic store |
| `.recall(query=None, tags=[], limit=10)` | Search episodic memory by tags and importance |
| `.context()` | Full snapshot dict — inject into your prompt |
| `.compress(summarize_fn=None)` | Compress short-term entries into episodic memory |
| `.reset()` | Clear everything |
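To make the `summarize_fn` hook in `.compress()` concrete, here is a standalone sketch of how such a hook might plug in — the function below is an assumption about the pattern, not the library's implementation, and in practice `summarize_fn` would typically be an LLM call:

```python
def compress(short_term, episodic, summarize_fn=None):
    # Collapse the short-term window into one episodic entry, then clear it.
    # summarize_fn lets a model write the summary; the default naively joins.
    if not short_term:
        return None
    text = "; ".join(short_term)
    summary = summarize_fn(text) if summarize_fn else text
    episodic.append({"content": summary, "tags": ["summary"], "importance": 0.5})
    short_term.clear()
    return summary

st = ["user asked about QC", "searched Wikipedia", "found qubits"]
ep = []
compress(st, ep, summarize_fn=lambda t: f"Session recap: {t}")
print(ep[0]["content"])
# → Session recap: user asked about QC; searched Wikipedia; found qubits
print(st)  # → []
```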
verdict · sentinel · herald · engram · arsenal
| Repo | Purpose |
|---|---|
| verdict | Score your agents |
| sentinel | Stop runaway agents |
| herald | Semantic task router |
| engram | ← you are here |
| arsenal | The full pipeline |
MIT © Darshankumar Joshi · Built as part of the Arsenal toolkit.