CraniMem: Cranial-Inspired Gated and Bounded Memory for Agentic Systems

This repository contains the implementation of CraniMem, a neurocognitively inspired memory framework for long-horizon LLM agents. CraniMem enables stable, goal-aware memory across extended interactions by combining selective encoding, bounded episodic storage, structured long-term knowledge graphs, and replay-based consolidation.

Overview

Modern LLM agents require persistent memory beyond a single context window. Conventional approaches often treat memory as an external database with heuristic read and write rules, which can lead to unstable retention, noise accumulation, weak consolidation, and poor long-term consistency.

CraniMem models memory as a gated, multi-stage cognitive process inspired by biological systems. Incoming information is filtered for relevance, stored in a bounded episodic buffer for short-term continuity, selectively consolidated into a structured knowledge graph for durable recall, and retrieved through a dual-path mechanism that combines short-term and long-term context.

Key Features

Goal-conditioned input gating to filter irrelevant information
Utility tagging based on importance, surprise, and emotional salience
Bounded episodic buffer for recent interaction traces
Structured long-term semantic memory using a knowledge graph
Replay-based consolidation with pruning of low-utility items
Dual-path retrieval from episodic and semantic stores
Robustness to distractor noise in long interactions

Repository Structure

CraniMem/
├─ pyproject.toml          # Package metadata and build configuration
├─ README.md               # Documentation
├─ example.env             # Environment variable template
├─ cranimem/
│  ├─ __init__.py
│  ├─ config.py            # Settings and environment configuration
│  ├─ agent/
│  │  ├─ workflow.py       # LangGraph workflow assembly
│  │  ├─ nodes.py          # Core processing nodes (gating, retrieval, reasoning, memory, consolidation)
│  │  └─ state.py          # Agent state schema and episodic buffer logic
│  ├─ core/
│  │  ├─ embedding.py      # Embedding model loader
│  │  └─ graph_store.py    # Neo4j-backed semantic memory store
│  ├─ cognitive/
│  │  ├─ prompts.py        # Prompt templates
│  │  ├─ gating.py         # Two-stage gating mechanism
│  │  └─ memory_ops.py     # Memory maintenance utilities
│  └─ utils/
│     ├─ llm_factory.py    # LLM provider factory and caching
│     └─ json_utils.py     # JSON parsing utilities
├─ cranimem.egg-info/      # Build metadata
└─ dist/                   # Built distribution artifacts

Installation

Install from PyPI:

pip install cranimem

Or install locally for development:

pip install -e .

Configuration

Create a .env file based on example.env.

Example configuration:

LLM_PROVIDER=openai
OPENAI_API_KEY=your_api_key

NEO4J_URI=bolt://localhost:7687
NEO4J_USER=neo4j
NEO4J_PASSWORD=password

CraniMem uses environment variables for:

LLM provider configuration
Embedding model settings
Neo4j connection parameters
Memory thresholds and limits

Quick Start

from cranimem.agent.workflow import build_agent

agent = build_agent()

response = agent.run("Plan a trip to Japan next month.")
print(response)

CraniMem will automatically use configured providers and storage backends.

Architecture (High Level)

CraniMem implements a gated, bounded, multi-stage memory pipeline:

Collection Phase Incoming inputs are filtered by relevance and assigned utility scores.
Short-Term Storage Accepted items are stored in a bounded episodic buffer.
Consolidation Phase High-utility traces are replayed into long-term memory while low-utility items are discarded.
Long-Term Storage Durable information is stored in a structured knowledge graph.
Retrieval Phase Context is assembled from both episodic and semantic memory to condition the LLM.

Core Components

Episodic Buffer Stores recent interaction traces with full fidelity for short-term continuity. The buffer is bounded to prevent unbounded growth.

Knowledge Graph Store Maintains durable semantic information as structured entities and relationships, enabling multi-hop retrieval.

Gating Module Filters inputs based on relevance to the current goal to prevent noise from entering memory.

Consolidation Engine Periodically transfers high-utility experiences from the episodic buffer to long-term storage and prunes low-value items.

Dual-Path Retrieval Combines short-range context from the episodic buffer with long-range knowledge from the graph.

Use Cases

CraniMem is suitable for applications requiring persistent, coherent memory:

Long-running conversational agents
Research and analysis assistants
Planning and workflow agents
Multi-session copilots
Personalized systems
Multi-agent coordination

Limitations

Higher latency than simple retrieval systems
Additional infrastructure requirements (e.g., graph database)
Performance depends on gating and utility scoring quality
Increased computational cost during consolidation

Citation

If you find this work useful in your research or projects, please consider citing:

@article{mody2026cranimem,
  title={CraniMem: Cranial Inspired Gated and Bounded Memory for Agentic Systems},
  author={Mody, Pearl and Panchal, Mihir and Kar, Rishit and Bhowmick, Kiran and Karani, Ruhina},
  journal={arXiv preprint arXiv:2603.15642},
  year={2026}
}

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
cranimem		cranimem
eval		eval
.DS_Store		.DS_Store
.gitignore		.gitignore
MANIFEST.in		MANIFEST.in
README.md		README.md
example.env		example.env
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CraniMem: Cranial-Inspired Gated and Bounded Memory for Agentic Systems

Overview

Key Features

Repository Structure

Installation

Configuration

Quick Start

Architecture (High Level)

Core Components

Use Cases

Limitations

Citation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

CraniMem: Cranial-Inspired Gated and Bounded Memory for Agentic Systems

Overview

Key Features

Repository Structure

Installation

Configuration

Quick Start

Architecture (High Level)

Core Components

Use Cases

Limitations

Citation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages