Alaya

A memory engine for AI agents that remembers, forgets, and learns.

Alaya (Sanskrit: alaya-vijnana, "storehouse consciousness") is an embeddable Rust library. One SQLite file. No external services. Your agent stores conversations, retrieves what matters, and lets the rest fade. The graph reshapes through use, like biological memory.

let store = AlayaStore::open("memory.db")?;
store.store_episode(&episode)?;           // store
let results = store.query(&query)?;       // retrieve
store.consolidate(&provider)?;            // distill knowledge
store.forget()?;                          // decay what's stale

The Problem

Most AI agents treat memory as flat files. OpenClaw writes to MEMORY.md. Claudesidian writes to Obsidian. Hand-rolled systems write to JSON or Markdown. It works at first.

Then the files grow. Context windows fill. The agent dumps everything into the prompt and hopes the LLM finds what matters.

The cost is measurable. OpenClaw injects ~35,600 tokens of workspace files into every message, 93.5% of which is irrelevant (#9157). Heavy users report $3,600/month in token costs. Community tools like QMD and memsearch cut 70-96% of that waste by replacing full-context injection with ranked retrieval (Levine, 2026).

The structure problem compounds the cost. MEMORY.md conflates decisions, preferences, and knowledge into one unstructured blob. Users independently invent decision.md files, working-context.md snapshots, and 12-layer memory architectures to compensate. Monday you mention "Alice manages the auth team." Wednesday you ask "who handles auth permissions?" The agent retrieves both memories by text similarity but cannot connect them (Chawla, 2026).

How Alaya Solves It

Problem	File-based memory	Alaya
Token waste	Full-context injection (~35K tokens/message)	Ranked retrieval returns only top-k relevant memories
No structure	Everything in one file (users invent `decision.md` workarounds)	Three typed stores: episodes, knowledge, preferences
No forgetting	Files grow until you manually curate	Bjork dual-strength decay: weak memories fade, strong ones persist
No associations	Flat files, no links between memories	Hebbian graph strengthens through co-retrieval; spreading activation finds indirect connections
Brittle preferences	Agent-authored summary, easily drifts	Preferences emerge from accumulated impressions, crystallize at threshold
LLM required	Can't function without one	Optional. No embeddings? BM25-only. No LLM? Episodes accumulate. Every feature works independently

Getting Started

MCP Server (recommended for agents)

The fastest way to add Alaya memory to any MCP-compatible agent (Claude Desktop, OpenClaw, Cline, etc.):

# Build the MCP server
git clone https://github.com/SecurityRonin/alaya.git
cd alaya
cargo build --release --features mcp

Add to your agent's MCP config (e.g. claude_desktop_config.json):

{
  "mcpServers": {
    "alaya": {
      "command": "/path/to/alaya/target/release/alaya-mcp"
    }
  }
}

That's it. Your agent now has 7 memory tools:

Tool	What it does
`remember`	Store a conversation message
`recall`	Search memory with hybrid retrieval
`status`	Get memory statistics
`preferences`	Get learned user preferences
`knowledge`	Get distilled semantic facts
`maintain`	Run memory cleanup (dedup, decay)
`purge`	Delete memories by session, age, or all

Data is stored in ~/.alaya/memory.db (override with ALAYA_DB env var). Single SQLite file, no external services.

Example interaction — what your agent sees when using Alaya:

Agent: [calls remember(content="User prefers dark mode", role="user", session_id="s1")]
Alaya: Stored episode 1 in session 's1'

Agent: [calls recall(query="user preferences")]
Alaya: Found 1 memories:
  1. [user] (score: 0.847) User prefers dark mode

Agent: [calls status()]
Alaya: Memory Status:
  Episodes: 1
  Semantic nodes: 0
  Preferences: 0

Environment variables:

Variable	Default	Description
`ALAYA_DB`	`~/.alaya/memory.db`	Path to SQLite database

Rust Library

For embedding Alaya directly into a Rust application:

[dependencies]
alaya = { git = "https://github.com/SecurityRonin/alaya" }

Quick Start (Rust)

use alaya::{AlayaStore, NewEpisode, Role, EpisodeContext, Query, NoOpProvider};

// Open a persistent database (or use open_in_memory() for tests)
let store = AlayaStore::open("memory.db")?;

// Store a conversation episode
store.store_episode(&NewEpisode {
    content: "I've been learning Rust for about six months now".into(),
    role: Role::User,
    session_id: "session-1".into(),
    timestamp: 1740000000,
    context: EpisodeContext::default(),
    embedding: None, // pass Some(vec![...]) if you have embeddings
})?;

// Query with hybrid retrieval (BM25 + vector + graph + RRF)
let results = store.query(&Query::simple("Rust experience"))?;
for mem in &results {
    println!("[{:.2}] {}", mem.score, mem.content);
}

// Get crystallized preferences
let prefs = store.preferences(Some("communication_style"))?;

// Run lifecycle (NoOpProvider works without an LLM)
store.consolidate(&NoOpProvider)?;
store.transform()?;
store.forget()?;

Run the Demo

The demo walks through all six capabilities with annotated output and no external dependencies:

git clone https://github.com/SecurityRonin/alaya.git
cd alaya
cargo run --example demo

Architecture

Alaya is a library, not a framework. Your agent owns the conversation loop, the LLM, and the embedding model. Alaya owns memory.

Your Agent                          Alaya
─────────                           ─────

Via MCP (stdio):                    alaya-mcp binary
  remember(content, role, session)    ──▶ episodic store + graph links
  recall(query)                       ──▶ BM25 + vector + graph → RRF → rerank
  preferences(domain?)                ──▶ crystallized behavioral patterns
  knowledge(type?, confidence?)       ──▶ consolidated semantic nodes
  maintain()                          ──▶ dedup + decay
  purge(scope)                        ──▶ selective or full deletion

Via Rust library:                   AlayaStore struct
  store_episode()                     ──▶ episodic store + graph links
  query()                            ──▶ BM25 + vector + graph → RRF → rerank
  preferences()                      ──▶ crystallized behavioral patterns
  knowledge()                        ──▶ consolidated semantic nodes
  consolidate(provider)              ──▶ episodes → semantic knowledge
  perfume(interaction, provider)     ──▶ impressions → preferences
  transform()                        ──▶ dedup, prune, decay
  forget()                           ──▶ Bjork strength decay + archival

Three Stores

Store	Analog	Purpose
Episodic	Hippocampus	Raw conversation events with full context
Semantic	Neocortex	Distilled knowledge extracted through consolidation
Implicit	Alaya-vijnana	Preferences and habits that emerge through perfuming

Retrieval Pipeline

flowchart LR
    Q[Query] --> BM25[BM25 / FTS5]
    Q --> VEC[Vector / Cosine]
    Q --> GR[Graph Neighbors]

    BM25 --> RRF[Reciprocal Rank Fusion]
    VEC --> RRF
    GR --> RRF

    RRF --> RR[Context-Weighted Reranking]
    RR --> SA[Spreading Activation]
    SA --> RIF[Retrieval-Induced Forgetting]
    RIF --> OUT[Top 3-5 Results]

Lifecycle Processes

Process	Inspiration	What it does
Consolidation	CLS theory (McClelland et al.)	Distills episodes into semantic knowledge
Perfuming	Vasana (Yogacara Buddhist psychology)	Accumulates impressions, crystallizes preferences
Transformation	Asraya-paravrtti	Deduplicates, resolves contradictions, prunes
Forgetting	Bjork & Bjork (1992)	Decays retrieval strength, archives weak nodes

Integration Guide

Implementing ConsolidationProvider

The ConsolidationProvider trait connects Alaya to your LLM for knowledge extraction:

use alaya::*;

struct MyProvider { /* your LLM client */ }

impl ConsolidationProvider for MyProvider {
    fn extract_knowledge(&self, episodes: &[Episode]) -> Result<Vec<NewSemanticNode>> {
        // Ask your LLM: "What facts/relationships can you extract?"
        todo!()
    }

    fn extract_impressions(&self, interaction: &Interaction) -> Result<Vec<NewImpression>> {
        // Ask your LLM: "What behavioral signals does this contain?"
        todo!()
    }

    fn detect_contradiction(&self, a: &SemanticNode, b: &SemanticNode) -> Result<bool> {
        // Ask your LLM: "Do these two facts contradict each other?"
        todo!()
    }
}

Use NoOpProvider without an LLM. Episodes accumulate and BM25 retrieval works without consolidation.

Lifecycle Scheduling

Method	When to call	What it does
`consolidate()`	After accumulating 10+ episodes	Extracts semantic knowledge from episodes
`perfume()`	On every user interaction	Extracts behavioral impressions, crystallizes preferences
`transform()`	Daily or weekly	Deduplicates, prunes weak links, decays stale preferences
`forget()`	Daily or weekly	Decays retrieval strength, archives truly forgotten nodes

API Reference

impl AlayaStore {
    // Open / create
    pub fn open(path: impl AsRef<Path>) -> Result<Self>;
    pub fn open_in_memory() -> Result<Self>;

    // Write
    pub fn store_episode(&self, episode: &NewEpisode) -> Result<EpisodeId>;

    // Read
    pub fn query(&self, q: &Query) -> Result<Vec<ScoredMemory>>;
    pub fn preferences(&self, domain: Option<&str>) -> Result<Vec<Preference>>;
    pub fn knowledge(&self, filter: Option<KnowledgeFilter>) -> Result<Vec<SemanticNode>>;
    pub fn neighbors(&self, node: NodeRef, depth: u32) -> Result<Vec<(NodeRef, f32)>>;

    // Lifecycle
    pub fn consolidate(&self, provider: &dyn ConsolidationProvider) -> Result<ConsolidationReport>;
    pub fn perfume(&self, interaction: &Interaction, provider: &dyn ConsolidationProvider) -> Result<PerfumingReport>;
    pub fn transform(&self) -> Result<TransformationReport>;
    pub fn forget(&self) -> Result<ForgettingReport>;

    // Admin
    pub fn status(&self) -> Result<MemoryStatus>;
    pub fn purge(&self, filter: PurgeFilter) -> Result<PurgeReport>;
}

Design Principles

Memory is a process, not a database. Every retrieval changes what is remembered. The graph reshapes through use.
Forgetting is a feature. Strategic decay and suppression improve retrieval quality over time.
Preferences emerge, they are not declared. Behavioral patterns crystallize from accumulated observations.
The agent owns identity. Alaya stores seeds. The agent decides which seeds matter and how to present them.
Graceful degradation. No embeddings? BM25-only. No LLM? Episodes accumulate. Every feature works independently.

Research Foundations

Architecture grounded in neuroscience, Buddhist psychology, and information retrieval. For detailed mappings, see docs/theoretical-foundations.md.

Neuroscience: Hebbian LTP/LTD (Hebb 1949, Bliss & Lomo 1973), Complementary Learning Systems (McClelland et al. 1995), spreading activation (Collins & Loftus 1975), encoding specificity (Tulving & Thomson 1973), dual-strength forgetting (Bjork & Bjork 1992), retrieval-induced forgetting (Anderson et al. 1994), working memory limits (Cowan 2001).

Yogacara Buddhist Psychology: Alaya-vijnana (storehouse consciousness), bija (seeds), vasana (perfuming), asraya-paravrtti (transformation), vijnaptimatrata (perspective-relative memory).

Information Retrieval: Reciprocal Rank Fusion (Cormack et al. 2009), BM25 via FTS5, cosine similarity vector search.

Comparison with Alternatives

graph LR
    AGENT["AI Agent"]

    subgraph SIMPLE["Simple"]
        FILE["File-Based<br/><i>MEMORY.md<br/>OpenClaw</i>"]
    end

    subgraph INTEGRATED["Integrated"]
        FW["Framework Memory<br/><i>LangChain · CrewAI<br/>Letta</i>"]
        CODE["Coding Agent<br/><i>Beads · Engram<br/>via MCP</i>"]
    end

    subgraph ENGINES["Memory Engines"]
        DED["Dedicated Systems<br/><i><b>Alaya</b> · Vestige<br/>mem0 · Zep</i>"]
    end

    subgraph INFRA["Infrastructure"]
        VDB["Vector DBs<br/><i>Pinecone · Chroma<br/>Weaviate</i>"]
    end

    RESEARCH["Research<br/><i>Generative Agents<br/>SYNAPSE · HippoRAG</i>"]

    AGENT <--> FILE
    AGENT <--> FW
    AGENT <--> CODE
    AGENT <--> DED
    DED -.->|storage| VDB
    FW -.->|storage| VDB
    RESEARCH -.->|ideas| DED
    RESEARCH -.->|ideas| FW

Alaya is a dedicated memory engine with lifecycle management, hybrid retrieval, and graph dynamics. Closest peers: Vestige (Rust, FSRS-6, spreading activation) and SYNAPSE (unified episodic-semantic graph, lateral inhibition).

Why Alaya over...

Alternative	What it does well	What Alaya adds
MEMORY.md	Zero setup	Ranked retrieval (not full-context injection), typed stores, automatic decay
mem0	Managed cloud memory with auto-extraction	Local-only (single SQLite file), no API keys, Hebbian graph dynamics
Zep	Production-ready with cloud/self-hosted options	No external services, association graph, preference crystallization
Vestige	Rust, FSRS-6 spaced repetition	Three-store architecture, Hebbian co-retrieval, spreading activation
LangChain Memory	Framework-integrated, many backends	Framework-agnostic, lifecycle management, works without an LLM

Full comparison: 90+ systems, grounded in the CoALA taxonomy (Sumers et al., 2024)
Interactive landscape (D3.js force-directed graph)
Theoretical foundations (neuroscience and Buddhist psychology)
The MEMORY.md problem (community workarounds and how Alaya addresses each)

Development

# Run all library tests
cargo test

# Run MCP integration tests
cargo test --features mcp

# Build the MCP server
cargo build --release --features mcp

# Run the demo (no external dependencies)
cargo run --example demo

Support

If Alaya is useful to you, consider supporting development:

Star the repo if you find it useful — it helps others discover Alaya.

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 80 Commits
.github/workflows		.github/workflows
bench		bench
docs		docs
examples		examples
north-star-advisor		north-star-advisor
paper		paper
src		src
tests		tests
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
Cargo.toml		Cargo.toml
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
smithery.yaml		smithery.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Alaya

The Problem

How Alaya Solves It

Getting Started

MCP Server (recommended for agents)

Rust Library

Quick Start (Rust)

Run the Demo

Architecture

Three Stores

Retrieval Pipeline

Lifecycle Processes

Integration Guide

Implementing ConsolidationProvider

Lifecycle Scheduling

API Reference

Design Principles

Research Foundations

Comparison with Alternatives

Why Alaya over...

Development

Support

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

License

SecurityRonin/alaya

Folders and files

Latest commit

History

Repository files navigation

Alaya

The Problem

How Alaya Solves It

Getting Started

MCP Server (recommended for agents)

Rust Library

Quick Start (Rust)

Run the Demo

Architecture

Three Stores

Retrieval Pipeline

Lifecycle Processes

Integration Guide

Implementing ConsolidationProvider

Lifecycle Scheduling

API Reference

Design Principles

Research Foundations

Comparison with Alternatives

Why Alaya over...

Development

Support

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages