design: hippocampal memory system for infinite-context and cross-session recall #3234

cy2311 · 2026-06-09T05:36:32Z

Problem

CodeWhale currently has a 1M-token context window, but beyond that there is no real memory system. The current mechanisms are:

/compact: manual compression of early turns into a natural-language summary
note tool: agents can persist key-value facts
Session persistence (SQLite): stores raw transcripts on disk
Ctrl+R session picker: manually switch between sessions

These are not a memory system. They are flat storage: no indexing, no cross-session retrieval, no consolidation, no active forgetting. When the user starts a new session, the AI remembers nothing unless explicitly told.

A True Hippocampal Memory System for AI

Biological hippocampal memory does four things that current AI context management does not:

1. Binding / Indexing

When an AI performs related actions (edits dispatch.rs, adds started_at to subagent error, opens PR #2933), these facts should be cross-indexed as a graph, not stored as independent text fragments:

edit dispatch.rs ── partOf ── PR #2933 ── fixes ── issue #2657
                   │                       │
                   │                       └─ alsoContains ── yolo.md edit
                   │
                   └─ reason ── format_tool_error generic suffixes mislead the agent

This enables pattern completion: given the fragment "tool error message issue", the system reconstructs the full graph — format_tool_error → dispatch.rs → PR #2933 → linked yolo.md and subagent changes.
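
As a rough illustration of the binding idea, here is a minimal sketch that stores the graph above as (source, relation, target) rows in SQLite. The `edges` table and the `bind` / `neighbors` helpers are hypothetical names chosen for this example, and the edge list is one reading of the diagram, not the schema used by any existing code.

```python
import sqlite3

# Hypothetical schema: one row per directed (source, relation, target) edge.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE edges (src TEXT, rel TEXT, dst TEXT, UNIQUE(src, rel, dst))"
)

def bind(src: str, rel: str, dst: str) -> None:
    """Record a relationship between two entities; repeated calls are no-ops."""
    conn.execute("INSERT OR IGNORE INTO edges VALUES (?, ?, ?)", (src, rel, dst))

def neighbors(entity: str) -> set:
    """Return entities one hop away, following edges in either direction."""
    rows = conn.execute(
        "SELECT dst FROM edges WHERE src = ? "
        "UNION SELECT src FROM edges WHERE dst = ?",
        (entity, entity),
    )
    return {row[0] for row in rows}

# The example graph from this post.
bind("edit dispatch.rs", "partOf", "PR #2933")
bind("PR #2933", "fixes", "issue #2657")
bind("PR #2933", "alsoContains", "yolo.md edit")
bind("edit dispatch.rs", "reason",
     "format_tool_error generic suffixes mislead the agent")

print(sorted(neighbors("PR #2933")))
```

Because every fact is an edge rather than a free-text fragment, any entity can act as an entry point into the rest of the graph.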

2. Pattern Completion

A true memory system doesn't do literal full-text search. It takes a partial cue and reconstructs the full context:

Cue: "that tool error message issue..."
Completion: format_tool_error → dispatch.rs → the generic suffix removal fix → plus related changes in the same PR

This is fundamentally different from keyword search or vector similarity. It requires a structured index that models relationships, not just proximity.
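
One possible shape for this, sketched under simplifying assumptions: seed the search on entities that share a word with the cue, then expand outward through the relationship graph for a bounded number of hops. The `complete` function and its `max_hops` parameter are hypothetical, and the word-overlap seeding is only a placeholder for whatever cue matching a real system would use; the point of the sketch is the graph expansion step.

```python
import re
from collections import deque

# Undirected view of the example graph from this post.
EDGES = [
    ("format_tool_error", "dispatch.rs"),
    ("dispatch.rs", "PR #2933"),
    ("PR #2933", "issue #2657"),
    ("PR #2933", "yolo.md edit"),
]

def build_graph(edges):
    graph = {}
    for a, b in edges:
        graph.setdefault(a, set()).add(b)
        graph.setdefault(b, set()).add(a)
    return graph

def tokens(text):
    """Lowercase alphanumeric words, so format_tool_error -> format, tool, error."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))

def complete(cue, graph, max_hops=2):
    """Map each reachable entity to its hop distance from the nearest seed."""
    cue_words = tokens(cue)
    seeds = [entity for entity in graph if cue_words & tokens(entity)]
    distance = {seed: 0 for seed in seeds}
    queue = deque(seeds)
    while queue:
        node = queue.popleft()
        if distance[node] == max_hops:
            continue  # do not expand past the hop budget
        for nxt in graph[node]:
            if nxt not in distance:
                distance[nxt] = distance[node] + 1
                queue.append(nxt)
    return distance

graph = build_graph(EDGES)
result = complete("tool error message issue", graph)
```

Here the cue never mentions dispatch.rs or the yolo.md edit, yet both are recovered because they are linked to entities the cue does touch. The hop budget is one way to bound how much context gets pulled in.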

3. Consolidation (Offline Processing)

The hippocampus replays experiences during idle periods (sleep) and transfers important patterns to the cortex. For AI:

During idle time (between user messages, or a background task), scan recent conversation turns
Extract structured decisions: what files were changed, what architecture decisions were made, what approaches were tried and discarded
Discard ephemeral noise: specific error messages that were resolved, intermediate debug output
Commit the extracted structure to long-term storage
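
The steps above could be sketched as a single idle-time pass. Everything here is an assumption made for illustration: the `Turn` and `Decision` types, the `consolidate` function, and especially `toy_extract`, which stands in for a model call with a trivial prefix rule so the example runs on its own.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List

@dataclass
class Turn:
    role: str
    text: str

@dataclass
class Decision:
    kind: str      # e.g. "file_change", "architecture", "discarded_approach"
    summary: str

def consolidate(
    turns: Iterable[Turn],
    extract: Callable[[Turn], List[Decision]],
    store: List[Decision],
) -> int:
    """Scan turns, commit extracted decisions to the store, drop the rest.

    Returns the number of newly committed decisions.
    """
    added = 0
    for turn in turns:
        for decision in extract(turn):
            if decision not in store:   # replaying the same turns is harmless
                store.append(decision)
                added += 1
    return added

def toy_extract(turn: Turn) -> List[Decision]:
    """Placeholder extractor; a real system would ask the model instead."""
    prefix = "DECISION:"
    if turn.text.startswith(prefix):
        return [Decision("architecture", turn.text[len(prefix):].strip())]
    return []

turns = [
    Turn("assistant", "error: mismatched types (fixed in the next turn)"),
    Turn("assistant", "DECISION: remove generic suffixes from format_tool_error"),
]
long_term: List[Decision] = []
first_pass = consolidate(turns, toy_extract, long_term)
second_pass = consolidate(turns, toy_extract, long_term)
```

Only the decision reaches long-term storage; the resolved error message is never committed, and a second pass over the same turns adds nothing.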

4. Active Forgetting

Forgetting here does not mean "ran out of space." The system actively judges what is worth keeping:

Yesterday's lunch → not important → discarded
"The stove is hot, don't touch it" → important → consolidated
A specific compiler error that was fixed → transitional → discarded after fix is applied
The architecture decision that led to the fix approach → valuable → kept

This judgment should be model-driven (the AI decides what matters), not rule-based.
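
A minimal sketch of how such a policy might be wired, assuming the judgment is supplied as a callback so that a model can make it. The `Memory` type, the three verdict labels, and `toy_judge` are hypothetical; `toy_judge` is a keyword lookup used only so the example runs without a model.

```python
from dataclasses import dataclass
from typing import Callable, List

@dataclass
class Memory:
    text: str
    resolved: bool = False   # True once the problem this memory tracks is fixed

def forget(memories: List[Memory], judge: Callable[[Memory], str]) -> List[Memory]:
    """Return the memories worth keeping.

    judge returns "keep", "discard", or "transitional". Transitional
    memories survive only until they are marked resolved.
    """
    kept = []
    for memory in memories:
        verdict = judge(memory)
        if verdict == "keep":
            kept.append(memory)
        elif verdict == "transitional" and not memory.resolved:
            kept.append(memory)
    return kept

def toy_judge(memory: Memory) -> str:
    """Placeholder for a model call: classify by keyword."""
    labels = {"lunch": "discard", "stove": "keep",
              "compiler error": "transitional", "architecture": "keep"}
    for keyword, verdict in labels.items():
        if keyword in memory.text.lower():
            return verdict
    return "discard"

memories = [
    Memory("Yesterday's lunch"),
    Memory("The stove is hot, don't touch it"),
    Memory("A specific compiler error", resolved=True),
    Memory("The architecture decision that led to the fix approach"),
]
kept = [m.text for m in forget(memories, toy_judge)]
```

Keeping the judge behind a function boundary means the storage layer does not need to change if the decision moves from a keyword rule to a model prompt.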

Proposed Architecture

┌─────────────────────────────────────────┐
│           Working Memory                 │
│      (Current 1M context window)         │
│  Active conversation + loaded memories   │
└────────────────┬────────────────────────┘
                 ↕ real-time binding
┌─────────────────────────────────────────┐
│      Hippocampal Index Layer            │
│  Entity graph (files, issues, PRs,      │
│  decisions, relationships)              │
│  Episodic records (timeline of events)  │
│  Pattern completion engine              │
└────────────────┬────────────────────────┘
                 ↕ consolidation (idle time)
┌─────────────────────────────────────────┐
│      Cortex (Long-term Storage)         │
│  Semantic knowledge (extracted rules,   │
│  user preferences, project conventions) │
│  Independent of raw transcripts         │
└─────────────────────────────────────────┘

Open Questions for Discussion

Storage: Should the index use SQLite with structured relations, or is a graph database needed?
Model-driven decisions: How much context budget should be allocated for the AI to decide what to consolidate/forget?
Trigger: Should consolidation run on a timer, at context pressure thresholds, or explicitly via a tool call?
Pattern completion granularity: When a user references "the tool issue from earlier," how much context should the system retrieve and inject?
Forgetting policy: Who decides what is ephemeral — the AI, the user, a config threshold, or a combination?
Cross-session retrieval: When the user starts a new session, what (if anything) should be pre-loaded into the working memory?
Relationship to existing code: The note tool, session persistence, and compaction infrastructure already exist. How should a hippocampal system build on or replace these?
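
To make the trigger question concrete, here is a sketch in which the three options coexist rather than compete. The `TriggerConfig` fields and their default values are placeholders invented for this example, not proposed settings.

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class TriggerConfig:
    idle_seconds: float = 300.0    # hypothetical idle-timer threshold
    pressure_ratio: float = 0.8    # hypothetical fraction of the context window

def should_consolidate(
    cfg: TriggerConfig,
    seconds_idle: float,
    tokens_used: int,
    context_limit: int,
    explicit_request: bool = False,
) -> Optional[str]:
    """Return the reason consolidation should run, or None if it should not."""
    if explicit_request:
        return "explicit"
    if tokens_used / context_limit >= cfg.pressure_ratio:
        return "context_pressure"
    if seconds_idle >= cfg.idle_seconds:
        return "idle_timer"
    return None

cfg = TriggerConfig()
reason = should_consolidate(cfg, seconds_idle=10, tokens_used=850_000,
                            context_limit=1_000_000)
```

Returning the reason rather than a bare boolean would let later stages pick a different consolidation depth per trigger, if that turns out to be useful.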

Desired Outcome

A design discussion. Not an implementation ticket. This should produce a documented architecture that the community can review, critique, and eventually implement in slices.

@cy2311 · 2026-06-09T05:36:41Z

github-actions[bot]
Bot Jun 9, 2026

Thanks @cy2311 for the report.

This issue is staying open for maintainer triage. CodeWhale gets better because people bring us real edge cases from real machines, providers, regions, and workflows.

If you can add a reproduction, logs, version output, screenshots, or the provider/model involved, that makes it much easier for us to verify and harvest the fix. Maintainers may comment /lgtmi to mark recurring issue reporters as approved so this intake note is skipped next time.

cy2311 (Author) · 2026-06-09T05:45:40Z

First implementation slice submitted in PR #2933 — adds:

New crates/memory/ crate with SQLite-backed entity graph + FTS5 fact search
memorize tool for agent to store structured facts with importance scoring
recall tool for full-text + graph-based query
Entity binding between facts and files/issues/PRs/decisions

The initial implementation focuses on explicit agent-driven storage (the agent calls memorize / recall). Automatic scanning, consolidation, and active forgetting are follow-up work.
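
For readers who want a feel for the shape of this slice, here is a small Python sketch of a memorize / recall pair over SQLite with FTS5. It is not the code in the PR: the table names, function signatures, and ranking by importance are all assumptions made for illustration, and it requires an SQLite build with FTS5 enabled.

```python
import sqlite3
from typing import List, Tuple

conn = sqlite3.connect(":memory:")
# Facts live in a regular table; a separate FTS5 table indexes their text.
conn.execute("CREATE TABLE facts (id INTEGER PRIMARY KEY, text TEXT, importance REAL)")
conn.execute("CREATE VIRTUAL TABLE facts_fts USING fts5(text)")
conn.execute("CREATE TABLE bindings (fact_id INTEGER, entity TEXT)")

def memorize(text: str, importance: float, entities: List[str]) -> int:
    """Store a fact, index it for full-text search, and bind it to entities."""
    cur = conn.execute(
        "INSERT INTO facts (text, importance) VALUES (?, ?)", (text, importance)
    )
    fact_id = cur.lastrowid
    conn.execute("INSERT INTO facts_fts (rowid, text) VALUES (?, ?)", (fact_id, text))
    conn.executemany(
        "INSERT INTO bindings VALUES (?, ?)", [(fact_id, e) for e in entities]
    )
    return fact_id

def recall(query: str) -> List[Tuple[str, List[str]]]:
    """Full-text match on fact text, then attach each fact's bound entities."""
    rows = conn.execute(
        "SELECT f.id, f.text FROM facts_fts "
        "JOIN facts f ON f.id = facts_fts.rowid "
        "WHERE facts_fts MATCH ? ORDER BY f.importance DESC",
        (query,),
    ).fetchall()
    results = []
    for fact_id, text in rows:
        entities = [r[0] for r in conn.execute(
            "SELECT entity FROM bindings WHERE fact_id = ? ORDER BY entity",
            (fact_id,),
        )]
        results.append((text, entities))
    return results

memorize("Removed generic suffix from format_tool_error", 0.9,
         ["dispatch.rs", "PR #2933"])
memorize("Lunch was noodles", 0.1, [])
hits = recall("error")
```

The query string is passed to FTS5 as-is, so a real tool would need to escape or restrict characters such as `#` and `.` that have meaning in the FTS5 query syntax.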

Hmbown (Maintainer) · 2026-06-15T06:07:08Z

Let me think through this deeply. I've stayed away from a memory system so far because when I do it, I want to really think it through. If you're able to open a PR as well and show me a bit of what you're imagining, or reference another open source project or how another tool handles this, that would help me understand what you're thinking.

Hmbown (Maintainer) · 2026-06-15T06:07:49Z

I'd also recommend joining the WeChat group or Telegram group, or posting on the discussion board :)

cy2311 (Author) · 2026-06-16T06:17:27Z

Oh, thanks man. How can I join the WeChat group?
