Research exchange: 5 cognitive memory papers + OpenClaw fork implementation

Hey MemOS team 👋

We noticed your arXiv paper (2507.03724) and the impressive benchmarks (+43.7% vs OpenAI Memory, 35% token savings). We've been pursuing similar research from a different angle and thought a comparison could benefit both projects.

## Our Work

We maintain an [OpenClaw fork](https://github.com/globalcaos/tinkerclaw) with a cognitive memory architecture (~150 files, 7 modules). We've written 5 academic papers, each covering a different aspect of agent memory:

| Paper | Module | Key Contribution |
|-------|--------|-----------------|
| **ENGRAM** | Context compaction | Treats context limits as cache eviction (LRU/LFU-inspired), not summarization. Sleep consolidation pipeline. |
| **CORTEX** | Agent identity | EWMA SyncScore measures persona drift mid-conversation. Auto re-injection when drift > threshold. |
| **HIPPOCAMPUS** | Retrieval indexing | Pre-computed concept index (500 anchors → 9500 chunks). O(1) lookup vs runtime vector search. |
| **LIMBIC** | Humor detection | Bisociation detection in embedding space (Koestler's theory formalized). |
| **SYNAPSE** | Multi-model reasoning | RAAC debate protocol with cognitive diversity scoring. |

## Overlap with MemOS

Your memory type taxonomy (working, episodic, semantic, procedural, strategic) is nearly identical to our ENGRAM's — we'd love to compare the implementation approaches:

- **Your MemOS Cloud API** vs **our SQLite-native zero-infra approach** — different trade-offs worth benchmarking
- **Your token savings claims** vs **our HIPPOCAMPUS pre-computed indexing** — both reduce inference-time cost, different mechanisms
- **Skill memory (your unique contribution)** vs **our procedural memory** — how do they compare on cross-task reuse?

## What We're Exploring Now

**Context Anatomy** — per-turn visual decomposition of the prompt: system prompt vs. history vs. tools vs. retrieval. A debugger for what the model actually sees.

**Curiosity-Driven Exploration** — agent autonomously identifies knowledge gaps and generates questions. Proactive memory, not just reactive.

## Proposal

1. **Benchmark comparison** on LoCoMo / LongMemEval using both systems
2. **Architecture deep-dive** — we share our papers, you share insights from your arXiv work
3. **Joint exploration** of pre-computed indexing + your token optimization

Fork: [globalcaos/tinkerclaw](https://github.com/globalcaos/tinkerclaw)

Happy to share full PDFs of all 5 papers. 🤝

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Research exchange: 5 cognitive memory papers + OpenClaw fork implementation #1129

Our Work

Overlap with MemOS

What We're Exploring Now

Proposal

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Paper	Module	Key Contribution
ENGRAM	Context compaction	Treats context limits as cache eviction (LRU/LFU-inspired), not summarization. Sleep consolidation pipeline.
CORTEX	Agent identity	EWMA SyncScore measures persona drift mid-conversation. Auto re-injection when drift > threshold.
HIPPOCAMPUS	Retrieval indexing	Pre-computed concept index (500 anchors → 9500 chunks). O(1) lookup vs runtime vector search.
LIMBIC	Humor detection	Bisociation detection in embedding space (Koestler's theory formalized).
SYNAPSE	Multi-model reasoning	RAAC debate protocol with cognitive diversity scoring.

Research exchange: 5 cognitive memory papers + OpenClaw fork implementation #1129

Description

Our Work

Overlap with MemOS

What We're Exploring Now

Proposal

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions