Skip to content

Research exchange: 5 cognitive memory papers + OpenClaw fork implementation #1129

@globalcaos

Description

@globalcaos

Hey MemOS team 👋

We noticed your arXiv paper (2507.03724) and the impressive benchmarks (+43.7% vs OpenAI Memory, 35% token savings). We've been pursuing similar research from a different angle and thought a comparison could benefit both projects.

Our Work

We maintain an OpenClaw fork with a cognitive memory architecture (~150 files, 7 modules). We've written 5 academic papers, each covering a different aspect of agent memory:

Paper Module Key Contribution
ENGRAM Context compaction Treats context limits as cache eviction (LRU/LFU-inspired), not summarization. Sleep consolidation pipeline.
CORTEX Agent identity EWMA SyncScore measures persona drift mid-conversation. Auto re-injection when drift > threshold.
HIPPOCAMPUS Retrieval indexing Pre-computed concept index (500 anchors → 9500 chunks). O(1) lookup vs runtime vector search.
LIMBIC Humor detection Bisociation detection in embedding space (Koestler's theory formalized).
SYNAPSE Multi-model reasoning RAAC debate protocol with cognitive diversity scoring.

Overlap with MemOS

Your memory type taxonomy (working, episodic, semantic, procedural, strategic) is nearly identical to our ENGRAM's — we'd love to compare the implementation approaches:

  • Your MemOS Cloud API vs our SQLite-native zero-infra approach — different trade-offs worth benchmarking
  • Your token savings claims vs our HIPPOCAMPUS pre-computed indexing — both reduce inference-time cost, different mechanisms
  • Skill memory (your unique contribution) vs our procedural memory — how do they compare on cross-task reuse?

What We're Exploring Now

Context Anatomy — per-turn visual decomposition of the prompt: system prompt vs. history vs. tools vs. retrieval. A debugger for what the model actually sees.

Curiosity-Driven Exploration — agent autonomously identifies knowledge gaps and generates questions. Proactive memory, not just reactive.

Proposal

  1. Benchmark comparison on LoCoMo / LongMemEval using both systems
  2. Architecture deep-dive — we share our papers, you share insights from your arXiv work
  3. Joint exploration of pre-computed indexing + your token optimization

Fork: globalcaos/tinkerclaw

Happy to share full PDFs of all 5 papers. 🤝

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or improvement | 新功能或改进memosCore MemOS logic (memory, MCP, scheduler, API, database) | 核心模块

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions