Source
arXiv:2601.07190 — "Active Context Compression: Autonomous Memory Management in LLM Agents"
Key Findings
- "Focus" architecture: agents autonomously consolidate interaction history into a persistent Knowledge block, pruning less relevant context
- ~23% token reduction on SWE-bench with no task performance loss
- Biologically inspired exploration/exploitation balance for deciding what to compress vs retain
- Unlike full compaction, only less-relevant segments are compressed — selective and cheaper
Gap in Zeph
Zeph currently uses compress_context in always_on mode — full compaction applied to the entire context window. The Focus approach is more selective: it identifies low-relevance segments and consolidates only those into the Knowledge block, leaving high-relevance recent context intact.
Existing compression issues (#2481 density-aware, #2460 performance-floor) focus on ratio selection, not on selective segment targeting. Focus is architecturally distinct.
Implementation Sketch for Zeph
- In
zeph-memory, add a KnowledgeBlock segment type alongside the existing compaction pipeline
- Add a relevance scorer (cheap model) to identify segments below a threshold
- Route low-relevance segments to consolidation into the Knowledge block instead of full compaction
- Expose
[memory.compression] strategy = "focus" | "always_on" config option
- Surface compression activity in TUI with a spinner ("Consolidating context…")
Agents with equivalent feature
- SWE-bench top performers use selective context management (Cursor shadow workspace, Aider repo-map pruning)
References
Source
arXiv:2601.07190 — "Active Context Compression: Autonomous Memory Management in LLM Agents"
Key Findings
Gap in Zeph
Zeph currently uses
compress_contextinalways_onmode — full compaction applied to the entire context window. The Focus approach is more selective: it identifies low-relevance segments and consolidates only those into the Knowledge block, leaving high-relevance recent context intact.Existing compression issues (#2481 density-aware, #2460 performance-floor) focus on ratio selection, not on selective segment targeting. Focus is architecturally distinct.
Implementation Sketch for Zeph
zeph-memory, add aKnowledgeBlocksegment type alongside the existing compaction pipeline[memory.compression] strategy = "focus" | "always_on"config optionAgents with equivalent feature
References