Add improvements #57 and #58 from Mercury article on agent memory by codenamev · Pull Request #5 · codenamev/claude_memory

codenamev · 2026-04-28T23:46:51Z

Surfaces two ranking-signal gaps from Zaid's 2026-04-28 article "Why
Karpathy's Second Brain Breaks at Agent Scale":

- #57: Provenance strength (stated vs inferred) is recorded but never
  factors into RRF ranking. Cheap win, since the data already exists.
- #58: last_recalled_at only flags stale facts; reinforcement frequency
  and age decay don't influence retrieval order.

Both medium priority; sit behind the 1.0 punchlist but ahead of post-1.0
nudge/drift work since they directly move retrieval-quality benchmarks.
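
To make the two gaps concrete, here is a minimal sketch of how both signals could be folded into a reciprocal rank fusion (RRF) score. It is not the gem's implementation: the `Fact` struct, the weight table, the 30-day half-life, and the multiplicative blend are all assumptions chosen for illustration, and the real schema and tuning may differ.

```ruby
# Hypothetical fact record; field names are illustrative, not the gem's schema.
Fact = Struct.new(:id, :provenance, :recall_count, :last_recalled_at, keyword_init: true)

RRF_K = 60 # smoothing constant commonly used with RRF

# Assumed multipliers (#57): stated facts outrank inferred ones at similar rank.
PROVENANCE_WEIGHT = { stated: 1.0, inferred: 0.7 }.freeze

HALF_LIFE_DAYS = 30.0 # assumed half-life for age decay (#58)

# Plain RRF: each ranked list contributes 1 / (k + rank) per id, rank starting at 1.
def rrf_scores(ranked_lists, k: RRF_K)
  scores = Hash.new(0.0)
  ranked_lists.each do |ids|
    ids.each_with_index { |id, i| scores[id] += 1.0 / (k + i + 1) }
  end
  scores
end

# Exponential decay on time since last recall. Never-recalled facts get an
# assumed neutral 0.5 so they are neither favored nor buried.
def recency_factor(fact, now)
  return 0.5 if fact.last_recalled_at.nil?

  age_days = (now - fact.last_recalled_at) / 86_400.0
  0.5**(age_days / HALF_LIFE_DAYS)
end

# Log-damped reinforcement so frequently recalled facts rise without dominating.
def reinforcement_factor(fact)
  1.0 + Math.log(1 + fact.recall_count)
end

# Returns fact ids ordered by the adjusted score, best first.
def rerank(ranked_lists, facts_by_id, now: Time.now)
  adjusted = rrf_scores(ranked_lists).map do |id, base|
    fact = facts_by_id.fetch(id)
    weight = PROVENANCE_WEIGHT.fetch(fact.provenance, 1.0)
    [id, base * weight * reinforcement_factor(fact) * recency_factor(fact, now)]
  end
  adjusted.sort_by { |_, score| -score }.map(&:first)
end
```

Keeping the adjustments as multipliers on top of the base RRF score means the existing fusion step stays untouched; whether a multiplicative or additive blend works better is something the retrieval-quality benchmarks would have to decide.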


`claude-memory show` prints what memory would inject at the next SessionStart. Answering "is this still worth it?" requires inspectability: a user who can't see what memory will inject can't develop confidence in it. The CLAUDE.md alternative is `cat CLAUDE.md` (instant, plain English, no tooling); this is the same one-line inspect surface for the curated facts the injector picks each session.

The command runs the exact `Hook::ContextInjector` path real sessions use, so what shows here is what Claude actually receives, not a rebuilt approximation that could drift.

By default it suppresses the "Pending Knowledge Extraction" raw-transcript dump (intended for LLM distillation, not human reading); pass `--pending` to see the full fresh-session payload. `--source SOURCE` simulates startup/resume/clear so users can preview each entrypoint's payload.

The footer reports fact count, ~token estimate (via `Core::TokenEstimator`), and char count so users instantly see the SessionStart cost; this pairs with the #47 token-budget telemetry surfaces.

Addresses: docs/1_0_punchlist.md (0.11.0 #5) / improvements.md #51
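
As a rough illustration of the footer, the sketch below computes the three numbers from a payload string. It does not reproduce `Core::TokenEstimator`, whose algorithm is not described here; the four-characters-per-token ratio is a common rule of thumb used as an assumption, and `estimate_tokens` and `footer` are hypothetical helper names.

```ruby
# Assumption: about four characters per token. The gem's estimator may differ.
CHARS_PER_TOKEN = 4.0

# Rough token estimate, rounded up so the reported cost is never understated.
def estimate_tokens(text)
  (text.length / CHARS_PER_TOKEN).ceil
end

# Builds a one-line summary: fact count, approximate tokens, character count.
def footer(facts, payload)
  format("%d facts, ~%d tokens, %d chars", facts.size, estimate_tokens(payload), payload.length)
end
```

The estimate is only meant to give an order-of-magnitude feel for the SessionStart cost; exact counts depend on the model's tokenizer.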

CHANGELOG gains an Unreleased entry summarizing the new `show` command and its `--pending` / `--source` flags. 1.0 punchlist item 0.11.0 #5 is stamped as landed 2026-04-29 with its commit hash so milestone progress is trackable.

Addresses: docs/1_0_punchlist.md (0.11.0 #5) / improvements.md #51

codenamev merged commit a77cdbc into main Apr 28, 2026
1 check passed

codenamev deleted the claude/research-article-insights-MHkA3 branch April 28, 2026 23:47

codenamev merged 1 commit into main from claude/research-article-insights-MHkA3
