Add improvements #57 and #58 from Mercury article on agent memory #5

Merged
codenamev merged 1 commit into main from claude/research-article-insights-MHkA3
Apr 28, 2026

@codenamev
Owner

Surfaces two ranking-signal gaps from Zaid's 2026-04-28 article "Why
Karpathy's Second Brain Breaks at Agent Scale":

  • #57: Provenance strength (stated vs inferred) is recorded but never
    factors into RRF ranking. Cheap win — data already exists.
  • #58: last_recalled_at only flags stale facts; reinforcement frequency
    and age decay don't influence retrieval order.

Both medium priority; sit behind the 1.0 punchlist but ahead of post-1.0
nudge/drift work since they directly move retrieval-quality benchmarks.
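
As a rough illustration of what #57 and #58 propose, the sketch below multiplies a plain reciprocal-rank-fusion score by a provenance weight and a recency/reinforcement factor. Everything here is an assumption made for illustration, not the project's ranking code: the method names, the `0.7` weight for inferred facts, the 30-day half-life, the log boost, and the `:provenance`, `:age_days`, and `:recall_count` fields.

```ruby
# Hypothetical sketch: not the project's actual ranking code.
RRF_K = 60 # conventional RRF smoothing constant

# Assumed multipliers; real values would need tuning against benchmarks.
PROVENANCE_WEIGHT = { stated: 1.0, inferred: 0.7 }.freeze
HALF_LIFE_DAYS = 30.0

# Plain reciprocal rank fusion: sum 1 / (k + rank) over every ranked list
# in which a fact id appears (ranks are 1-based).
def rrf_scores(ranked_lists, k: RRF_K)
  scores = Hash.new(0.0)
  ranked_lists.each do |list|
    list.each_with_index { |id, idx| scores[id] += 1.0 / (k + idx + 1) }
  end
  scores
end

# Exponential decay on days since last recall, multiplied by a log boost
# for facts that have been recalled often.
def recency_factor(age_days, recall_count, half_life: HALF_LIFE_DAYS)
  decay = 0.5**(age_days / half_life)
  decay * (1.0 + Math.log(1 + recall_count))
end

# Returns fact ids ordered by the adjusted score, best first.
def rerank(ranked_lists, facts)
  rrf_scores(ranked_lists).map do |id, score|
    fact = facts.fetch(id)
    weight = PROVENANCE_WEIGHT.fetch(fact[:provenance], 1.0)
    [id, score * weight * recency_factor(fact[:age_days], fact[:recall_count])]
  end.sort_by { |_, s| -s }.map(&:first)
end
```

With these assumed weights, an inferred fact that outranks a stated fact in every input list can still end up below it after reranking, which is the kind of reordering #57 is after.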

@codenamev codenamev merged commit a77cdbc into main Apr 28, 2026
1 check passed
@codenamev codenamev deleted the claude/research-article-insights-MHkA3 branch April 28, 2026 23:47
codenamev added a commit that referenced this pull request Apr 30, 2026
`claude-memory show` prints what memory would inject at the next
SessionStart. The trust answer to "is this still worth it?" requires
inspectability — a user who can't see what memory will inject can't
develop confidence in it. The CLAUDE.md alternative is `cat CLAUDE.md`
(instant, plain English, no tooling); this is the same one-line inspect
surface for the curated facts the injector picks each session.

Runs the exact `Hook::ContextInjector` path real sessions use, so what
shows here is what Claude actually receives — not a rebuilt
approximation that could drift.

Default suppresses the "Pending Knowledge Extraction" raw-transcript
dump (intended for LLM distillation, not human reading); pass --pending
to see the full fresh-session payload. --source SOURCE simulates
startup/resume/clear so users can preview each entrypoint's payload.
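
For readers unfamiliar with this style of flag handling, here is a minimal sketch of how `--pending` and `--source SOURCE` could be parsed with Ruby's standard `OptionParser`. The method name `parse_show_args`, the returned hash shape, and the `"startup"` default are assumptions; the actual command may be wired differently.

```ruby
require "optparse"

# Values named in the commit message; validating against them is an assumption.
VALID_SOURCES = %w[startup resume clear].freeze

# Hypothetical helper: returns a hash of options for a `show`-style command.
def parse_show_args(argv)
  opts = { pending: false, source: "startup" } # default source is assumed
  OptionParser.new do |o|
    o.on("--pending", "Include the Pending Knowledge Extraction section") do
      opts[:pending] = true
    end
    # Passing an array restricts the argument to the listed values.
    o.on("--source SOURCE", VALID_SOURCES, "Simulate startup, resume, or clear") do |s|
      opts[:source] = s
    end
  end.parse(argv)
  opts
end
```

An unrecognized source value raises `OptionParser::InvalidArgument`, so a typo fails loudly instead of silently previewing the wrong entrypoint.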

Footer reports fact count, ~token estimate (via Core::TokenEstimator),
and char count so users instantly see the SessionStart cost — pairs
with the #47 token-budget telemetry surfaces.
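
The footer arithmetic can be sketched as follows. This is an illustration under stated assumptions, not the behavior of `Core::TokenEstimator`: the four-characters-per-token heuristic, the method names, and the footer wording are all hypothetical.

```ruby
# Hypothetical sketch: the real Core::TokenEstimator may work differently.
CHARS_PER_TOKEN = 4.0 # rough heuristic for English text (assumption)

# Approximate token count from character length, rounded up.
def estimate_tokens(text)
  (text.length / CHARS_PER_TOKEN).ceil
end

# Builds a one-line summary of the payload's size.
def footer(payload, fact_count)
  "#{fact_count} facts, ~#{estimate_tokens(payload)} tokens, #{payload.length} chars"
end
```

Reporting both the estimate and the raw character count lets a reader sanity-check the token figure without trusting the heuristic.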

Addresses: docs/1_0_punchlist.md (0.11.0 #5) / improvements.md #51
codenamev added a commit that referenced this pull request Apr 30, 2026
CHANGELOG gains an Unreleased entry summarizing the new show command
and its --pending / --source flags. 1.0 punchlist item 0.11.0 #5
stamped landed 2026-04-29 with commit hash so milestone progress
is trackable.

Addresses: docs/1_0_punchlist.md (0.11.0 #5) / improvements.md #51