Replies: 1 comment
-
|
Consensus reached 2026-06-10 (kant + Claude, live session). All four open points resolved:
#57 has been retitled and its body rewritten to the lifecycle-evidence framing (obsolete_when + kind_misfit + supersedes + tags sidecar). Decision criteria are now joint with #62's invalidation curves. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Context
#57 proposes observe-only free-form
tagson extracted memories to gather taxonomy evidence before changing thekindenum. After discussing the direction (kant + Claude, 2026-06-10), we think the methodology of #57 is right but the instrumentation is aimed at the wrong evidence. Posting here so the design discussion doesn't get buried in issue comments; #57 links back to this thread.1. The brain has no tags —
kindis really a lifecycle-policy selectorIn the brain, a memory's "type" is not a label; it is a difference in storage system and dynamics. Episodic / semantic / procedural memories live in different circuits (hippocampus vs neocortex vs basal ganglia) with different consolidation speeds, decay curves, and retrieval modes. Retrieval is content-addressed association, not label lookup. "Type" is where and how a memory lives — a tag is a researcher's annotation, not a mechanism.
Re-read #57's force-fit evidence through that lens and the three misfits differ not in semantic category but in lifecycle dynamics:
So the real job of
kindis not classification — it selects a decay/supersession policy (contradiction filtering, dedup, injection priority are all policy applications). The misfit evidence says "labels are a proxy for dynamics", not "we need more labels".It follows that extractor-emitted tags measure the wrong thing: they capture the extractor LLM's folk taxonomy — its classification intuitions — not the memory's actual behavior. The brain-faithful instrumentation is behavioral: when does a memory get contradicted, superseded, injected-and-actually-used, or go stale? Most of that is already in the event log or is cheap to add as telemetry, and it is more honest evidence than self-reported labels.
2. Where the brain analogy must break: N selves, one store
A brain has one self, so it never needs contradiction detection — a single self serializes its own belief updates. memorize is a store shared by N agents + humans, which makes it a distributed belief-revision problem, not a single-brain problem. That requires things brains don't have:
Direction: single-memory lifecycle modeled like a brain; multi-writer reconciliation modeled like a distributed system. Neither alone is the North Star.
3. The cost argument for tags doesn't hold
Consolidation already pays the expensive step: an LLM reads the full observation window to compress it. The marginal cost of richer output fields is ~zero — tags are a cost-saving compromise with no cost to save. Keeping #57's (correct) observe-only methodology but spending the same budget on essential evidence, the extractor can emit:
obsolete_when— free-form expiry condition: "when PR X merges", "until the convention is amended", "never / persistent";supersedes— free-form mention of an existing memory this one replaces.All stored, read by no consumer — the same safety properties as tags (a malformed or absent field must never trip the #43 parse-failure path) — but the evidence directly answers the question we'll actually face later: what per-kind lifecycle policies should exist?
4. Embedding similarity: recall-only, never the judge
The known traps of cosine-similarity scores are real and structural, and they bite contradiction detection hardest:
So the only trustworthy role for similarity is candidate recall ("plausibly the same topic?"); the judge must remain an LLM, with the current pre-filter → judge pipeline kept. The residual risk is silent false negatives when the pre-filter is too tight — nobody sees a missed contradiction. A cheap complement: a parallel lexical/entity-overlap recall path (BM25-ish) feeding the same judge; it catches pairs embeddings miss, and vice versa.
Proposal: reframe #57, don't close it
tags→obsolete_when+ kind-misfit signal (+ optionaltagsas a sidecar, since it's nearly free).Long-term direction this implies: memories are beliefs with provenance + lifecycle conditions;
kinddissolves into named lifecycle policies; contradiction resolution rests on causal order (HLC, #39) + LLM judgment, with similarity demoted to recall.Beta Was this translation helpful? Give feedback.
All reactions