docs: memory build-vs-gate decision (Position C) + universal gate pattern + arch.md index by hanwencheng · Pull Request #146 · litentry/agentKeys

hanwencheng · 2026-05-31T04:03:00Z

Summary

Pre-implementation research + decision + docs. No code changes. Three threads, plus source-of-truth hygiene:

1. Decision: memory = engine / store / gate (Position C)

docs/research/memory-build-vs-gate-decision.md records the call: AgentKeys owns the gated, K3-encrypted, per-actor, audited memory STORE + the GATE; the ranking/extraction ENGINE is delegated to the ecosystem (mem0-self-hosted / Claude memory tool / Hermes-native / agentmemory).

Rejects "pure gate, use their store" — it breaks the founding privacy requirement on 3 of 5 axes (bytes not in our S3; vendor/LLM sees plaintext; Claude-memory couples to the LLM you wanted swappable).
Claude's own memory tool validates the split: the engine is closed, but the store is yours (BetaAbstractMemoryTool backs it with "S3, encrypted files"). AgentKeys can literally be a Claude memory backend.
Durability argument: the engine is per-agent and disposable; the user's memory + access policy is per-user and durable — so own the durable layer.

2. Reframe: `agentkeys-memory-design.md` → "gated memory backend"

Engine sections (/search, /rebuild-index, vector index, BM25, embedding rotation) demoted to optional pluggable stages E1/E2 with 🔌 ENGINE banners.
§9 stages split into CORE (store+gate: M-1 → M0 → M1 → M1.5 namespace gate → M2 → M3 → M4) and ENGINE (E1/E2, deferred).
The build shrinks; the moat (cap-gated encrypted per-actor store) sharpens.

3. Generalization: the universal gate pattern

docs/research/universal-gate-pattern.md — the engine/store/gate split generalizes to every worker (memory / email / payment / home-IoT). The cap-token is the universal policy carrier with four primitives:

Primitive	Example
Operation (read/write/execute)	"read email but not send"
Resource scope (string-set)	"only the kids' room IoT devices"
Quantitative limit	"no single spend > $20"
Attribute constraint (allow/deny set)	"no fast food" → `deny_attrs:["mcc:5814"]`

Plus the determinism principle: the gate decides authorization deterministically on structured attributes; semantic judgment ("is this a burger?") happens upstream and reduces to an attribute the gate checks — never an LLM judgment at the gate (non-deterministic, prompt-injectable, unauditable). New workers (e.g. agentkeys-worker-iot) become template instances.

4. arch.md indexing + source-of-truth hygiene

Per the architecture-as-source-of-truth policy, arch.md now links outward to the new docs, and I fixed pre-existing defects found while anchoring:

Added a "Design records & research" index block (decision, gated-backend spec, universal-gate pattern, survey).
§15.2 memory-service: filled the literal [TODO link] placeholder with the Position-C anchors + store-vs-engine framing + the 4 types × namespaces.
Fixed broken companion links — signer-protocol.md / threat-model-key-custody.md pointed at docs/ but live in docs/spec/.
Removed stray committed reasoning ("Wait —"/"Hmm, line 21 is a stray" leaked into the header).
§17.5: collapsed a corrupted cap-endpoint table — /v1/cap/memory-search was duplicated 41× (88 rows total) — down to the 5 canonical rows; fixed the intro count. arch.md: 2160 → 2077 lines.

0 broken internal links remain in arch.md.

New / changed files

New: docs/research/memory-build-vs-gate-decision.md, docs/research/universal-gate-pattern.md, docs/research/agentmemory-and-claw-code-followup.md
Modified: docs/plan/agentkeys-memory-design.md (reframe), docs/arch.md (index + defect cleanup), docs/research/README.md (index rows)

Test plan

arch.md internal links: 0 broken (Python link-resolver over all ](...) targets).
§17.5 cap table: 5 canonical rows, memory-search duplication removed.
All docs/research/*.md present in docs/research/README.md index.
Reframed plan is internally consistent (CORE/ENGINE numbering, ENGINE banners, no dangling M5/M6 in live content).
Reviewer gut-check on Position C before any M-stage code starts.

Notes for reviewers

This supersedes the framing of the memory plan as merged in docs: AI memory worker design + reorganize docs/ to canonical layout #106 — same file, now reframed to the gated-backend scope. The store/gate half is unchanged; the engine half is now explicitly pluggable/out-of-build.
Per the /create-pr policy: committed from the Claude Code worktree (git), pushed via jj from the main repo; no Co-Authored-By line.
The agentkeys-docs skill referenced in CLAUDE.md does not exist on disk — the docs organization here was done manually against the documented layout policy. Worth either creating that skill or dropping the reference (out of scope for this PR).

…tern + arch.md index Decision + reframe - research/memory-build-vs-gate-decision.md: DECIDED Position C — memory is three layers (engine / store / gate). AgentKeys owns the K3-encrypted, per-actor, cap-gated, audited STORE + the GATE; the ranking/extraction ENGINE is delegated to the ecosystem (mem0-self-hosted / Claude memory tool / Hermes-native / agentmemory). Rejects "pure gate, use their store" (breaks the founding privacy requirement 3/5). Claude's own memory tool validates the split (engine closed, store yours via BetaAbstractMemoryTool). - plan/agentkeys-memory-design.md: reframed from "memory system" to "gated memory backend". Engine sections (vector index, BM25, /search, /rebuild-index, embedding rotation) demoted to optional pluggable stages E1/E2 with ENGINE banners. §9 stages split into CORE (store+gate: M-1→M0→M1→M1.5 namespace gate→M2→M3→M4) and ENGINE (E1/E2, deferred). Generalization - research/universal-gate-pattern.md: the engine/store/gate split generalizes to every worker (memory / email / payment / home-IoT). Four cap-token policy primitives (operation / resource-scope / quantitative-limit / attribute-constraint) + the determinism principle ("no burgers" = deterministic MCC/category check, never an LLM judgment at the gate) + stateless-vs-cumulative-limit distinction. New workers are template instances; home-IoT "control only kids' equipment" = resource scope + op-level. Research inputs - research/ai-memory-systems-survey.md companion + agentmemory-and-claw-code -followup.md (engine landscape that fed the decision). - research/README.md: index rows for the three new docs. arch.md indexing + defect cleanup (single-source-of-truth hygiene) - Added a "Design records & research" index block linking the decision, the gated-backend spec, and the universal-gate pattern from arch.md. - §15.2 memory-service: filled the [TODO link] placeholder with the Position-C anchors; documented store-vs-engine + 4 types × namespaces. - Fixed broken companion-doc links (spec/signer-protocol.md, spec/threat-model-key-custody.md were pointing at docs/ root). - Removed stray committed reasoning ("Wait —"/"Hmm" lines) from the header. - §17.5: collapsed a corrupted cap-endpoint table (88 rows, /v1/cap/ memory-search duplicated 41×) to the 5 canonical rows; fixed the intro count. 0 broken internal links remain. No code changes. Planning + research + docs only.

…cs Phase C) Found by the agentkeys-docs cross-link audit: - ../cloud-setup.md → cloud-bootstrap.md (renamed in #99; arch.md is in docs/) - ../runbook-k3-rotation.md → runbook-k3-rotation.md (drop ../; lives at docs/ top) - ../v2-stage1-migration-and-demo.md → v2-stage1-migration-and-demo.md (drop ../) - spec/spec/heima-open-questions.md → spec/heima-open-questions.md (double-spec typo) - ../wiki/agent-role-and-usage-hdkd-per-agent-omni.md → wiki/agent-role-and-usage-hdkd-per-agent-omni.md (drop ../) arch.md now has 0 broken internal links; 0 stale old-path refs repo-wide; publish-wiki.yml targets docs/wiki/.

…keys-docs Phase C) - arch.md §22a.5a: spec/spec/heima-open-questions.md → spec/heima-open-questions.md (double-spec typo, anchored link the prior pass missed). - New docs/wiki/agent-role-and-usage-hdkd-per-agent-omni.md — the focused operator reference for the agent role that CLAUDE.md + arch.md (§6, §25) link to but that never existed. Defers to arch.md per the wiki-location policy; covers agent-vs-master role, per-agent actor_omni stability, gated capabilities, and the create/run/scope/revoke operator flow. arch.md now has 0 broken internal links (verified). The agent-role page resolves the last 2 broken refs (arch.md §6 + §25).

…c-040fa6 # Conflicts: # docs/research/README.md

…paths after #141 merge

…Phase C cleanup)

hanwencheng added 6 commits May 31, 2026 12:02

Merge remote-tracking branch 'origin/main' into claude/priceless-dira…

f0f6274

…c-040fa6 # Conflicts: # docs/research/README.md

docs(research/README): dedup agent-iam-strategy row + normalize link …

259e1c4

…paths after #141 merge

docs: fix stale spec/architecture.md refs -> arch.md (agentkeys-docs …

e02924e

…Phase C cleanup)

hanwencheng merged commit f47fc64 into main May 31, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs: memory build-vs-gate decision (Position C) + universal gate pattern + arch.md index#146

docs: memory build-vs-gate decision (Position C) + universal gate pattern + arch.md index#146
hanwencheng merged 6 commits into
mainfrom
claude/priceless-dirac-040fa6

hanwencheng commented May 31, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

hanwencheng commented May 31, 2026

Summary

1. Decision: memory = engine / store / gate (Position C)

2. Reframe: agentkeys-memory-design.md → "gated memory backend"

3. Generalization: the universal gate pattern

4. arch.md indexing + source-of-truth hygiene

New / changed files

Test plan

Notes for reviewers

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

2. Reframe: `agentkeys-memory-design.md` → "gated memory backend"