
Sanma (秋刀魚)

Claude Code that doesn't forget: persistent, structured project memory that survives sessions, compactions, and restarts. Two lightweight hooks (Sonnet 4.6 + Haiku 4.5) maintain a per-project knowledge tree on disk and surface the relevant facts back into context on every turn, keeping long-running discussions coherent across compactions, restarts, and project switches.


1. Why this exists

Long-running conversations with Claude Code suffer from three failure modes:

  • When a session grows past the context window, automatic compaction silently discards earlier turns, and the underlying premise of the discussion starts drifting.
  • Switching to a new session forces you to re-explain decisions you already made yesterday.
  • Important technical commitments — the architecture you chose, the option you rejected, the open question you flagged — get buried in the chat history and become unrecoverable.

claude-code-cms writes each turn into a structured correlation map on local disk and re-injects only the facts that matter into the next turn's context. Claude itself doesn't have to know the system exists; it just keeps remembering the right things.


2. What it does

Two hooks are registered in your Claude Code settings.json and fire automatically every turn.

| Hook | When it fires | What it does | Model |
|---|---|---|---|
| Stop | Right after Claude finishes a response | Reads the latest exchange and updates the correlation map (adds new nodes or increments mass on existing ones) | Claude Sonnet 4.6 |
| UserPromptSubmit | The moment you submit a new prompt | Compares your prompt against the existing map and pulls 3–5 relevant facts | Claude Haiku 4.5 |

Extracted facts are passed to the main session as additionalContext. Your primary model (Opus, Sonnet, whatever you use) doesn't need to be aware of the mechanism — it just sees the facts as part of its prompt.
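The registered entries in settings.json look roughly like the fragment below (the script names are illustrative; install.py writes the real entries):

```json
{
  "hooks": {
    "UserPromptSubmit": [
      { "hooks": [ { "type": "command", "command": "python3 ~/.claude/hooks/cms/prompt_hook.py" } ] }
    ],
    "Stop": [
      { "hooks": [ { "type": "command", "command": "python3 ~/.claude/hooks/cms/stop_hook.py" } ] }
    ]
  }
}
```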


3. The correlation map

The data model is a three-level tree, taken from the paper Geometric Convergence for Conversational Context Management (DOI: 10.5281/zenodo.19354705):

SUN: top-level topic
├── PLANET (mass=N): subtopic, depth N
│   ├── SAT: detail
│   └── SAT: detail
└── PLANET (mass=N): another subtopic
    └── SAT: ...

Each planet's mass equals the number of satellites accumulated under it, which proxies how deeply the user has engaged with that subtopic. The fact-extraction hook prefers high-mass planets when picking what to inject next, so long-standing concerns aren't forgotten.
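As a rough illustration of how mass drives fact selection, here is a minimal sketch; the dict layout and field names are assumptions for illustration, not the on-disk schema:

```python
# Minimal sketch of the sun/planet/satellite tree and mass-preferred
# fact selection. The dict layout is illustrative, not the exact schema.
def pick_facts(sun: dict, limit: int = 5) -> list[str]:
    """Return satellite details, drawing from the highest-mass planets first."""
    planets = sorted(sun["planets"], key=lambda p: p["mass"], reverse=True)
    facts = []
    for planet in planets:
        for sat in planet["satellites"]:
            facts.append(sat)
            if len(facts) == limit:
                return facts
    return facts

sun = {
    "title": "auth refactor",
    "planets": [
        # mass equals the satellite count, per the invariant above
        {"title": "session storage", "mass": 1, "satellites": ["use Redis"]},
        {"title": "token format", "mass": 3,
         "satellites": ["JWT rejected", "opaque tokens chosen", "15 min TTL"]},
    ],
}
```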


4. Per-turn lifecycle

[user submits prompt]
    │
    ↓
[UserPromptSubmit hook]
    │  Haiku 4.5 reads the map + the prompt, extracts relevant facts
    ↓
[facts injected as additionalContext for the main session]
    │
    ↓
[main model produces a response]
    │
    ↓
[Stop hook]
    │  Sonnet 4.6 reads the latest exchange, updates the JSON map
    ↓
[correlation_map.json written atomically to disk]

Both hooks run in a sandboxed subprocess, so the latency footprint on your primary session is small (~0.5–1.5s before the response starts; another 1–2s in the background after it ends).
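A stripped-down UserPromptSubmit hook following this lifecycle might look like the sketch below. The output field names follow the Claude Code hook-output convention, but treat the exact shape as an assumption and defer to the repo's actual scripts:

```python
def build_output(facts: list[str]) -> dict:
    """Wrap extracted facts in the UserPromptSubmit hook-output shape
    (assumed field names). Claude Code merges additionalContext into
    the main session's prompt."""
    return {
        "hookSpecificOutput": {
            "hookEventName": "UserPromptSubmit",
            "additionalContext": "\n".join(f"- {fact}" for fact in facts),
        }
    }

# At hook entry the script would read the event JSON from stdin, ask Haiku
# for facts against the stored map, then print(json.dumps(build_output(facts))).
```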


5. Installation

Requirements

  • Python 3.11 or newer
  • Claude Code CLI (the VS Code extension or the npm-installed CLI)
  • An active Claude Pro / Max subscription, or an Anthropic API key (see §7)

Steps

Linux / macOS / Git Bash:

git clone https://github.com/rkceve/claude-code-cms.git ~/.claude/hooks/cms
cd ~/.claude/hooks/cms
python3 install.py

Windows (PowerShell):

git clone https://github.com/rkceve/claude-code-cms.git "$HOME\.claude\hooks\cms"
cd "$HOME\.claude\hooks\cms"
python install.py

The installer adds the two hook entries to ~/.claude/settings.json idempotently. Existing hooks are not overwritten.
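The idempotent merge can be sketched like this (illustrative; install.py is the authoritative implementation):

```python
import json
from pathlib import Path

def register_hook(settings_path: Path, event: str, command: str) -> None:
    """Append a hook entry for `event` unless an identical one already
    exists, leaving all other hooks in settings.json untouched."""
    settings = json.loads(settings_path.read_text()) if settings_path.exists() else {}
    entries = settings.setdefault("hooks", {}).setdefault(event, [])
    entry = {"hooks": [{"type": "command", "command": command}]}
    if entry not in entries:          # idempotent: re-running is a no-op
        entries.append(entry)
    settings_path.write_text(json.dumps(settings, indent=2))
```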

Verify

python install.py --status

If both hooks appear, you're done. The next Claude Code session will pick them up automatically.

Uninstall

python install.py --uninstall

This removes only the entries that point at the CMS scripts — other hooks in your settings.json are left alone.


6. Usage

After installation, there is nothing to do. Just use Claude Code normally; the correlation map populates itself.

Where the map lives

~/.claude/projects/<cwd-slug>/memory/cms/chats/<session_id>/correlation_map.json

Each Claude Code session gets its own correlation map under a directory keyed by both the project (cwd) and the session id. This per-chat layout keeps individual maps small — the prompt sent to the lightweight update model embeds the full current map, so a per-project map shared across all historical conversations would balloon over time and push the model past its timeout.

If you want to inspect maps from a previous session, look for the session_id in cms.log or browse the chats/ directory directly.

Logs

~/.claude/hooks/cms/cms.log

Every hook invocation is recorded with a timestamp. The log auto-rotates when it exceeds ~1 MB.
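Size-based rotation of this kind can be sketched as follows (illustrative, not the repo's exact code):

```python
from pathlib import Path

def append_log(log_path: Path, line: str, max_bytes: int = 1_000_000) -> None:
    """Append one line; once the file exceeds max_bytes, rename it to
    <name>.1 first so a fresh log starts (one rotated generation kept)."""
    if log_path.exists() and log_path.stat().st_size > max_bytes:
        log_path.replace(log_path.with_suffix(log_path.suffix + ".1"))
    with log_path.open("a", encoding="utf-8") as f:
        f.write(line + "\n")
```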

Disabling for one session

CMS_DISABLE=1 claude

The hooks early-exit when this environment variable is set.
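Both hooks can implement that guard with a one-line check at entry, e.g.:

```python
import os

def cms_disabled() -> bool:
    """True when the user opted out of memory for this session
    via `CMS_DISABLE=1 claude`."""
    return os.environ.get("CMS_DISABLE") == "1"

# At hook entry: exiting with status 0 and no output tells Claude Code
# "nothing to inject, nothing to update".
```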


7. Configuration

Copy cms.toml.example to cms.toml and edit. All keys are optional; unspecified ones fall back to defaults.

What you can override:

  • The Haiku and Sonnet model IDs (e.g., to switch to other Anthropic models)
  • Per-call timeouts and retry counts
  • Glob patterns for project directories the hooks should skip
  • Soft cap on planet count (low-mass planets are pruned past this limit)
  • Provider mode: claude_code_cli (default — uses your existing OAuth session via claude -p) or anthropic_sdk (uses the anthropic Python SDK directly; requires ANTHROPIC_API_KEY)

See the comments inside cms.toml.example for detail.
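A hypothetical cms.toml covering these knobs might look like the fragment below (key names and model-ID placeholders are illustrative; cms.toml.example is authoritative):

```toml
[models]
update  = "sonnet-model-id"     # Stop hook: map updates
extract = "haiku-model-id"      # UserPromptSubmit hook: fact extraction

[limits]
timeout_seconds = 30
retries         = 2
max_planets     = 40            # low-mass planets are pruned past this

[exclude]
globs = ["~/scratch/**"]        # directories where the hooks skip

[provider]
mode = "claude_code_cli"        # or "anthropic_sdk" (needs ANTHROPIC_API_KEY)
```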


8. Cost

Each turn fires two model calls (one Haiku, one Sonnet). Both run against your existing Claude Pro / Max token allowance — no separate API key is needed unless you opt into the SDK provider mode. The lightweight models don't meaningfully starve your main Opus session, but they do consume additional tokens per turn. To economize, set exclusion patterns in cms.toml, or prefix individual sessions with CMS_DISABLE=1.


9. Known limitations

Sandbox-cwd workaround for a Claude Code CLI bug

Claude Code CLI v2.1.x silently ignores --no-session-persistence whenever the system prompt is large, and writes a transcript file to the current project directory anyway. Those transcripts surface as visible chat tabs in the VS Code extension and pollute your workspace.

To avoid this, the hooks spawn the inner claude -p with cwd set to a dedicated sandbox directory (~/.claude/hooks/cms/_sandbox), then delete the transcript by session_id after the call returns. This is a workaround that becomes unnecessary once Anthropic fixes the upstream bug.
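The workaround amounts to pinning cwd and sweeping up afterwards, roughly like this (function name and transcript-matching pattern are assumptions):

```python
import subprocess
from pathlib import Path

def call_in_sandbox(cmd: list[str], session_id: str, sandbox: Path) -> str:
    """Run the inner CLI with cwd pinned to a sandbox directory, then
    delete any transcript files left behind for this session_id."""
    sandbox.mkdir(parents=True, exist_ok=True)
    result = subprocess.run(cmd, cwd=sandbox, capture_output=True, text=True)
    for leftover in list(sandbox.rglob(f"*{session_id}*")):
        leftover.unlink(missing_ok=True)
    return result.stdout
```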

Schema-violation discards

Sonnet occasionally returns JSON with non-canonical keys (label, description, etc.) instead of the expected title, planets. The hook runs strict schema validation and discards any update that violates it, preserving the previous map. The next turn usually catches up; if this happens persistently, check cms.log.
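The strict check can be sketched as follows (the sun-level keys come from the text above; the planet-level keys and the mass invariant are assumptions):

```python
ALLOWED_SUN_KEYS = {"title", "planets"}
ALLOWED_PLANET_KEYS = {"title", "mass", "satellites"}   # assumed

def valid_map(sun: dict) -> bool:
    """Reject any update whose keys deviate from the canonical schema;
    the caller then keeps the previous map on disk instead."""
    if set(sun) != ALLOWED_SUN_KEYS:
        return False
    for planet in sun["planets"]:
        if set(planet) != ALLOWED_PLANET_KEYS:
            return False
        if planet["mass"] != len(planet["satellites"]):
            return False   # mass must equal the satellite count
    return True
```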

Per-project isolation

Maps are keyed by current working directory, so opening the same project from a different cwd gives you a fresh map. Keep this in mind when you change which folder is open in VS Code.

SDK provider mode is unverified

The provider.mode = "anthropic_sdk" path is implemented but has not been tested end-to-end against a real API key in the development environment. Reports and pull requests welcome.


10. Privacy

The correlation map stores summaries of your conversations as plain JSON. Treat it accordingly:

  • Add correlation_map.json to .gitignore (it lives outside this repo, but you may end up checking the directory by accident)
  • After conversations involving secrets — API keys, credentials, personal data — manually delete the relevant nodes or wipe the map and start fresh
  • This project does not phone home; the only outbound network traffic is the Anthropic API calls the hooks themselves make

11. Author

Ryosuke Kawai — independent researcher.

  • X / Twitter: @rkcevE
  • Contact: X DM, GitHub issues, or ryosukekawai1224@gmail.com

Active research areas:

  • Image compression — hybrid pipeline using Structure Graph Data (SGD): edge vectors plus interior raster, with ONNX super-resolution
  • VR — 3D Gaussian Splatting plus surface EMG (sEMG) for 6-DoF view-prediction; future-projection reprojection to mitigate VR sickness
  • LLM context management — mass-aware attention bias for long-context coherence

12. Patent context

⚠️ Patent pending — Japan, 2026. Commercial use (productizing this code, hosting it as a paid service, redistributing it as part of a paid offering, and so on) requires a separate conversation. Please reach out via @rkcevE on X or open a GitHub issue first. Personal use, freelance work, and internal use within an organization are not restricted.

This repository is a reference implementation of techniques described in Geometric Convergence for Conversational Context Management: A Distributed Structured Memory Architecture Based on Correlation-Diagram Data (DOI: 10.5281/zenodo.19354705), authored by the project author and made available on Zenodo. A corresponding patent application is pending in Japan (filed 2026).

What this code implements

  • Local-device creation and update of correlation-map data
  • Tree structuring with sun / planet / satellite nodes
  • A consistency-maintenance step (simplified here as schema validation)

What this code does not implement

  • The mass-aware attention bias (the +wM term in the paper). This modification operates inside the model's attention computation and cannot be implemented from outside the model — Claude API does not expose attention-score injection. That component lives in a separate Gemma + LoRA implementation outside this repository.

13. License

The source code in this repository is released under the MIT License (see LICENSE).

The technology this code implements is patent pending in Japan. The MIT License grants rights to the source code only; it does not grant a patent license.

| Use case | Restriction |
|---|---|
| Personal use, academic research, education | None |
| Freelance work for clients | None |
| Internal company use, employee productivity tooling | None |
| Productizing, SaaS hosting, redistributing as part of a paid offering | Please reach out first |

For commercial-use inquiries, contact @rkcevE on X or open a GitHub issue.


14. Citation

If you use this in research or publications, citations are welcome.

To cite this implementation:

@misc{kawai2026cms,
  author    = {Kawai, Ryosuke},
  title     = {claude-code-cms: A reference implementation of structured
               conversation memory for Claude Code},
  year      = {2026},
  publisher = {GitHub},
  url       = {https://github.com/rkceve/claude-code-cms}
}

To cite the underlying paper:

@misc{kawai2026geometric,
  author    = {Kawai, Ryosuke},
  title     = {Geometric Convergence for Conversational Context Management:
               A Distributed Structured Memory Architecture Based on
               Correlation-Diagram Data},
  year      = {2026},
  publisher = {Zenodo},
  doi       = {10.5281/zenodo.19354705},
  url       = {https://doi.org/10.5281/zenodo.19354705}
}

15. Acknowledgements

  • Anthropic — for Claude Code CLI and the Claude Sonnet / Haiku models this project depends on

16. Contributing

Issues and pull requests are welcome. Help is especially appreciated in these areas:

  • End-to-end verification of the anthropic_sdk provider mode against a real API key

  • Linux and macOS testing (development happens on Windows)

  • Industry collaboration on the underlying paper — please reach out if your organization is interested in joint research

17. Repository Name

Named after sanma (秋刀魚, Pacific saury) — a fish known for being remembered fondly even after the season ends.
