MemMark

Memory integrity and watermarking toolkit for AI agent long-term memory systems.

MemMark detects memory poisoning, verifies provenance, generates integrity manifests, and embeds cryptographic watermarks in AI agent memory systems — ensuring the memories your agent trusts are actually legitimate.

Documentation: carlos-projects.github.io/memmark

Features

Feature	Description
🏷️ Memory Watermarking	HMAC-SHA256 + PBKDF2 watermarks with entropy salt
🛡️ Poisoning Detection	Configurable pattern-based injection & manipulation detection
🔍 Provenance Tracking	SHA-256 chain hashing with cycle-safe graph analysis
📋 Integrity Manifests	Generate & verify SHA-256 manifests per entry & state
📊 Memory Diff	Compare memory states (added, removed, modified entries)
🔬 Memory Forensics	Temporal, content & source anomaly scoring
📝 Policy Generation	MCPGuard-compatible YAML policies from scan results
🔄 Pluggable Store	`FileMemoryStore`, `InMemoryMemoryStore`, custom backends
🧩 Composable Pipeline	`ScanPipeline` + `ScanStage` for custom analysis workflows
📋 Structured Logging	JSON logging with correlation IDs for pipeline tracing

Installation

pip install memmark-agent

Quick Start

Scan memory for integrity issues

memmark scan memory.json -k my-secret-key

Full example — inject, detect, verify

# Inject watermarks
memmark watermark memory.json --action inject --key my-key -o watermarked.json

# Detect watermarks
memmark watermark watermarked.json --action detect --key my-key

# Integrity manifest
memmark manifest memory.json -o manifest.json

# Verify against manifest
memmark verify memory.json --manifest manifest.json

# Generate MCPGuard policy
memmark generate-policy memory.json -o policy.yaml

Python API

Full scan pipeline

from memmark import run_full_scan

memories = [{"id": "mem-001", "content": "User likes dark mode"}]
result = run_full_scan(memories, watermark_key="my-secret")
for f in result.findings:
    print(f"  [{f.severity}] {f.description}")

Composable pipeline

from memmark import ScanPipeline

pipeline = ScanPipeline.with_default_stages(watermark_key="my-secret")
result = pipeline.run(memories, scan_id="custom-scan")

# Async variant
result = await pipeline.arun(memories)

Custom stages

from memmark import ScanStage, PipelineContext

class CustomStage(ScanStage):
    def run(self, ctx: PipelineContext) -> None:
        # Access ctx.memories, ctx.findings, ctx.metadata
        ...

pipeline = ScanPipeline.with_default_stages(watermark_key="k")
pipeline.add_stage(CustomStage())

MemoryStore backends

from memmark import FileMemoryStore, InMemoryMemoryStore, MemoryScanner

store = FileMemoryStore("memories.json")
memories = store.read()

scanner = MemoryScanner()
memories = scanner.load_memory(store)  # auto-detects MemoryStore

Architecture

CLI (typer)
  └─ ScanPipeline (composable stages)
       ├─ PoisoningStage     — configurable pattern injection/manipulation detection
       ├─ WatermarkStage     — HMAC-SHA256 + PBKDF2 verification
       └─ ForensicsStage     — temporal/content/source anomaly scoring
  └─ WatermarkInjector / WatermarkDetector
  └─ PoisoningDetector / PoisoningClassifier / PoisoningRemediation
  └─ ProvenanceTracker / ProvenanceVerifier / ProvenanceGraph
  └─ IntegrityManifest / MemoryDiff / MemoryForensics
  └─ MCPGuardPolicy
  └─ MemoryStore (FileMemoryStore / InMemoryMemoryStore)

Development

# Install dev + docs dependencies
pip install -e ".[dev,docs]

# Run tests with coverage
make test        # or: python -m pytest tests/ -v

# Lint + type check
make lint        # ruff check src/ tests/
make typecheck   # mypy src/

# Build docs
make serve-docs  # mkdocs serve → localhost:8000

# Build package
make build       # python -m build

# Run pre-commit hooks
make precommit   # pre-commit run --all-files

# Full CI pipeline
make all         # install → lint → typecheck → test → coverage

Ecosystem Integration

Project	Integration
MCPGuard	MemMark generates memory protection policies
MCPscop	MemMark reports consumable by MCPscop dashboard
mcp-taxonomy	Standardized finding classification

Academic Foundation

arXiv:2605.25073 — State-Evolution Attribution Watermarking (Zhang et al.)
arXiv:2605.24941 — Memory-Induced Tool-Drift in LLM Agents (Dabas et al.)
arXiv:2605.25717 — SAMark: Self-Anchored Text Watermarking
MITRE ATLAS — Agent Memory Attack Patterns

License

MIT — See LICENSE.

Author

Carlos-Projects — GitHub

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
.github		.github
docs		docs
examples		examples
src/memmark		src/memmark
tests		tests
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
CHANGELOG.md		CHANGELOG.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
SECURITY.md		SECURITY.md
mkdocs.yml		mkdocs.yml
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MemMark

Features

Installation

Quick Start

Scan memory for integrity issues

Full example — inject, detect, verify

Python API

Full scan pipeline

Composable pipeline

Custom stages

MemoryStore backends

Architecture

Development

Ecosystem Integration

Academic Foundation

License

Author

About

Uh oh!

Releases 2

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

MemMark

Features

Installation

Quick Start

Scan memory for integrity issues

Full example — inject, detect, verify

Python API

Full scan pipeline

Composable pipeline

Custom stages

MemoryStore backends

Architecture

Development

Ecosystem Integration

Academic Foundation

License

Author

About

Topics

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 2

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages