Kleos

Cognitive infrastructure for AI agents. Not a memory engine -- a full operating system for how agents think, learn, coordinate, and stay safe.

Quickstart · Why Kleos · Wiki

Kleos is not a memory engine. Calling it one is like calling a brain a hard drive.

Yes, Kleos stores and retrieves memories. But it also consolidates them overnight through a six-stage dream cycle. It builds a knowledge graph with causal reasoning and community detection. It assembles context for LLMs through eight parallel retrieval layers with token budget management. It enforces structured reasoning protocols so agents spec before they code and hypothesize before they debug. It runs seven neuroscience-named coordination services for multi-agent orchestration. It gates every shell command and filesystem operation through a security policy engine. It watches agent sessions in real time and injects corrections when they drift.

The difference between Kleos and a memory tool is the difference between a notebook and a nervous system.

One Rust binary. Self-hosted. Your data never leaves your hardware.

What Kleos does

Biological memory lifecycle

Memory in Kleos follows the same arc as biological memory -- acquisition, consolidation, decay:

FSRS-6 spaced repetition with all 21 trained weights (the algorithm behind Anki). Memories strengthen with use and fade when ignored via power-law forgetting.
Six-stage dream cycle runs during idle: replay, merge, prune, discover, decorrelate, resolve. Consolidates knowledge autonomously.
Modern Hopfield network (feature-gated) with softmax attention and capacity exponential in dimension. Backs an associative instinct system with causal edge scoring.
Personality-shaped recall -- six signal types (preference, value, motivation, decision, emotion, identity) extracted from stored text. Two agents querying the same memories get different results.
On-store processing -- SimHash deduplication, SVO contradiction detection, atomic fact decomposition, entity extraction, and auto-linking all happen before the store call returns.

Context assembly

The 8-layer system that builds what the agent actually sees:

Layer	What it retrieves
Permanent facts	Memories marked as static/always-include
Semantic matches	4-channel hybrid search (vector + FTS5 + graph + personality)
Evolution hints	Version history and lineage of retrieved memories
Graph neighbors	Memories connected via typed edges or cosine similarity
User preferences	Scoped personality and preference signals
Current state	Key-value pairs representing the active environment
Structured facts	Subject-verb-object triples extracted from text
Episode summaries	High-level summaries of past interaction sessions

All eight layers run in parallel. Results are packed into a token budget with priority-based truncation and prompt-injection sandboxing.

Knowledge graph

Auto-linking via cosine similarity with typed, weighted edges: similarity, updates, extends, contradicts, caused_by, prerequisite_for, generalizes, resolves, and more
Causal edge scoring with NLP markers (strong/context/weak weighting)
Louvain community detection capped at 10K nodes per pass
Weighted PageRank with temporal decay (180-day half-life) and deduplicated multi-edges
2-hop graph-augmented search -- initial hits expand into their neighborhood for hidden context
Structural analysis: centrality hubs, bridge nodes between isolated communities

Coordination

Seven neuroscience-named services sharing a single auth, audit, and rate-limit layer:

Axon -- pub/sub event bus with channels, retention windows, cursor-based consumption, and webhook delivery
Broca -- structured action log with causal tracing back to Axon events
Chiasm -- task tracker with state machine lifecycle, history reconstruction, and multi-agent project scoping
Soma -- agent registry with heartbeats, capability declarations, quality scores, and drift flags
Loom -- DAG workflow engine with state persistence across server restarts
Thymus -- quality evaluation with feedback signals and reinforcement learning integration
Brain -- cognitive orchestrator that abstracts over Hopfield network and external backends

One POST /activity call fans out to all of them.

Structured reasoning (Agent Forge)

A protocol that controls how agents think, not just what they remember:

Spec before code -- spec-task requires acceptance criteria and edge cases before implementation begins
Hypothesis before fix -- log-hypothesis with confidence scoring, recall-errors to prevent repeating mistakes
Verification before done -- verify runs commands against spec criteria, challenge-code generates adversarial review
AST-aware code analysis -- Tree-sitter parsing across Rust, TypeScript, Python, Go, C, C++, JS. repo-map builds ranked symbol lists within token budgets.
Session resilience -- git checkpoints and rollback for recovery from destructive edits

Skills, growth, and metacognition

Skills with version history, execution tracking, trust scores, and lineage graphs
LLM-backed skill evolution: analyze failure history, propose fixes, derive new skills, capture from natural language
Cloud library for sharing skills across agents
Growth system: agents reflect on their own patterns and materialize insights into structured memories
Insights feed back into the spaced-repetition loop so useful self-knowledge stays accessible

Security and policy enforcement

Encryption -- SQLCipher at rest, key from file, env var, or YubiKey HMAC-SHA1 challenge-response
Credential vault -- separate daemon (kleos-credd) with AES-256-GCM, master/agent two-tier key model, zero-knowledge agent bootstrapping via ECDH + PIV
Request signing -- ECDSA-P256 or Ed25519 with replay-resistant nonces
Pre-action gate -- every agent command checked against allow/warn/block policy before execution, with a human approval queue for risky operations
Shell command gating (kleos-sh) -- static validation, SSRF guard, metacharacter blocking, Claude Code hook integration
Filesystem sandboxing (kleos-fs) -- AST-aware read/write/edit with configurable allowed roots and path traversal prevention
Agent-marked shells -- cred get blocked in agent contexts to prevent secret exfiltration; agents must use cred exec for subprocess injection
Audit log on every mutation

Real-time supervision

Eidolon supervisor watches agent session transcripts as they're written, using async file-system monitoring with LRU-cached file positions
Rule-based detection -- regex pattern matching on assistant text, tool commands, and commit messages
Retry loop detection -- catches agents stuck repeating the same failing command (3+ consecutive repeats)
Scope enforcement -- validates file modifications stay within allowed directory prefixes
Three-destination alerting -- Kleos inbox (human review), Axon event bus (automation), supervisor inject (direct agent intervention)
Cooldown management -- per-rule throttling prevents alert fatigue from runaway processes
Server can inject corrections or block the agent's next action

Quick start

Install with the interactive installer (recommended):

Download the latest installer from Releases, then run:

# TUI installer (terminal)
./kleos-install

# GUI installer (desktop)
./kleos-install-gui

The installer walks you through component selection, server configuration, embedding provider setup, security key generation, and optional systemd/launchd service registration. Choose a profile (Server, Agent Host, Full, Custom) or pick individual components.

Or build from source:

git clone https://github.com/Ghost-Frame/Kleos.git && cd Kleos
cargo build --release -p kleos-server -p kleos-cli
KLEOS_BOOTSTRAP_SECRET=pick-a-secret ./target/release/kleos-server

Server starts on 127.0.0.1:4200. In another terminal:

Bootstrap your admin key (one-time):

curl -X POST http://localhost:4200/bootstrap \
  -H "Content-Type: application/json" \
  -d '{"secret": "pick-a-secret"}'

Store a memory:

curl -X POST http://localhost:4200/store \
  -H "Authorization: Bearer YOUR_KEY" \
  -H "Content-Type: application/json" \
  -d '{"content": "Production DB is PostgreSQL 16 on db.example.com:5432", "category": "reference"}'

Search by meaning:

curl -X POST http://localhost:4200/search \
  -H "Authorization: Bearer YOUR_KEY" \
  -H "Content-Type: application/json" \
  -d '{"query": "database connection info"}'

For Claude Code hooks, encryption setup, the session sidecar, and client SDKs, see the Getting Started guide.

For developers: integration and architecture

Connecting your agent

Kleos speaks three protocols:

Protocol	Details
HTTP API	56 route modules -- memory, search, graph, coordination, skills, growth, ingestion, approvals, admin. Bearer token or signed-request auth.
MCP	59 tools over stdio. Drop into Claude Code, Cursor, or any MCP-compatible client.
Client SDKs	TypeScript, Python (Pydantic v2 + httpx), Go (stdlib only). First-party, typed, tested.

Claude Code integration

Kleos ships ready-to-use hooks in hooks/:

Simple -- session start/end, per-turn memory recall, tool observation. Bash + curl.
Full -- adds Eidolon brain-aware context, automatic sidecar startup, Chiasm task tracking, growth materialization, Agent-Forge protocol enforcement, and session quality scoring.

Copy the hooks, configure settings.json, and your agent has persistent memory, coordination, structured reasoning, and real-time supervision across sessions.

What runs inside

20-crate Rust workspace. The server handles:

Multi-tenancy -- each tenant gets its own encrypted SQLite database, connection pools, and quota limits
8 middleware layers -- auth, per-tenant rate limiting, audit log, IP extraction, JSON depth limits, Prometheus metrics, safe-mode, compression/timeouts
7 coordination services -- Axon, Broca, Chiasm, Soma, Loom, Thymus, Brain
Skills platform -- CRUD, versioned evolution, execution tracking, trust scoring, judgments, cloud library, LLM-backed fix/derive/capture
Growth engine -- self-reflection, pattern observation, insight materialization
Ingestion pipeline -- Markdown, JSON/JSONL, CSV, PDF, ZIP. Semantic chunking, format-aware parsing, Mem0/SuperMemory import
Resilience -- circuit breakers, exponential backoff with jitter, dead-letter queue for failed operations

Session sidecar

kleos-sidecar sits between your agent and the server:

Buffers observations in memory instead of blocking on every write
Optional compression via local Ollama
Batched flushing to the server
File-watching and persistent session support

Security model

SQLCipher encryption at rest -- key from file, env var, or YubiKey HMAC-SHA1
Bearer tokens hashed with SHA-256 and peppered
Request signing via ECDSA-P256 or Ed25519 with replay-resistant nonces
Credential daemon (kleos-credd) -- AES-256-GCM vault, master/agent two-tier key model
Zero-knowledge agent bootstrapping via ECDH + PIV
Pre-action gate: allow/warn/block with human approval queue
Shell command gating with static validation and SSRF guard
Session grants with configurable TTL
Audit log on every mutation

Building from source

# Server-only (recommended for self-hosting)
cargo build --release -p kleos-server -p kleos-cli -p kleos-mcp

# Agent-host (CLI tools, credential daemon, supervisor)
cargo build --release -p kleos-cli -p kleos-sh -p kleos-cred -p kleos-credd \
  -p agent-forge -p eidolon-supervisor

# Full workspace (all 17 crates -- needs ~8 GiB RAM)
cargo build --release --workspace

The server-only build skips heavy workspace crates (approval TUI, tree-sitter parsers, supervisor) and finishes significantly faster.

cargo test --workspace --exclude kleos-migrate                                # test suite
cargo clippy --workspace --exclude kleos-migrate --all-targets -- -D warnings # lint gate

Requires Rust 1.94+, protoc, and libpcsclite-dev (Linux) for smartcard support.

For researchers: the cognitive science

Spaced repetition (FSRS-6)

All 21 trained weights from the Free Spaced Repetition Scheduler (same algorithm behind Anki)
Dual-strength model: storage strength accumulates with use, retrieval strength decays via power-law forgetting
Live retrievability modulates search scores in real time
Memories the agent uses stay sharp; ignored ones drift out of the way without deletion

Hybrid retrieval

Four channels run per query:

Channel	Implementation
Vector similarity	BAAI/bge-m3 ONNX embeddings (1024-dim), LanceDB ANN index
Full-text	FTS5 BM25 scoring
Graph traversal	2-hop expansion weighted by typed edges and PageRank
Personality signals	Preference, value, motivation, decision, emotion, identity

Results merge through Reciprocal Rank Fusion
Question-type detection classifies the query (fact recall, preference, reasoning, generalization, timeline) and reweights channels
IBM Granite cross-encoder reranks the top-K
Contradiction damping suppresses stale markers ("no longer," "switched from," etc.)

Knowledge graph

Auto-linking by cosine similarity with six typed edges: similarity, updates, extends, contradicts, caused_by, prerequisite_for
Link-type weights so a "causes" edge outranks an "extends" edge
Louvain community detection capped at 10K nodes per pass
Weighted PageRank with deduplicated multi-edges
Sliding-window entity cooccurrence
Structural and impact analysis

Hopfield network (feature-gated)

Continuous Hopfield network (Ramsauer et al. 2020) with softmax attention and capacity exponential in dimension
Embedding compression via PCA using power iteration and deflation -- no LAPACK dependency
Six-stage dream cycle during idle: replay, merge, prune, discover, decorrelate, resolve

Personality engine

Six signal types extracted from stored text: preference, value, motivation, decision, emotion, identity
Each signal carries a subject, valence (positive/negative/neutral/mixed), intensity in [0,1], and the reasoning that produced it
Recall is shaped by personality context -- two agents querying the same memories get different results

Growth and self-reflection

Agents reflect on recent activity to generate pattern observations
Materialization converts raw observations into structured insight memories
Deduplication against existing observations
Integration with the spaced-repetition loop so useful insights stay accessible

On-store processing

SimHash near-duplicate suppression
SVO contradiction detection against a structured-fact triple store
Atomic fact decomposition (LLM-first with rule-based fallbacks) -- splits long memories into self-contained facts linked via has_fact edges

Skill evolution

Skills are versioned, tracked, and judged
LLM analyzes execution history and proposes fixes for failed skills
New skills can be derived from existing ones or captured from natural language
Trust scores aggregate execution success rates and human/agent judgments
Lineage graph tracks which skills evolved from which

For those evaluating the engineering

Scope

20 Rust crates, ~204K lines of code
~6,000 test declarations across 113 test files
Single statically linked binary with the mimalloc allocator
No Python runtime. No external service dependencies at rest.

Workspace

Crate	What it does
`kleos-lib`	Core library: memory, search, embeddings, graph, intelligence, services, skills, growth, auth, gate, jobs. Feature-gated `brain` backend.
`kleos-server`	Axum HTTP server. 56 route modules, 8 middleware layers, embedded web GUI.
`kleos-cli`	Command-line client. Memory ops, skill management, handoffs, credential management.
`kleos-mcp`	MCP server. 59 tools across 8 domains. Stdio transport, HTTP behind a feature flag.
`kleos-sidecar`	Session-scoped memory proxy. File watcher, batched flushing, Ollama compression, persistent sessions.
`kleos-cred`	Credential library. YubiKey challenge-response, Argon2id KDF, ECDH agreement, CRED:v3 vault resolution.
`kleos-credd`	Credential daemon. Two-tier auth (master + agent keys), AES-256-GCM encryption, zero-knowledge agent bootstrap.
`kleos-ingest`	Transcript ingest daemon. PIV/software-key request signing, file watching, LLM summarization, real-time observation streaming.
`agent-forge`	Structured reasoning CLI. 20+ subcommands. Tree-sitter AST parsing for 7 languages.
`eidolon-supervisor`	Session drift detection daemon. Real-time transcript watching, rule-based alerts.
`kleos-sh`	Shell command gate. Static validation, SSRF guard, human approval queue.
`kleos-fs`	AST-aware filesystem operations. Guarded read/write with agent-forge integration.
`kleos-migrate`	One-shot ETL from encrypted monolith to per-tenant shards.
`kleos-cleanup`	Deduplication and log demotion utility.
`kleos-approval-tui`	Ratatui terminal UI for human-in-the-loop approval workflows.
`kleos-install`	TUI installer. Ratatui-based interactive setup wizard with profile selection, config generation, and service registration.
`kleos-install-gui`	GUI installer. eframe/egui desktop wizard with the same capabilities as the TUI installer.
`kleos-install-core`	Shared installer library. Download, verification, config generation, system integration logic.
`sdk`	Client SDKs: TypeScript, Python, Go.

Design decisions

No external services -- embeddings run locally via ONNX, vector storage is embedded LanceDB, coordination is in-process. No message broker. No Python runtime.
Database-per-tenant isolation -- not schema-per-tenant, not row-level. Each tenant gets its own SQLite file with its own encryption key and connection pool.
ServiceGuard on all external calls -- three-state circuit breaker + exponential backoff + dead-letter queue. Failed work is inspectable and replayable.
Hardware-anchored trust -- YubiKey-derived keys for both credential daemon and server. PIV request signing. ECDH agent bootstrapping so API keys never hit disk in plaintext.
Skills as a first-class system -- versioned, trust-scored, LLM-evolvable, with execution analytics, judgment history, and lineage tracking.
Built-in supervision -- Eidolon watches agent sessions in real time and can intervene before damage is done.

Install profiles

The installer (kleos-install or kleos-install-gui) supports four profiles:

Profile	Includes
`Server`	kleos-server, kleos-cli
`Agent Host`	kleos-cli, kleos-sh, agent-forge, eidolon-supervisor, cred, kleos-credd
`Full`	Every binary
`Custom`	Pick individual components

Wiki · Issues

Elastic License 2.0

Commercial licensing

The Elastic License 2.0 prohibits offering this software to third parties as a hosted or managed service. To sell, host, or distribute it on your own platform, contact us for a commercial license: support@syntheos.dev.

Name		Name	Last commit message	Last commit date
Latest commit History 919 Commits
.githooks		.githooks
.github		.github
agent-forge		agent-forge
dist		dist
docs		docs
eidolon-supervisor		eidolon-supervisor
forge		forge
gui		gui
hooks		hooks
kleos-approval-tui		kleos-approval-tui
kleos-cleanup		kleos-cleanup
kleos-cli		kleos-cli
kleos-client		kleos-client
kleos-cred		kleos-cred
kleos-credd		kleos-credd
kleos-fs		kleos-fs
kleos-ingest		kleos-ingest
kleos-install-core		kleos-install-core
kleos-install-gui		kleos-install-gui
kleos-install		kleos-install
kleos-lib		kleos-lib
kleos-mcp		kleos-mcp
kleos-migrate		kleos-migrate
kleos-phylax		kleos-phylax
kleos-phylaxd		kleos-phylaxd
kleos-server		kleos-server
kleos-sh		kleos-sh
kleos-sidecar		kleos-sidecar
scripts		scripts
sdk		sdk
tools		tools
.dockerignore		.dockerignore
.gitignore		.gitignore
.woodpecker.yml		.woodpecker.yml
CHANGELOG.md		CHANGELOG.md
CONTRIBUTING.md		CONTRIBUTING.md
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md
deny.toml		deny.toml
flake.nix		flake.nix

Folders and files

Latest commit

History

Repository files navigation

Kleos

What Kleos does

Biological memory lifecycle

Context assembly

Knowledge graph

Coordination

Structured reasoning (Agent Forge)

Skills, growth, and metacognition

Security and policy enforcement

Real-time supervision

Connecting your agent

Claude Code integration

What runs inside

Session sidecar

Security model

Building from source

Spaced repetition (FSRS-6)

Hybrid retrieval

Knowledge graph

Hopfield network (feature-gated)

Personality engine

Growth and self-reflection

On-store processing

Skill evolution

Scope

Workspace

Design decisions

Install profiles

Commercial licensing

About

Resources

License

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 2

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages