💡 The Problem: LLMs forget everything when a session ends. Standard memory is either too dumb (linear history) or too expensive (full vector re-indexing).
✅ The Solution:
`context_handover` extracts Semantic Atoms, measures Context Drift, and optimally packs context into new sessions using a Bounded Knapsack Algorithm.
| 🔍 Semantic Extraction | 📊 Drift Detection | 🎯 Smart Packing |
|---|---|---|
| Extract meaningful context units | Measure topic shifts in real time | Optimize token usage mathematically |

| 🔗 DAG Tracking | 🛡️ Enterprise Ready | 📈 Visual Analytics |
|---|---|---|
| Track dependencies across sessions | Circuit breakers, retries, DLQ | Interactive observability dashboard |
```mermaid
graph LR
    A[User Chat] --> B[Atom Extractor]
    B --> C[Atom Registry]
    C --> D[Vector Store]
    D --> E[Drift Detector]
    D --> F[Token Budgeter]
    E --> G[Loss Ledger]
    F --> H[New Session]
    style A fill:#e1f5fe
    style B fill:#fff3e0
    style C fill:#f3e5f5
    style D fill:#e8f5e9
    style E fill:#ffebee
    style F fill:#fff8e1
    style G fill:#fce4ec
    style H fill:#c8e6c9
```
| Approach | Token Efficiency | Semantic Coherence | Latency |
|---|---|---|---|
| ❌ Naive Buffer | Low | Low | Fast ✅ |
| ⚡ Vector Recall | Medium | Medium | Slow |
| ✅ Context Handover | High 🎯 | High 🎯 | Fast ⚡ |
```bash
pip install context-handover

# Optional: for visualizations and vector backends
pip install "context-handover[viz,vector]"
```

```python
from context_handover import SessionManager, SemanticAtom

# Initialize the session manager
manager = SessionManager(session_id="session_001")

# Add a meaningful interaction
manager.add_message(
    role="user",
    content="I want to build a rocket engine using methane."
)
manager.add_message(
    role="assistant",
    content="Understood. Methane (CH4) offers high specific impulse..."
)

# Extract atoms automatically
atoms = manager.extract_atoms()
print(f"Extracted {len(atoms)} semantic atoms.")

# Hand over to a new session, preserving context
new_session_pkg = manager.build_handover_package()
manager.handover_to_new_session("session_002", new_session_pkg)
```

See your context flow, drift, and missing gaps in real time:
```bash
# Launch the interactive dashboard
streamlit run context_observatory.py
```

(Opens a local web dashboard at http://localhost:8501.)
Unlike linear buffers, we treat context as a Directed Acyclic Graph (DAG) of semantic units.
```mermaid
flowchart TD
    subgraph Input["Input Layer"]
        A[User Chat<br/>Raw Text]
    end
    subgraph Processing["Processing Layer"]
        B[Atom Extractor<br/>LLM + Regex]
        C[Atom Registry<br/>Dedup + Embed]
        D[Vector Store<br/>Chroma/Qdrant]
    end
    subgraph Optimization["Optimization Layer"]
        E[Drift Detector<br/>KL + Cosine]
        F[Token Budgeter<br/>Knapsack Algo]
    end
    subgraph Output["Output Layer"]
        G[Loss Ledger<br/>Audit Trail]
        H[New Session<br/>Optimized Prompt]
    end
    A --> B
    B --> C
    C --> D
    D --> E
    D --> F
    E --> G
    F --> H
    style Input fill:#e3f2fd
    style Processing fill:#fff3e0
    style Optimization fill:#f3e5f5
    style Output fill:#e8f5e9
```
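Because atoms form a DAG, dependencies can be replayed in topological order when rebuilding a session. Here is a minimal sketch of that idea; the atom IDs and edges are hypothetical, and the library's internal representation may differ:

```python
from graphlib import TopologicalSorter

# Hypothetical atom dependency edges: each atom maps to the atoms it depends on.
# e.g. the decision "use methane" depends on the fact "methane = CH4".
atom_deps = {
    "fact:methane_ch4": set(),
    "decision:use_methane": {"fact:methane_ch4"},
    "constraint:token_budget": set(),
    "summary:engine_design": {"decision:use_methane", "constraint:token_budget"},
}

# Topological order guarantees every atom appears after its dependencies,
# so a handover prompt never references an atom before introducing it.
order = list(TopologicalSorter(atom_deps).static_order())
print(order)
```

`graphlib.TopologicalSorter` (stdlib, Python 3.9+) also raises `CycleError` if the graph is not acyclic, which is exactly the invariant a session DAG must uphold.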
| Concept | Description | Analogy | Visual |
|---|---|---|---|
| Semantic Atom | Smallest unit of meaningful context | A single Lego brick | 🔷 |
| Session DAG | Tracks atom relationships across sessions | Family tree for chat | 🌳 |
| Drift Metric | Measures topic change since last handover | Compass checking course | 🧭 |
| Knapsack Budget | Selects most valuable atoms for token limit | Packing a suitcase | 🎒 |
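The Knapsack Budget row can be illustrated with a greedy value-density selection. This is a simplification: the library's actual budgeter and its scoring are internal, and all atom names, values, and token counts below are made up:

```python
def pack_atoms(atoms, max_tokens):
    """Greedy value-density knapsack: keep the atoms with the best
    value-per-token ratio until the token budget is exhausted."""
    ranked = sorted(atoms, key=lambda a: a["value"] / a["tokens"], reverse=True)
    kept, used = [], 0
    for atom in ranked:
        if used + atom["tokens"] <= max_tokens:
            kept.append(atom["id"])
            used += atom["tokens"]
    return kept, used

# Hypothetical atoms: (importance score, token cost)
atoms = [
    {"id": "fact:methane_ch4", "value": 9, "tokens": 120},
    {"id": "smalltalk:greeting", "value": 1, "tokens": 80},
    {"id": "decision:use_methane", "value": 8, "tokens": 200},
    {"id": "log:full_transcript", "value": 6, "tokens": 3900},
]

kept, used = pack_atoms(atoms, max_tokens=4096)
print(kept, used)
```

Note how the bulky full transcript gets dropped even though it has a decent score: its value per token is poor, which is the whole point of packing by density rather than by raw importance.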
```mermaid
sequenceDiagram
    participant User
    participant Manager
    participant Extractor
    participant Registry
    participant Vector
    User->>Manager: Add Message
    Manager->>Extractor: Trigger Extraction
    Extractor->>Extractor: Parse LLM + Regex
    Extractor->>Registry: Submit Atoms
    Registry->>Registry: Deduplicate
    Registry->>Vector: Generate Embeddings
    Vector-->>Manager: Confirmation
    Manager-->>User: Done
```
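The drift detector combines KL divergence and cosine distance. As an intuition pump, here is cosine drift between two embedding centroids, using toy 3-dimensional vectors; real embeddings have hundreds of dimensions, and the library's exact formula is not reproduced here:

```python
import math

def centroid(vectors):
    """Element-wise mean of a list of equal-length vectors."""
    n = len(vectors)
    return [sum(v[i] for v in vectors) / n for i in range(len(vectors[0]))]

def cosine_drift(old_vecs, new_vecs):
    """1 - cosine similarity between the centroids of two embedding sets.
    Near 0.0 means the topic is stable; values toward 1.0 mean a sharp shift."""
    a, b = centroid(old_vecs), centroid(new_vecs)
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(x * x for x in b))
    return 1.0 - dot / norm

# Toy embeddings: the session starts on rocketry, then shifts topic entirely.
rocketry = [[0.9, 0.1, 0.0], [0.8, 0.2, 0.1]]
cooking = [[0.1, 0.9, 0.2], [0.0, 0.8, 0.3]]

print(round(cosine_drift(rocketry, rocketry[:1]), 3))  # same topic -> near 0
print(round(cosine_drift(rocketry, cooking), 3))       # topic shift -> large
```

A threshold on this value (like the `drift_threshold: 0.5` in the config) is what turns a continuous drift score into a "time to hand over" signal.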
Don't fly blind. Use our built-in Context Observatory to debug and monitor your agent's memory.
- Session DAG Map: Interactive graph of atom dependencies.
- Drift Thermometer: Real-time gauge of semantic shift.
- Token Knapsack: Visualizes which atoms were kept vs. dropped due to budget.
- Semantic Space: 2D clustering of your conversation topics.
- Integrity Gaps: Heatmap showing missing data or broken dependencies.
```text
+---------------------------------------------------------------+
| CONTEXT OBSERVATORY [Session: 8a7f...] [Refresh] |
+---------------------------------------------------------------+
| [DAG MAP] | [DRIFT METRICS] | [TOKEN BUDGET] |
| | | |
| (O) Fact | Gauge: 0.23 (OK) | Used: 3.2k/4k |
| | \ | Trend: ↗ Rising | |
| (D) Dec ----> | [||||||....] | [####][ ][#] |
| | \ | | Kept Dropped |
| (C) Con | KL: 0.12 | |
| | Jaccard: 0.45 | |
+-------------------+-----------------------+-------------------+
| [SEMANTIC SPACE] | [INTEGRITY GAPS] |
| | |
| * * (Cluster A) | Time ▶ |
| * | Topic 1 [||||||] OK |
| (Cluster B) * * | Topic 2 [||....] GAP! |
| | Topic 3 [||||||] OK |
+-----------------------------------+---------------------------+
```
| Feature | Description | Status |
|---|---|---|
| 🔄 Idempotency | Duplicate events auto-detected & ignored | ✅ Active |
| 🔁 Smart Retries | Exponential backoff for LLM/Redis failures | ✅ Active |
| ⚡ Circuit Breakers | Prevents cascading failures | ✅ Active |
| 📦 Dead Letter Queue | Failed events saved for replay | ✅ Active |
| 🔒 PII Ready | Redaction & encryption hooks | 🔧 Configurable |
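The Smart Retries row follows the standard exponential-backoff-with-jitter pattern, sketched below. This is a generic illustration, not the library's actual internals; the delays and attempt counts are illustrative:

```python
import random
import time

def with_retries(fn, max_attempts=4, base_delay=0.5):
    """Call fn, retrying on failure with exponential backoff plus jitter."""
    for attempt in range(max_attempts):
        try:
            return fn()
        except Exception:
            if attempt == max_attempts - 1:
                raise  # out of attempts: let the caller (or a DLQ) handle it
            # 0.5s, 1s, 2s, ... plus up to 100ms of jitter to avoid
            # synchronized retry storms against the LLM/Redis backend
            time.sleep(base_delay * (2 ** attempt) + random.uniform(0, 0.1))

# Usage: a flaky call that succeeds on the third attempt
calls = {"n": 0}
def flaky():
    calls["n"] += 1
    if calls["n"] < 3:
        raise ConnectionError("transient failure")
    return "ok"

print(with_retries(flaky, base_delay=0.01))
```

Pairing this with a circuit breaker (stop calling a backend that keeps failing) and a dead letter queue (persist what still fails after the last retry) gives the full resilience story from the table above.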
```mermaid
xychart-beta
    title "Token Efficiency vs Semantic Coherence"
    x-axis ["Naive Buffer", "Vector Recall", "Context Handover"]
    y-axis "Score (0-10)" 0 --> 10
    bar [3, 6, 9]
    line [3, 6, 9]
```
| Metric | Naive Buffer | Vector Recall | Context Handover |
|---|---|---|---|
| Token Efficiency | Low | Medium | High 🎯 |
| Semantic Coherence | Low | Medium | High 🎯 |
| Latency Overhead | None ✅ | High ~200ms | Low ~40ms ⚡ |
| Auditability | None ❌ | Low | Full ✅ |
Works seamlessly with your existing stack:
```mermaid
flowchart LR
    A[Context Handover] --> B[LangChain]
    A --> C[LlamaIndex]
    A --> D[AutoGen]
    A --> E[LangGraph]
    A --> F[OpenTelemetry]
    A --> G[Langfuse]
    style A fill:#4CAF50,color:white
    style B fill:#2196F3,color:white
    style C fill:#FF9800,color:white
    style D fill:#9C27B0,color:white
    style E fill:#FF5722,color:white
    style F fill:#00BCD4,color:white
    style G fill:#E91E63,color:white
```
| Framework | Integration Type | Status |
|---|---|---|
| LangChain | Custom Memory Module | ✅ Ready |
| LlamaIndex | Node Parser | ✅ Ready |
| AutoGen/LangGraph | State Handover | ✅ Ready |
| OpenTelemetry | Native Tracing | ✅ Ready |
| Langfuse | Observability | ✅ Ready |
```python
# Example: LangChain integration
from context_handover.integrations.langchain import HandoverMemory

memory = HandoverMemory(session_id="langchain_01")
memory.save_context({"input": "Hi"}, {"output": "Hello!"})
```

Create a `config.yaml` to tune behavior:
```yaml
pipeline:
  max_tokens: 4096
  drift_threshold: 0.5
  knapsack_strategy: "value_density"  # or 'greedy'

storage:
  backend: "chromadb"  # or 'qdrant', 'memory'
  path: "./data/atoms"

observability:
  tracing: true
  metrics_export: "otel"
```

| Category | Option | Default | Description |
|---|---|---|---|
| Pipeline | `max_tokens` | 4096 | Maximum token budget for handover |
| Pipeline | `drift_threshold` | 0.5 | Sensitivity of context drift detection |
| Pipeline | `knapsack_strategy` | `value_density` | Optimization strategy |
| Storage | `backend` | `chromadb` | Vector store backend |
| Storage | `path` | `./data/atoms` | Local storage path |
| Observability | `tracing` | `true` | Enable distributed tracing |
| Observability | `metrics_export` | `otel` | Metrics export format |
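Conceptually, the defaults above act as a base config that any user-supplied values override, section by section. A sketch of that precedence (the library's actual config loader may behave differently; this only mirrors the documented defaults):

```python
# Defaults mirroring the table above
DEFAULTS = {
    "pipeline": {"max_tokens": 4096, "drift_threshold": 0.5,
                 "knapsack_strategy": "value_density"},
    "storage": {"backend": "chromadb", "path": "./data/atoms"},
    "observability": {"tracing": True, "metrics_export": "otel"},
}

def merge_config(defaults, overrides):
    """Merge user overrides onto defaults, one section at a time."""
    merged = {}
    for section, values in defaults.items():
        merged[section] = {**values, **overrides.get(section, {})}
    return merged

# e.g. a config.yaml that only raises the drift threshold
user_cfg = {"pipeline": {"drift_threshold": 0.7}}
cfg = merge_config(DEFAULTS, user_cfg)
print(cfg["pipeline"])  # drift_threshold overridden, other defaults kept
```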
The comprehensive guide to understanding and using Context Handover.
The User Guide includes:
- ✅ 5-minute quick start walkthrough
- ✅ Deep dive into Semantic Atoms lifecycle
- ✅ Architecture diagrams explained
- ✅ Advanced tuning for Drift & Budgets
- ✅ Visualization dashboard guide
- ✅ API reference
- ✅ Troubleshooting and best practices
- ✅ Real-world examples
Ready-to-run code snippets for common use cases.
```bash
# Run the demo
python examples/run_demo.py

# Run the benchmark suite
python examples/benchmark.py
```

We welcome contributions! Check out our Improvement Plan for open tasks.
```mermaid
graph LR
    A[Fork Repo] --> B[Create Branch]
    B --> C[Make Changes]
    C --> D[Commit]
    D --> E[Push]
    E --> F[Open PR]
    style A fill:#4CAF50,color:white
    style B fill:#2196F3,color:white
    style C fill:#FF9800,color:white
    style D fill:#9C27B0,color:white
    style E fill:#00BCD4,color:white
    style F fill:#E91E63,color:white
```
```bash
# Install dependencies
poetry install

# Install pre-commit hooks
pre-commit install

# Run tests
pytest tests/ -v
```

| Step | Command | Purpose |
|---|---|---|
| 1 | `poetry install` | Install all dependencies |
| 2 | `pre-commit install` | Set up git hooks |
| 3 | `pytest tests/ -v` | Run test suite |
MIT License - see LICENSE for details.
Built with ❤️ for the future of agentic memory. 🐱