GhostKG - Dynamic Knowledge Graphs with Memory Decay for LLM Agents

Enable your AI agents to remember, forget, and evolve their knowledge naturally

GhostKG is a Python package that provides dynamic knowledge graph management for LLM agents with built-in memory decay using FSRS (Free Spaced Repetition Scheduler). It enables agents to maintain temporal, evolving knowledge bases that naturally forget information over time.

✨ Features

🧠 Dynamic Knowledge Graphs: Maintain structured knowledge as semantic triplets (subject-relation-object)
📉 Memory Decay: FSRS-based spaced repetition models realistic memory retention and forgetting
⏰ Time-Aware: Full support for temporal simulation and time-based queries
💭 Sentiment Tracking: Emotional associations with knowledge (-1.0 to 1.0 scale)
👥 Multi-Agent Support: Manage multiple agents with separate knowledge bases
🔌 Flexible Integration: Use with any LLM (GPT-4, Claude, Ollama, etc.)
🎯 External API: Decouple KG management from LLM logic for maximum flexibility
⚡ Fast Mode: Optional GLiNER+TextBlob for quick extraction without LLM calls
💾 Database Flexibility: Works with existing SQLite databases, preserving other tables
📊 Interactive Visualization: Web-based visualization of knowledge graph evolution over time
🗄️ Multi-Database Support: SQLite, PostgreSQL, and MySQL support with configurable connection pools

Installation

Using UV (Recommended)

UV is a fast Python package manager. Install it first:

# Install uv
curl -LsSf https://astral.sh/uv/install.sh | sh  # Unix/macOS
# Or: pip install uv

# Clone and install GhostKG
git clone https://github.com/GiulioRossetti/GhostKG.git
cd GhostKG

# Base installation (core features)
uv pip install -e .

# With LLM support (Ollama)
uv pip install -e ".[llm]"

# With fast mode (GLiNER + TextBlob)
uv pip install -e ".[fast]"

# With visualization
uv pip install -e ".[viz]"

# With all features
uv pip install -e ".[all]"

# For development
uv pip install -e ".[dev]"

See UV Setup Guide for detailed instructions.

Using pip

# From PyPI (when published)
pip install ghost_kg

# From source
git clone https://github.com/GiulioRossetti/GhostKG.git
cd GhostKG
pip install -e .

# With optional features
pip install -e ".[llm]"    # LLM support
pip install -e ".[fast]"   # Fast mode
pip install -e ".[viz]"    # Visualization server
pip install -e ".[all]"    # All features

Development Mode

For development without full package installation, use the ghostkg_dev.py script:

# Clone the repository
git clone https://github.com/GiulioRossetti/GhostKG.git
cd GhostKG

# Install minimal dependencies
pip install -r requirements/base.txt
pip install flask  # for visualization server

# Use dev script (no package installation needed)
python ghostkg_dev.py --help
python ghostkg_dev.py export --database mydb.db --serve --browser
python ghostkg_dev.py serve --json history.json --browser

After installing the package in editable mode (pip install -e .), use the regular ghostkg command instead.

Quick Start

Option 1: External Program API (Recommended)

Use GhostKG as a library to manage agent KGs while keeping full control over LLM and business logic:

from ghost_kg import AgentManager, Rating
import datetime

# Initialize manager
manager = AgentManager(db_path="agents.db")

# Create agents
alice = manager.create_agent("Alice")
bob = manager.create_agent("Bob")

# Set time
current_time = datetime.datetime.now(datetime.timezone.utc)
manager.set_agent_time("Alice", current_time)

# Add knowledge
manager.learn_triplet("Alice", "I", "support", "UBI", rating=Rating.Easy, sentiment=0.8)

# Process incoming content
triplets = [("Bob", "mentions", "automation")]
manager.absorb_content("Alice", "Bob mentioned automation", author="Bob", triplets=triplets)

# Get context for reply
context = manager.get_context("Alice", topic="UBI")
# Use context with your LLM to generate response
response = your_llm.generate(context)

# Update KG with response
response_triplets = [("value", "economic security", 0.7)]
manager.update_with_response("Alice", response, triplets=response_triplets)

See API Documentation for complete details.

Option 2: Integrated LLM Approach

Use GhostKG with built-in Ollama LLM integration:

from ghost_kg import GhostAgent, CognitiveLoop
import datetime

# Create agent
agent = GhostAgent("Alice", db_path="agent.db")
loop = CognitiveLoop(agent)

# Set time
agent.set_time(datetime.datetime.now(datetime.timezone.utc))

# Agent reads and processes content
loop.absorb("I think UBI is necessary.", author="Bob")

# Agent generates reply
response = loop.reply("UBI", partner_name="Bob")

Option 3: Commercial LLM Providers

Use OpenAI, Anthropic, Google, or Cohere with GhostKG:

from ghost_kg import GhostAgent, CognitiveLoop
from ghost_kg.llm import get_llm_service
import datetime

# Create LLM service (OpenAI, Anthropic, Google, Cohere)
llm_service = get_llm_service("openai", "gpt-4")  # or "anthropic", "google", etc.

# Create agent with commercial LLM
agent = GhostAgent("Alice", db_path="agent.db", llm_service=llm_service)
loop = CognitiveLoop(agent, model="gpt-4")

# Use like Option 2, but with GPT-4/Claude/etc.
agent.set_time(datetime.datetime.now(datetime.timezone.utc))
loop.absorb("I think UBI is necessary.", author="Bob")
response = loop.reply("UBI", partner_name="Bob")

📖 See Usage Guide for detailed examples of all usage modes and LLM Providers for provider setup.

Privacy & Storage Control

GhostKG provides fine-grained control over what data is stored in logs:

# Privacy-friendly: Store UUID instead of content (default)
manager = AgentManager(db_path="agents.db", store_log_content=False)

# Or: Store full content in logs
manager = AgentManager(db_path="agents.db", store_log_content=True)

Default behavior (store_log_content=False):

Content text is NOT stored in the database
A UUID is generated and stored instead
More privacy-friendly and reduces storage requirements

User-specified UUIDs: You can optionally provide your own UUID when content is not stored:

import uuid

db = KnowledgeDB(db_path="agents.db", store_log_content=False)

# Provide your own UUID for tracking
my_uuid = str(uuid.uuid4())
db.log_interaction(
    agent="Alice",
    action="READ",
    content="sensitive content",
    annotations={},
    content_uuid=my_uuid  # Use your own UUID
)

# Or let the system auto-generate one (default)
auto_uuid = db.log_interaction(
    agent="Bob",
    action="WRITE",
    content="another message",
    annotations={}
)

When to use store_log_content=True:

When you need full audit trails of all interactions
For debugging and analysis purposes
When privacy is not a concern

Use Cases

Multi-Agent Simulations: Model conversations between agents with evolving beliefs
Chatbot Memory: Give chatbots persistent, time-aware memory that naturally decays
Research Assistants: Track what an AI assistant knows and when it learned it
Game NPCs: Create NPCs with realistic memory and knowledge evolution
Social Simulations: Model opinion dynamics and information spread

Key Concepts

Knowledge Triplets

Knowledge is stored as semantic triplets:

Subject: The source node (e.g., "I", "Bob", "automation")
Relation: The relationship (e.g., "supports", "mentions", "affects")
Object: The target node (e.g., "UBI", "jobs", "economy")

Memory Decay (FSRS)

GhostKG uses FSRS v6 for realistic memory modeling:

Stability: How well information is retained
Difficulty: How hard it is to recall
Retrievability: Current probability of recall based on elapsed time
Ratings: Easy (4), Good (3), Hard (2), Again (1)

Time Management

All operations are time-aware:

import datetime

# Set simulation time
time = datetime.datetime(2025, 1, 1, 9, 0, 0, tzinfo=datetime.timezone.utc)
manager.set_agent_time("Alice", time)

# Advance time
from datetime import timedelta
time += timedelta(hours=1)
manager.set_agent_time("Alice", time)

Architecture

┌─────────────────────────────────────┐
│   Your Application/LLM Integration   │
│  (Business Logic, Text Generation)   │
└──────────────┬──────────────────────┘
               │
               ▼
┌─────────────────────────────────────┐
│         GhostKG (AgentManager)      │
│  ┌─────────────────────────────┐   │
│  │   Agent Management          │   │
│  │   • Create/retrieve agents  │   │
│  │   • Update/query KGs        │   │
│  └─────────────────────────────┘   │
│  ┌─────────────────────────────┐   │
│  │   Knowledge Graph Storage   │   │
│  │   • Triplet management      │   │
│  │   • Semantic filtering      │   │
│  └─────────────────────────────┘   │
│  ┌─────────────────────────────┐   │
│  │   Memory System (FSRS)      │   │
│  │   • Spaced repetition       │   │
│  │   • Forgetting curves       │   │
│  └─────────────────────────────┘   │
└──────────────┬──────────────────────┘
               │
               ▼
       ┌──────────────┐
       │   SQLite DB  │
       └──────────────┘

📊 Visualization

GhostKG includes an interactive web-based visualization for exploring knowledge graph evolution:

# After installation
ghostkg export --database agent_memory.db --serve --browser

# Development mode (without installation)
python ghostkg_dev.py export --database agent_memory.db --serve --browser

# Or export first, then serve
ghostkg export --database agent_memory.db --output history.json
ghostkg serve --json history.json --browser

Or export first, then serve

ghostkg export --database agent_memory.db --output history.json
ghostkg serve --json history.json --browser

Features:

Interactive Force-Directed Graph: D3.js visualization with zoom, pan, and drag
Temporal Playback: Step through agent interactions chronologically
Multi-Agent View: Switch between individual agents or consolidated view
Memory Heatmap: Node colors represent FSRS retrievability (forgetting curve)
Playback Controls: Play/pause, speed adjustment, timeline scrubbing

See Visualization Guide for detailed documentation.

📚 Documentation

Comprehensive documentation is available in the docs/ directory:

Getting Started

📖 Usage Guide - Complete guide to different usage modes (NEW!)
- Mode 1: External API (no internal LLM)
- Mode 2: Integrated LLM (Ollama self-hosted)
- Mode 3: Integrated LLM (OpenAI/Anthropic/etc.)
- Mode 4: Hybrid mode (external LLM + KG management)
🔌 LLM Providers - Multi-provider LLM setup guide

Core Documentation

📑 Documentation Index - Complete documentation overview with cross-references
🏗️ Architecture & Design - System architecture and design philosophy
🔧 Core Components - Detailed component specifications
📐 Algorithms & Formulas - Mathematical foundations and FSRS details
💾 Database Schema - Complete schema and query patterns
📊 Visualization Guide - Interactive visualization and CLI tools
🔌 API Reference - External API documentation
⚡ Fast Mode Guide - Fast vs LLM extraction modes
📚 Implementation Summaries - Detailed feature implementation histories

Quick Links

Getting Started - New user guide
Integration Patterns - How to integrate with your app
Memory Decay Math - Understanding FSRS

💡 Examples

Working code examples in the examples/ directory:

multi_provider_llm.py - Using different LLM providers (Ollama, OpenAI, Anthropic, Google, Cohere) (NEW!)
use_case_example.py - Two agents multi-round communication with configurable extraction modes
external_program.py - Complete example of external program integration
hourly_simulation.py - Time-based agent conversation simulation
export_history.py - Export and analyze knowledge graph history

📋 Requirements

Python >= 3.8
networkx >= 3.0
fsrs >= 1.0.0
ollama >= 0.1.6 (optional, for integrated LLM approach)
gliner (optional, for fast mode)
textblob (optional, for fast mode sentiment analysis)

🔬 Research & Theory

GhostKG is built on solid theoretical foundations:

FSRS v6: Advanced spaced repetition algorithm based on cognitive science
Forgetting Curves: Models natural memory decay over time
Semantic Triplets: Knowledge representation following RDF principles
Temporal Knowledge Graphs: Time-aware graph databases

See Algorithms & Formulas for mathematical details and references.

📊 Performance

Fast Mode: Process ~100 messages/second (no LLM needed)
LLM Mode: Limited by LLM inference speed (~1-5 messages/second)
Query Speed: O(log n) lookups with proper indexing
Storage: ~150 bytes per triplet, ~100 bytes per entity

See Database Schema for optimization details.

🧪 Testing

Run tests with pytest:

pytest tests/

Test files:

test_comprehensive.py - Full workflow tests
test_process_and_get_context.py - API integration tests
test_fast_mode_config.py - Configuration validation

📄 License

MIT License - see LICENSE for details

🎓 Citation

If you use GhostKG in your research, please cite:

@software{ghostkg2025,
  title={GhostKG: Dynamic Knowledge Graphs with Memory Decay for LLM Agents},
  author={Rossetti, Giulio},
  year={2025},
  url={https://github.com/GiulioRossetti/GhostKG}
}

🤝 Contributing

Contributions are welcome! Please:

Check the Architecture to understand the design
Read Core Components to understand the implementation
Add tests for new features
Update documentation
Submit a Pull Request

💬 Support

Documentation: docs/index.md
Issues: GitHub Issues
Examples: examples/ directory
API Reference: docs/API.md

🙏 Acknowledgments

GhostKG builds upon:

FSRS Algorithm by Jarrett Ye and contributors
Spaced Repetition Research by cognitive scientists
Knowledge Graph and RDF standards
SQLite for reliable persistence

Name		Name	Last commit message	Last commit date
Latest commit History 175 Commits
.github		.github
docs		docs
examples		examples
ghost_kg		ghost_kg
requirements		requirements
tests		tests
.coverage		.coverage
.coveragerc		.coveragerc
.flake8		.flake8
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
.pylintrc		.pylintrc
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.md		README.md
ghostkg_dev.py		ghostkg_dev.py
mkdocs.yml		mkdocs.yml
pyproject.toml		pyproject.toml
pytest.ini		pytest.ini
requirements.txt		requirements.txt
setup.py		setup.py
uv.lock		uv.lock

Folders and files

Latest commit

History

Repository files navigation

GhostKG - Dynamic Knowledge Graphs with Memory Decay for LLM Agents

✨ Features

Installation

Using UV (Recommended)

Using pip

Development Mode

Quick Start

Option 1: External Program API (Recommended)

Option 2: Integrated LLM Approach

Option 3: Commercial LLM Providers

Privacy & Storage Control

Use Cases

Key Concepts

Knowledge Triplets

Memory Decay (FSRS)

Time Management

Architecture

📊 Visualization

Or export first, then serve

📚 Documentation

Getting Started

Core Documentation

Quick Links

💡 Examples

📋 Requirements

🔬 Research & Theory

📊 Performance

🧪 Testing

📄 License

🎓 Citation

🤝 Contributing

💬 Support

🙏 Acknowledgments

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages