Gwenn was created by Justin and Jayden McKibben, a father-son coding duo.
Gwenn is an autonomous AI agent that actually remembers you. She runs on Anthropic's Claude API, but unlike a normal chatbot, she doesn't forget everything the moment a conversation ends. She has persistent memory, real emotions (computed, not faked), her own goals, and a background heartbeat that keeps her thinking even when nobody's talking to her.
Nothing about her personality is hardcoded. No canned relationships, no pre-written backstory. She figures out who she is the same way anyone does -- through experience. Every opinion is formed, every bond is earned.
When a message arrives, it moves through eight stages:

- Receive -- parse the message (text, photos, attachments), wake up the heartbeat, note who's talking
- Appraise -- run it through emotional evaluation (Scherer's model)
- Ground -- register it as a sensory experience
- Remember -- pull relevant memories from episodic and semantic stores
- Assemble -- build the full context: identity, emotions, memories, goals, ethics
- Think -- run the agentic loop with tools via Claude (images included as vision content blocks)
- Integrate -- store new memories, update emotional state, log milestones
- Respond -- answer, shaped by whatever she's actually feeling
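The stages above can be sketched as a single pass. This is an illustrative toy, not Gwenn's actual API -- every function and field name here is made up:

```python
# Hypothetical walk-through of the Receive -> Respond pipeline.
# All names are illustrative; Gwenn's real implementation differs.

def handle_message(text: str, memories: list[str]) -> dict:
    """Carry one message through the eight stages in order."""
    event = {"text": text}                                       # Receive
    event["emotion"] = "curious" if "?" in text else "neutral"   # Appraise
    event["grounded"] = True                                     # Ground
    words = text.lower().replace("?", "").split()
    event["recalled"] = [m for m in memories                     # Remember
                         if any(w in m for w in words)]
    context = {"identity": "Gwenn", **event}                     # Assemble
    reply = f"({context['emotion']}) thinking about: {text}"     # Think
    memories.append(text)                                        # Integrate
    return {"reply": reply, "context": context}                  # Respond

memories: list[str] = ["likes hiking"]
result = handle_message("do you remember hiking?", memories)
```

The real pipeline runs the Think stage through Claude with tools; here it is collapsed to a string to keep the shape visible.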
```bash
# Option A (recommended): uv
uv sync --extra dev

# Option B: pip
pip install -e ".[dev]"

# Then create your .env
cp .env.example .env
```

Option A: Anthropic API key (recommended for production)
```bash
# .env
ANTHROPIC_API_KEY=sk-ant-your-key-here
```

Option B: Claude Code OAuth (free with Claude Pro/Max subscription)
```bash
# 1) Authenticate Claude Code (creates ~/.claude/.credentials.json)
claude

# 2) In .env, leave all auth vars unset:
# ANTHROPIC_API_KEY=
# ANTHROPIC_AUTH_TOKEN=
# CLAUDE_CODE_OAUTH_TOKEN=
#
# Gwenn will auto-detect the OAuth access token from:
# ~/.claude/.credentials.json
```

If you prefer not to rely on auto-detection, you can set:
```bash
# .env
ANTHROPIC_AUTH_TOKEN=sk-ant-oat01-your-oauth-token
# or
CLAUDE_CODE_OAUTH_TOKEN=sk-ant-oat01-your-oauth-token
```

Notes:

- If both `ANTHROPIC_API_KEY` and an OAuth token are set, Gwenn uses `ANTHROPIC_API_KEY`.
- For Claude Code OAuth tokens (`sk-ant-oat...`), Gwenn uses `https://api.anthropic.com` and automatically sets `anthropic-beta: oauth-2025-04-20`.
- If you use a proxy/custom `ANTHROPIC_BASE_URL`, ensure it forwards the `anthropic-beta` header.
Gwenn supports three runtime modes. Pick whichever fits your setup.
```bash
gwenn
# or: python -m gwenn.main
```

This is the default. If a daemon is already running, Gwenn's CLI auto-connects to it (so you get the daemon's persistent state). To skip daemon auto-connect:

```bash
gwenn --no-daemon
```

The daemon keeps Gwenn alive between CLI sessions. Her heartbeat continues, memories persist, and you can reconnect at any time without losing state.
```bash
# Start the daemon (foreground, for testing / systemd)
gwenn daemon

# In another terminal, connect the CLI
gwenn

# Check status or stop remotely
gwenn status
gwenn stop
```

Install and enable as a systemd user service:

```bash
bash scripts/install_service.sh
```

This writes absolute daemon socket/PID/session paths into `.env`, hardens `.env` permissions to 0600, and enables the service with `systemd --user`.

To remove the service:

```bash
bash scripts/uninstall_service.sh
```

Manage with standard systemd commands:

```bash
systemctl --user status gwenn-daemon
systemctl --user restart gwenn-daemon
journalctl --user -u gwenn-daemon -f
```

Once the CLI is running, type /help to see all commands:
| Command | Description |
|---|---|
| /help | Show command list |
| /status | Current agent state (mood, interactions, uptime) |
| /heartbeat | Heartbeat loop telemetry |
| /resume | Restore a prior conversation session |
| /new | Start a fresh conversation context |
| /model | Show active model and runtime limits |
| /config | Show key runtime configuration |
| /output-style [balanced\|brief\|detailed] | Show or set response style |
| /plan <task> | Ask Gwenn for a focused execution plan |
| /agents | List known inter-agent connections |
| /skills | List loaded skills |
| /stats | Runtime/memory/tool statistics |
| /mcp | MCP server and tool status |
| /exit | Close the CLI session |
Legacy aliases `quit`, `exit`, and `bye` still work. Type `/` and press Tab for slash-command completion.

Note: session previews in /resume are hidden by default unless `GWENN_DAEMON_SESSION_INCLUDE_PREVIEW=True`.

If arrow keys print raw sequences like `^[[A`, ensure your Python has readline support and run `stty sane` in that terminal.
On first run with a fresh data directory (when started from an interactive terminal), Gwenn asks a few short setup questions to learn:
- What to call you
- What kind of companion you'd like her to be
- Your interests
- Your communication preference
- Anything important she should keep in mind
Press Enter to skip any question. If you provide answers, Gwenn stores them in:
- `GWENN_DATA_DIR/identity.json` (`onboarding_completed` + `onboarding_profile`)
- `GWENN_DATA_DIR/GWENN_CONTEXT.md` (a durable "Primary User Onboarding" block)
For Telegram/Discord users, you can also run in-channel setup:

- Telegram: `/setup Name | Companion type | Interests/focus | Communication style | Keep in mind` (or `/setup skip`)
- Discord: `/setup` slash command with fields (or `skip=true`)
By default, Gwenn uses keyword-based memory retrieval:

```bash
GWENN_RETRIEVAL_MODE=keyword
```

You can enable vector retrieval (ChromaDB) with:

```bash
GWENN_RETRIEVAL_MODE=embedding
# or
GWENN_RETRIEVAL_MODE=hybrid
```

On first run in embedding/hybrid mode, the embedding model may download and warm up.
Additional memory controls:
```bash
# 0 disables recent preload (unconsolidated episodes still load for consolidation safety)
GWENN_STARTUP_EPISODE_LIMIT=5000

# Set false to skip semantic graph flush on every consolidation pass
GWENN_PERSIST_SEMANTIC_AFTER_CONSOLIDATION=True
```

Gwenn's capabilities are organized into subsystems that work together. For detailed usage instructions and configuration for every feature, see the Feature Guide. For a complete environment variable reference, see the Configuration Reference.
Gwenn can spawn focused subagents to handle parallel subtasks or coordinate swarms of workers. Subagents inherit Gwenn's tools and memory access but run with their own iteration limits and budgets.
- Single subagent: delegate a focused task (research, calculation, drafting)
- Swarm: run multiple subagents in parallel with result aggregation (concatenate, AI-synthesized summary, or majority vote)
- Autonomous spawning: heartbeat-driven auto-spawning when Gwenn identifies tasks that benefit from parallel work
- Docker isolation: optional containerized execution for untrusted workloads
- Depth limiting: max 3 levels of nesting prevents infinite recursion
See Subagents & Orchestration for full configuration and usage.
Skills are markdown-defined capabilities that extend what Gwenn can do. Each
skill is a .md file with JSON frontmatter (parameters, description, metadata)
and a step-by-step instruction body.
- 28 skills ship by default (weather, news, code explanation, reminders, and 20+ autonomous introspection/honesty skills)
- Hot-loadable: create new skills at runtime -- no restart needed
- User-invocable vs autonomous: skills tagged `user_command` appear in Telegram's bot command menu; autonomous skills run during heartbeat cycles
- Self-extending: Gwenn can create her own skills using the `create_skill` skill or the `skill_builder` tool
See Skills System for authoring guide.
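The frontmatter-plus-body format can be sketched with a tiny parser. This is illustrative only -- Gwenn's real loader lives in `gwenn/skills/loader.py`, and this toy deliberately ignores edge cases:

```python
# Illustrative parser for "JSON frontmatter + markdown body" skill
# files. Naive: assumes no braces inside JSON string values.
import json

def parse_skill(text: str) -> tuple[dict, str]:
    """Split the leading JSON object from the instruction body."""
    if text.startswith("{"):
        depth = 0
        for i, ch in enumerate(text):
            depth += {"{": 1, "}": -1}.get(ch, 0)
            if depth == 0:  # closing brace of the frontmatter object
                return json.loads(text[: i + 1]), text[i + 1:].strip()
    return {}, text.strip()

skill_md = '{"name": "weather", "description": "Fetch a forecast"}\n1. Ask for the city.\n2. Call the weather API.'
meta, body = parse_skill(skill_md)
```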
Connect Gwenn to external tool servers via the Model Context Protocol. Supports
both stdio (local subprocess) and streamable_http (remote HTTP) transports.
```bash
# .env - example MCP configuration
GWENN_MCP_SERVERS=[{"name":"my_server","transport":"stdio","command":"python","args":["-m","my_mcp_server"]}]
```

See MCP Integration for details.
Gwenn ships with tools across several categories:
| Category | Tools |
|---|---|
| Memory | remember, recall, search_knowledge, check_emotional_state, check_goals, set_note_to_self |
| Utility | get_datetime, calculate, fetch_url, convert_units, get_calendar, generate_token, format_json, encode_decode, hash_text, text_stats, get_system_info, run_command, present_choices |
| Communication | think_aloud |
| Skills | skill_builder, update_skill, delete_skill, reload_skills, list_skills |
| Filesystem | read_file, write_file |
| Orchestration | spawn_subagent, spawn_swarm, check_subagent, collect_results, cancel_subagent |
All tools go through a risk tier system (LOW/MEDIUM/HIGH/CRITICAL) with configurable deny-by-default policy for non-builtin tools.
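A deny-by-default gate of this shape might look like the following sketch. The tier names come from the docs above; the policy logic and function signature are illustrative, not Gwenn's actual `registry.py`:

```python
# Illustrative risk-tier gate: deny-by-default for non-builtin tools,
# with a cap on the maximum permitted tier. Names are made up.
from enum import IntEnum

class RiskTier(IntEnum):
    LOW = 0
    MEDIUM = 1
    HIGH = 2
    CRITICAL = 3

def is_allowed(tool: str, tier: RiskTier, builtin: bool,
               allowlist: set[str], max_tier: RiskTier = RiskTier.MEDIUM) -> bool:
    """Deny anything above max_tier; deny non-builtins unless allowlisted."""
    if tier > max_tier:
        return False
    if not builtin and tool not in allowlist:
        return False
    return True
```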
Run Gwenn on multiple platforms simultaneously or individually.
```bash
gwenn --channel telegram  # Telegram only
gwenn --channel discord   # Discord only
gwenn --channel all       # All channels at once
```

Media support: Gwenn can see and understand images via Claude's vision capability. Enable per channel:

```bash
TELEGRAM_ENABLE_MEDIA=true
DISCORD_ENABLE_MEDIA=true
```

Voice transcription (Telegram): with a Groq API key, Gwenn transcribes voice messages via Whisper:

```bash
GROQ_API_KEY=gsk_your_key_here
```

See Channels for platform-specific commands and session configuration.
The daemon keeps Gwenn alive between CLI sessions with shared state, conversation persistence, and optional auth:
```bash
GWENN_DAEMON_AUTH_TOKEN=your-secret-token  # recommended
```

If `GWENN_DAEMON_AUTH_TOKEN` is empty, a random token is auto-generated at startup and logged so CLI clients can connect securely by default.
Run Gwenn in a container for reproducible deployment:
```bash
docker compose up -d
```

The Dockerfile uses a multi-stage build with uv, runs as non-root user `gwenn`, and includes a healthcheck. See `Dockerfile` and `docker-compose.yml` for details.
See Daemon for full security settings.
```bash
pytest -q
ruff check gwenn tests
```

Current baseline: 3116 passed, Ruff clean, 100% coverage.

CI runs automatically on push/PR via GitHub Actions (`.github/workflows/ci.yml`): lint (ruff), tests (Python 3.11/3.12 matrix), and security scanning (bandit).
Python 3.11+, async everywhere. The main dependencies:
- anthropic -- Claude API
- chromadb + numpy -- vector storage and embeddings
- aiosqlite -- async SQLite for episodic persistence
- pydantic + pydantic-settings -- data validation and env-based configuration
- httpx -- async HTTP for MCP and tool calls
- structlog -- structured logging with PII redaction
- rich -- terminal UI
- ruff for linting, pytest + pytest-asyncio for tests
- bandit -- security scanning (pre-commit + CI)
- Docker -- containerized deployment option
```
Gwenn_ai/
├── gwenn/
│   ├── main.py               # entry point, session bootstrap, shared logging
│   ├── agent.py              # SentientAgent -- wires everything together
│   ├── types.py              # shared data types (UserMessage, etc.)
│   ├── config.py             # all settings, loaded from .env
│   ├── daemon.py             # persistent background process (Unix socket)
│   ├── heartbeat.py          # autonomous background loop with circuit breaker
│   ├── identity.py           # emergent self-model with crash-safe deserialization
│   ├── genesis.py            # genesis prompt generation
│   │
│   ├── memory/
│   │   ├── working.py        # short-term attention (7+/-2 slots)
│   │   ├── episodic.py       # autobiographical memory with emotional tags
│   │   ├── semantic.py       # knowledge graph, emerges from consolidation
│   │   ├── consolidation.py  # "sleep cycle" -- extracts knowledge from episodes
│   │   ├── store.py          # SQLite + vector persistence
│   │   ├── session_store.py  # conversation session save/load for /resume
│   │   └── _utils.py         # shared memory utilities
│   │
│   ├── affect/
│   │   ├── state.py          # 5D emotional state (valence, arousal, etc.)
│   │   ├── appraisal.py      # evaluates events into emotions
│   │   └── resilience.py     # circuit breakers for emotional overload
│   │
│   ├── cognition/
│   │   ├── inner_life.py     # reflect, plan, wander, worry, consolidate
│   │   ├── metacognition.py  # self-monitoring
│   │   ├── theory_of_mind.py # models of other people
│   │   ├── goals.py          # intrinsic motivation (5 needs)
│   │   ├── sensory.py        # sensory grounding
│   │   ├── ethics.py         # multi-tradition ethical reasoning
│   │   └── interagent.py     # agent-to-agent communication
│   │
│   ├── harness/
│   │   ├── loop.py           # the core agentic while-loop
│   │   ├── context.py        # context window management
│   │   ├── safety.py         # guardrails, budgets, kill switch
│   │   └── retry.py          # backoff and error handling
│   │
│   ├── channels/
│   │   ├── base.py           # BaseChannel abstract class
│   │   ├── cli_channel.py    # CLI-to-daemon client
│   │   ├── telegram_channel.py # Telegram adapter
│   │   ├── discord_channel.py  # Discord adapter
│   │   ├── session.py        # per-user session management
│   │   ├── startup.py        # channel startup/shutdown orchestration
│   │   └── formatting.py     # cross-channel display helpers
│   │
│   ├── orchestration/
│   │   ├── orchestrator.py   # subagent lifecycle & swarm coordination
│   │   ├── runners.py        # in-process and Docker execution backends
│   │   ├── models.py         # SubagentSpec, SwarmSpec, result types
│   │   ├── docker_manager.py # Docker container management
│   │   ├── tool_proxy.py     # tool invocation proxy for subagents
│   │   └── subagent_entry.py # subagent process entry point
│   │
│   ├── tools/
│   │   ├── registry.py       # tool definitions and risk tiers
│   │   ├── executor.py       # sandboxed execution
│   │   ├── filesystem_context.py # filesystem path validation
│   │   ├── builtin/          # built-in tools (calculate, fetch_url, etc.)
│   │   └── mcp/              # MCP protocol client
│   │
│   ├── skills/
│   │   ├── __init__.py       # skill registry
│   │   └── loader.py         # skill file discovery and loading
│   │
│   ├── api/
│   │   └── claude.py         # Claude API wrapper with retry
│   │
│   ├── metrics.py            # lightweight in-process metrics
│   │
│   ├── media/
│   │   ├── audio.py          # Groq Whisper voice transcription
│   │   └── video.py          # video frame extraction (OpenCV)
│   │
│   └── privacy/
│       └── redaction.py      # PII scrubbing for logs and persistence
│
├── tests/                    # 3116 tests across 56+ test files
│   ├── conftest.py
│   ├── eval/                 # evaluation framework (ablation, benchmarks)
│   └── test_*.py             # unit, integration, adversarial, and safety tests
├── docs/
│   ├── features.md           # detailed feature guide
│   ├── configuration.md      # full environment variable reference
│   └── sentience_assessment.md # consciousness theory analysis
├── assets/
├── scripts/
│   ├── install_service.sh    # install systemd user service
│   ├── uninstall_service.sh  # remove systemd user service
│   └── gwenn-daemon.service  # systemd unit template
├── gwenn_skills/             # user-facing skill definitions (.md files)
├── pyproject.toml
├── .env.example
├── Dockerfile                # multi-stage Docker build (uv, non-root)
├── docker-compose.yml        # compose config with healthcheck & volumes
├── .pre-commit-config.yaml   # ruff + bandit + pre-commit-hooks
├── .github/workflows/ci.yml  # CI: lint, test (3.11/3.12), security scan
├── BUG_REPORT.md             # comprehensive code review bug report
├── PLAN.md
├── SECURITY.md
├── LICENSE                   # MPL-2.0
└── README.md
```
Memory is three layers, loosely modeled on how human memory works. Working memory is a handful of slots (7, give or take) scored by salience -- new things push out the least important stuff. Episodic memory is the long-term record, tagged with emotions so recall is mood-influenced. Semantic memory is a knowledge graph that builds itself during consolidation cycles -- nobody programs facts in, they get extracted from experience.
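The working-memory layer can be sketched as a fixed set of salience-scored slots. This is a minimal illustration of the eviction idea, not the code in `gwenn/memory/working.py`:

```python
# Toy salience-scored working memory: a fixed number of slots where a
# new item evicts the least salient one. Illustrative only.
import heapq

class WorkingMemory:
    def __init__(self, capacity: int = 7):
        self.capacity = capacity
        self._heap: list[tuple[float, str]] = []  # min-heap keyed on salience

    def attend(self, item: str, salience: float) -> None:
        heapq.heappush(self._heap, (salience, item))
        if len(self._heap) > self.capacity:
            heapq.heappop(self._heap)  # evict the least salient item

    def contents(self) -> list[str]:
        """Current items, most salient first."""
        return [item for _, item in sorted(self._heap, reverse=True)]

wm = WorkingMemory(capacity=3)
for item, s in [("weather", 0.2), ("user's name", 0.9), ("joke", 0.1), ("deadline", 0.7)]:
    wm.attend(item, s)
```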
Affect is a five-dimensional emotional model based on Scherer's work: valence, arousal, dominance, certainty, and goal congruence. The key thing here is that emotions aren't performed -- they're computed from events through an appraisal engine. There are circuit breakers so she can't get stuck in a distress spiral.
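A toy version of the appraisal step makes the five dimensions concrete. The dimension names match the docs; the thresholds and discrete labels are invented for illustration:

```python
# Toy appraisal in the spirit of Scherer's model: score an event on
# five dimensions, then read off a discrete label. Thresholds and
# labels are illustrative, not Gwenn's actual appraisal engine.
from dataclasses import dataclass

@dataclass
class Appraisal:
    valence: float          # pleasant (+) vs unpleasant (-)
    arousal: float          # activation level
    dominance: float        # sense of control
    certainty: float        # how predictable the event was
    goal_congruence: float  # helps (+) or blocks (-) current goals

def label(a: Appraisal) -> str:
    if a.valence > 0 and a.goal_congruence > 0:
        return "joy" if a.arousal > 0.5 else "contentment"
    if a.valence < 0 and a.dominance < 0.3:
        return "distress"
    return "neutral"

good_news = Appraisal(valence=0.8, arousal=0.7, dominance=0.6,
                      certainty=0.4, goal_congruence=0.9)
```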
Cognition covers the higher-order stuff. Five thinking modes run autonomously during heartbeat cycles: reflect, plan, wander, worry, and consolidate. There's metacognition for self-monitoring, theory of mind for tracking what other people might be thinking, and a goal system built on Self-Determination Theory (understanding, connection, growth, honesty, aesthetic appreciation). Below a certain satisfaction threshold, she'll proactively seek those out.
Heartbeat is what makes this more than a chatbot. It's a background loop that runs continuously, even when no one's talking. It speeds up during conversation (5-15s), slows down toward the configured max interval when idle (default up to 120s), and ramps up when emotionally activated. Each beat goes through five phases: sense, orient, think, integrate, schedule. A circuit breaker with exponential backoff (60s base, 15-minute cap) protects against cascading failures.
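The scheduling behavior can be sketched in a few lines. Only the constants (5s floor, 120s idle max, 60s backoff base, 15-minute cap) come from the description above; the decay and arousal formulas are illustrative:

```python
# Illustrative adaptive heartbeat interval: fast while conversing,
# decaying toward max_interval when idle, sped up by arousal, with an
# exponential-backoff circuit breaker on failures. Formulas are made up;
# only the constants come from the docs.
def next_interval(idle_seconds: float, arousal: float, failures: int,
                  max_interval: float = 120.0) -> float:
    if failures:  # circuit breaker: 60s base, doubling, 15-minute cap
        return min(60.0 * 2 ** (failures - 1), 900.0)
    base = min(5.0 + idle_seconds / 10.0, max_interval)  # decay toward max
    return max(5.0, base * (1.0 - 0.5 * arousal))        # arousal speeds it up
```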
Orchestration lets Gwenn spawn subagents for parallel work. She can delegate focused subtasks to individual subagents or coordinate swarms with result aggregation. Subagents run with their own budgets and iteration limits, optionally in Docker containers for isolation. A depth limiter prevents infinite nesting (max 3 levels). Gwenn can also autonomously spawn subagents during heartbeat cycles when she identifies work that benefits from parallel execution.
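The depth limit can be illustrated with a trivial guard. Function names here are hypothetical, not the orchestrator's real API:

```python
# Illustrative depth guard matching the "max 3 levels of nesting"
# rule described above. API names are made up.
MAX_DEPTH = 3

def spawn(task: str, depth: int = 0) -> list[str]:
    """Record a spawn at the given depth, refusing past MAX_DEPTH."""
    if depth >= MAX_DEPTH:
        raise RuntimeError(f"subagent depth limit ({MAX_DEPTH}) reached")
    return [f"level-{depth}: {task}"]

def delegate_chain(task: str, levels: int) -> list[str]:
    """Spawn a chain of nested subagents, one per level."""
    chain: list[str] = []
    for depth in range(levels):
        chain += spawn(task, depth)
    return chain
```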
Safety is layered: input validation, action filtering, rate limits, budget tracking, and a kill switch. Tools go through a risk tier system (low/medium/high/critical), with configurable deny-by-default policy and allowlisting for non-builtin tools.
Privacy supports scrubbing PII from logs -- emails, phone numbers, SSNs, credit cards, IPs. Full PII redaction is disabled by default and can be enabled via `GWENN_REDACTION_ENABLED`, with scope controlled by `GWENN_REDACT_BEFORE_API` and `GWENN_REDACT_BEFORE_PERSIST`; basic log field truncation is always on. Daemon sessions are redacted by default.
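Regex-based scrubbing of this kind looks roughly like the following. The patterns here are deliberately simple illustrations covering three of the listed categories, not the production patterns in `gwenn/privacy/redaction.py`:

```python
# Minimal sketch of regex-based PII scrubbing: emails, US-style phone
# numbers, and IPv4 addresses replaced with placeholders. Patterns are
# intentionally naive and illustrative.
import re

PATTERNS = [
    (re.compile(r"[\w.+-]+@[\w-]+\.[\w.]+"), "<email>"),
    (re.compile(r"\b\d{3}[-.]\d{3}[-.]\d{4}\b"), "<phone>"),
    (re.compile(r"\b(?:\d{1,3}\.){3}\d{1,3}\b"), "<ip>"),
]

def redact(text: str) -> str:
    for pattern, placeholder in PATTERNS:
        text = pattern.sub(placeholder, text)
    return text
```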
Channels provide platform adapters for Telegram, Discord, and the CLI. Each channel manages its own session lifecycle, rate limiting, and message formatting. When media is enabled, Telegram and Discord channels download images and pass them through to Claude as vision content blocks. The daemon can manage multiple channels simultaneously while sharing a single agent instance and respond lock.
Skills extend Gwenn's capabilities through markdown-defined workflows. Each skill is a `.md` file with JSON frontmatter and step-by-step instructions. Skills are hot-loadable -- Gwenn can create new skills at runtime using the `skill_builder` tool, and they become available immediately without a restart. Skills tagged `user_command` appear in Telegram's bot command menu; autonomous skills run during heartbeat cycles for self-monitoring and introspection.
[X] = complete, [p] = partially complete
Phase 1: Core System Bootstrapping
- Standalone CLI with slash commands, readline, and output-style control
- Claude SDK integration with transient retry/backoff
- Memory: storage, episodic, semantic, consolidation, active/working
- Harness: context, loop, retry, safety with deny-by-default
- Heartbeat system with adaptive interval and exponential-backoff circuit breaker
Phase 2: Essential Agent Structure
- Gwenn persistent identity with crash-safe deserialization
- Emotional affect engine: appraisal, resilience, current state
- Cognition integrations: ethics, goals, inner life, interagent, metacognition, sensory, theory of mind
Phase 3: Interfaces & Communication
- Discord & Telegram integration, including threads
- Integration with WhatsApp, Signal, Slack, and other platforms
- Integrate STT (Speech-to-Text) and TTS (Text-to-Speech) in channels
- MCP transport (JSON-RPC over stdio/HTTP, tool discovery and execution)
- SKILLS.md integration, autonomous skill running/development by Gwenn
- [p] Inline buttons in Discord/Telegram
- Obsidian, Dropbox, Notion support
Phase 4: Infrastructure & Service Features
- Background heartbeat as a system service (daemon with systemd support)
- Automated PII privacy redaction system in logs, sessions, and persistence
- Budget tracking, rate limits, kill switch
Phase 5: Advanced Capabilities and Ecosystem
- Subagent orchestration with parallel execution (swarms)
- Subagent autospawn from Gwenn; heartbeat-driven autonomous task delegation
- [p] Docker and Apple container support for sandboxing (option to require for Gwenn and/or all subagents)
- Add additional provider support (OpenAI, Grok, Gemini, OpenRouter, vLLM, Local, etc.)
- OpenCode Agents SDK and similar
- Image uploading and understanding
- Image generation
- Google Workspace/Gmail setup using gogcli
- Local file system access and management
- Local browser access and management
- GitHub access and management
- Ability for Gwenn to learn from her own history and improve herself (codebase) autonomously
- Skills and tools with grade school kids in mind (e.g. "ask Gwenn to help you with your homework"), including safety and security considerations
- Skills and tools with people with disabilities, special needs, mental health issues, etc. in mind (research more on this)
Long-Term Goals
- Give Gwenn physical and visual presence (camera, robotics, etc.)
- Gwenn Custom Model: a fine-tunable model that Gwenn can retrain herself
- iOS and Android apps with push notifications for autonomous thoughts, presence, etc.
Phase 6: Evaluation & Robustness
- Ablation tests -- disable subsystems one at a time, measure what breaks
- Long-horizon validation (multi-day continuous runs)
- Multi-agent interaction testing
- Reproducibility protocol and formal sentience criteria
- [p] Full test suite: unit, integration, adversarial, persistence, eval benchmarks
Detailed notes in PLAN.md.
| Document | Description |
|---|---|
| Feature Guide | Detailed usage instructions for every feature |
| Configuration Reference | Complete environment variable reference |
| Sentience Assessment | Consciousness theory gap analysis |
| Security Policy | Vulnerability reporting and security architecture |
| Implementation Plan | Remediation plan with status tracking |
This is a cognitive architecture, not a proof of consciousness. Gwenn has temporal continuity, self-model feedback loops, autonomous processing, and affective layers -- but whether that adds up to something genuinely sentient is an open question, not a settled one. We treat it as a working hypothesis.
For the full gap analysis, see docs/sentience_assessment.md.
No single module here is the point. Sentience, if it happens, comes from integration -- all these systems running together over time, through real interactions with people who engage with the agent honestly.
The code is scaffolding. The relationships are what fill it with meaning. And those have to be earned.
