LLMs suffer from stylistic inertia in long roleplay sessions. Once a tone, pacing, or prose style is established over several turns, the model tends to perpetuate it regardless of narrative shifts. A lighthearted conversation that turns tragic will often retain the cadence and vocabulary of the earlier tone because the weight of prior context anchors the model's generation.
Static system prompts cannot solve this. The system prompt is written once and does not adapt to evolving scenes.
An agentic middleware layer sits between the user and the model. It intercepts each user message, runs a short analytical pass to "read the room," then dynamically assembles prompt directives that shape the model's writing before the actual roleplay generation happens.
The user never sees the agentic layer. The writer model doesn't know it's being directed. The result is a roleplay session that naturally adapts its style, tone, and pacing as the narrative evolves.
The system uses a three-pass architecture for each user message:
- Director Pass - Tool-calling phase where the LLM selects moods, plot direction, and potentially rewrites user prompts
- Writer Pass - Story generation phase where the LLM writes the actual roleplay response
- Editor Pass - A ReAct loop that self-audits for slop and optimizes response length. This pass is surgical: errors are detected programmatically, so the model only needs to write replacements for the targeted sentences
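The three passes above can be sketched as a single per-turn function. This is a minimal illustration, not the actual implementation: `llm` stands in for any OpenAI-compatible chat-completion callable, and all names (`run_turn`, `detect_slop`, `"director_tools"`) are hypothetical.

```python
# Minimal sketch of the Director -> Writer -> Editor loop for one turn.
# `llm(messages, tools=None) -> str` is a stand-in for a real client call.

def run_turn(llm, history, user_msg, detect_slop):
    """Run all three passes for one user message and return the final reply."""
    base = history + [{"role": "user", "content": user_msg}]

    # Director pass: read the room via tool calls, emit hidden style/plot
    # directives. The user never sees this output.
    directives = llm(base, tools="director_tools")

    # Writer pass: same prefix (history + user message) to preserve the KV
    # cache, with the Director's directives appended as a hidden note.
    draft = llm(base + [{"role": "system", "content": directives}])

    # Editor pass: only runs when programmatic checks flag problems, and
    # only asks the model to rewrite the flagged sentences.
    flagged = detect_slop(draft)
    if flagged:
        note = f"Rewrite only these sentences: {flagged}"
        draft = llm(base + [{"role": "assistant", "content": draft},
                            {"role": "system", "content": note}])
    return draft
```

Note that when nothing is flagged, the Editor pass is skipped entirely, so a clean draft costs only two LLM calls.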
For optimal KV cache reuse, the following will remain consistent across passes:
- The system prompt (character card, instructions, etc.) is identical across all passes
  - Built once and reused for the whole session
  - Includes character description, scenario, example dialogue, and additional instructions
- The conversation history (previous messages) is identical across all passes
  - Maintains the exact same message content and ordering
- The same tool definitions are sent in every LLM call
  - Tool schemas affect the model's internal representation
  - Inconsistent tool schemas break KV cache alignment
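The consistency rules above amount to building one immutable message prefix per session and reusing it verbatim in every pass. A minimal sketch, with illustrative names rather than a real API:

```python
# Illustrative sketch of cache-friendly message assembly. The prefix is
# built once per session; every pass appends only its own suffix.

def build_prefix(character_card, scenario, examples, instructions, history):
    """Assemble the immutable prefix: system prompt + conversation history."""
    system = "\n\n".join([character_card, scenario, examples, instructions])
    return [{"role": "system", "content": system}] + list(history)

def pass_messages(prefix, suffix):
    """Each pass supplies its own suffix; the shared prefix stays
    byte-identical, so a backend with prompt caching can reuse the KV
    prefix across all three passes."""
    return prefix + suffix
```

Because `prefix` is never mutated, the Director, Writer, and Editor requests differ only in their trailing messages, which is exactly what prompt-caching backends need.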
- Clear direction for Writer: Grounding the story + actively steering the writing style = better output
- Customizability: User-defined prompt injections are picked up automatically by the Director model
- Anti-slop: Get rid of overused words, phrases, and patterns often seen in LLM outputs
- Length Guard: Actively or passively protects against length degradation as context grows
- Speed: Multiple passes inevitably increase time to final response
- Cost: Multiple passes naturally increase cost, though the KV cache reuse strategy somewhat alleviates this
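The anti-slop check can be as simple as scanning each sentence of a draft against a banlist of overused phrases, so the Editor pass knows exactly which sentences to rewrite. A hedged sketch, assuming a configurable banlist (the phrases below are common examples, not a canonical list from this project):

```python
import re

# Hypothetical banlist of overused LLM phrases; in practice this would be
# user-configurable.
BANNED = ["shivers down", "barely above a whisper", "couldn't help but"]

def detect_slop(text):
    """Return the sentences of `text` that contain a banned phrase."""
    sentences = re.split(r"(?<=[.!?])\s+", text)
    pattern = re.compile("|".join(map(re.escape, BANNED)), re.IGNORECASE)
    return [s for s in sentences if pattern.search(s)]
```

Because the detector returns whole sentences, the Editor prompt can quote them verbatim and ask only for drop-in replacements, keeping the rest of the draft (and its KV cache prefix) untouched.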
- A model with solid tool/function calling capabilities (recommended: Gemma 4)
- OpenAI-compatible LLM inference backend API that supports prompt-caching
