ActiveHarness

⚠️ Work in progress. The API is under active development and may change between versions without notice.

Running a single LLM call is easy. Running a reliable, observable, cost-controlled AI system is not.

ActiveHarness is a Ruby framework for building production-grade LLM pipelines — with deep observability, consensus-based decisions, automatic fallbacks, and real-time cost and timing control. Made for Rails, works in plain Ruby too.

The ActiveHarness gem gives you the scaffolding to build multi-step pipelines where every agent is under full control: its inputs are directed, its outputs are observed, its errors are retried, and its cost is tracked. You define the logic; ActiveHarness handles the infrastructure.

What is a "Harness"?

A harness in software is scaffolding that keeps a component under control — directing its inputs, observing its outputs, and enforcing rules around it. ActiveHarness does exactly that for LLM agents.

Build AI-Based Pipelines!

Build multi-step, trackable, cost-effective, and reliable AI flows with a clean, Rails-native DSL.

Build Nested AI Pipelines!

Group related steps into reusable sub-pipelines, and compose complex workflows from smaller ones. Each pipeline is just another step, with its own stop conditions, context forwarding, and execution time tracking.
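The idea that a pipeline is just another step can be sketched in plain Ruby. This is not the ActiveHarness DSL: `SketchPipeline`, the `stop_if:` option, and the lambdas below are hypothetical names chosen for illustration, and the "AI" step is a stand-in that calls no model.

```ruby
# Hypothetical sketch, not the ActiveHarness DSL.
# A step is anything that responds to #call(context) and returns a new context Hash.
class SketchPipeline
  attr_reader :timings

  def initialize(name, steps, stop_if: nil)
    @name = name
    @steps = steps      # array of [label, callable] pairs
    @stop_if = stop_if  # optional lambda(context) that returns true to halt early
    @timings = {}       # label => elapsed seconds
  end

  # Runs steps in order, forwarding the context from one step to the next.
  def call(context)
    @steps.each do |label, step|
      started = Process.clock_gettime(Process::CLOCK_MONOTONIC)
      context = step.call(context)
      @timings[label] = Process.clock_gettime(Process::CLOCK_MONOTONIC) - started
      break if @stop_if&.call(context)
    end
    context
  end
end

# Deterministic step: plain Ruby, no model involved.
normalize = ->(ctx) { ctx.merge(text: ctx[:text].strip.downcase) }

# Stand-in for an AI step; a real agent would call a model here.
classify = ->(ctx) { ctx.merge(label: ctx[:text].include?("refund") ? :billing : :other) }

draft_reply = ->(ctx) { ctx.merge(reply: "Routing to #{ctx[:label]}") }

# The inner pipeline stops as soon as a label is present.
triage = SketchPipeline.new(
  :triage,
  [[:normalize, normalize], [:classify, classify]],
  stop_if: ->(ctx) { ctx.key?(:label) }
)

# The outer pipeline uses the inner pipeline as just another step.
support = SketchPipeline.new(:support, [[:triage, triage], [:draft_reply, draft_reply]])

result = support.call({ text: "  I want a REFUND  " })
```

Because both lambdas and `SketchPipeline` respond to `call`, nesting needs no special case: the outer pipeline records one timing for `:triage`, while the inner pipeline keeps its own per-step timings.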

Compose Hybrid Pipelines!

Orchestrate deterministic and AI steps together.

Control the Cost of Your AI Calls!

With ActiveHarness you can track time, tokens, and dollars for every agent call, pipeline step, and tribunal.
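To make the three tracked quantities concrete, here is a minimal sketch of a ledger that sums time, tokens, and dollars across calls. The `Usage` struct, `CostLedger` class, model name, and per-token prices are all assumptions made up for this example; they are not ActiveHarness classes or real provider prices.

```ruby
# Hypothetical sketch: names and prices are illustrative assumptions.
PRICE_PER_MILLION_TOKENS = {
  "example-small" => { input: 0.50, output: 1.50 } # USD, made-up figures
}.freeze

Usage = Struct.new(:model, :input_tokens, :output_tokens, :seconds, keyword_init: true) do
  # Dollar cost of one call, derived from its token counts.
  def dollars
    price = PRICE_PER_MILLION_TOKENS.fetch(model)
    (input_tokens * price[:input] + output_tokens * price[:output]) / 1_000_000.0
  end
end

class CostLedger
  def initialize
    @entries = []
  end

  def record(usage)
    @entries << usage
  end

  def total_tokens
    @entries.sum { |u| u.input_tokens + u.output_tokens }
  end

  def total_dollars
    @entries.sum(&:dollars)
  end

  def total_seconds
    @entries.sum(&:seconds)
  end
end

ledger = CostLedger.new
ledger.record(Usage.new(model: "example-small", input_tokens: 1_000, output_tokens: 500, seconds: 0.8))
ledger.record(Usage.new(model: "example-small", input_tokens: 2_000, output_tokens: 1_000, seconds: 1.2))
```

One ledger per pipeline run, or per tribunal, would give the per-step and per-run totals described above.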

[Screenshots: Cost in Application | Provider's Cost]

Use Consensus-Based Decisions!

Use Tribunals to run multiple agents in parallel and reach Verdicts based on their agreement, which improves reliability and helps reduce bias and hallucinations.
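A majority-rule tribunal can be sketched in a few lines of plain Ruby. This is an illustration of the consensus idea only: `majority_verdict`, the `Verdict` struct, and the three rule-based "agents" are hypothetical stand-ins, not the ActiveHarness Tribunal API. It assumes Ruby 2.7 or later for `Enumerable#tally`.

```ruby
# Hypothetical sketch: runs stand-in "agents" in threads and applies a majority rule.
Verdict = Struct.new(:decision, :votes, :agreed, keyword_init: true)

def majority_verdict(agents, input)
  # Each agent is a callable; Thread#value waits for the thread and returns its answer.
  answers = agents.map { |agent| Thread.new { agent.call(input) } }.map(&:value)
  votes = answers.tally
  decision, count = votes.max_by { |_answer, n| n }
  agreed = count > agents.size / 2.0 # strict majority
  Verdict.new(decision: agreed ? decision : nil, votes: votes, agreed: agreed)
end

# Stand-ins for model-backed agents; real agents would call an LLM.
agents = [
  ->(text) { text.include?("free money") ? :spam : :ok },
  ->(text) { text.length > 200 ? :spam : :ok },
  ->(text) { text.match?(/click here/i) ? :spam : :ok }
]

verdict = majority_verdict(agents, "Click here for free money")
```

Here two of the three agents answer `:spam`, so the verdict is accepted. With no strict majority, `decision` is `nil` and the caller decides what to do next.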

Provide Event Tracing & Observability!

Use event hooks to log and trace every step of your AI flows, from individual agent calls to multi-step pipelines and parallel tribunals.

[Images: Event Tracing Architecture | Grafana Dashboard]

Backend Agnostic — Built on OpenTelemetry, ready for any collector (Jaeger, Datadog, Honeycomb, or custom).
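The event-hook idea can be illustrated with a small listener registry. The event names below are the ones listed under Key Capabilities (`before_call`, `after_call`, `retry`, `failure`), but `HookedAgent` and its `on` method are hypothetical and not the ActiveHarness API. The sketch emits only three of the four events, since it does no retrying, and it does not use OpenTelemetry.

```ruby
# Hypothetical sketch of lifecycle hooks; not the ActiveHarness API.
class HookedAgent
  EVENTS = %i[before_call after_call retry failure].freeze

  def initialize(&work)
    @work = work
    @hooks = Hash.new { |hash, event| hash[event] = [] }
  end

  # Register a listener for one lifecycle event.
  def on(event, &listener)
    raise ArgumentError, "unknown event: #{event}" unless EVENTS.include?(event)

    @hooks[event] << listener
    self
  end

  # Runs the wrapped work and notifies listeners before, after, and on failure.
  def call(input)
    emit(:before_call, { input: input })
    output = @work.call(input)
    emit(:after_call, { input: input, output: output })
    output
  rescue StandardError => e
    emit(:failure, { input: input, error: e })
    raise
  end

  private

  def emit(event, payload)
    @hooks[event].each { |listener| listener.call(payload) }
  end
end

log = []
agent = HookedAgent.new { |text| text.upcase }
agent.on(:before_call) { |payload| log << [:before_call, payload[:input]] }
agent.on(:after_call)  { |payload| log << [:after_call, payload[:output]] }

agent.call("hello")
```

A listener could just as well write to a logger or open a tracing span; keeping listeners separate from the agent's work is what lets the same agent be logged, streamed, or traced without changing its code.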

Use Memory to make your agents stateful!

Store conversation history in JSON, SQLite, or PostgreSQL. Inject memory into prompts so that agents remember past interactions.
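As a rough illustration of the JSON option, the sketch below keeps conversation turns in a JSON file and prepends them to the next prompt. `JsonFileMemory`, its method names, and the `role: content` prompt layout are assumptions for this example, not the ActiveHarness memory classes.

```ruby
require "json"
require "tempfile"

# Hypothetical sketch of a JSON-file conversation memory.
class JsonFileMemory
  def initialize(path)
    @path = path
  end

  # Returns all stored turns, or an empty list for a missing or empty file.
  def messages
    return [] unless File.exist?(@path) && !File.zero?(@path)

    JSON.parse(File.read(@path))
  end

  def append(role, content)
    all = messages
    all << { "role" => role, "content" => content }
    File.write(@path, JSON.pretty_generate(all))
  end

  # Prepends the most recent remembered turns to the new prompt.
  def inject_into(prompt, last: 10)
    history = messages.last(last).map { |m| "#{m['role']}: #{m['content']}" }
    (history + ["user: #{prompt}"]).join("\n")
  end
end

file = Tempfile.new(["memory", ".json"])
memory = JsonFileMemory.new(file.path)
memory.append("user", "My name is Ada.")
memory.append("assistant", "Nice to meet you, Ada.")
prompt = memory.inject_into("What is my name?")
```

The `last:` limit keeps the injected history bounded, which matters because every remembered turn adds tokens to each later call.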

Visualize Models' Cost and Type

Add Streaming (SSE)!

[Screenshots: Rails App | Console]

Key Capabilities

| Capability | What it means |
| --- | --- |
| Multi-step Pipelines | Chain agents sequentially, with per-step stop conditions and context forwarding |
| Tribunal Consensus | Run multiple agents in parallel and accept the result only if they agree (unanimous, majority, or custom) |
| Automatic Fallbacks | If a model fails, the next one in the chain takes over, with zero extra code |
| Retry Policy | Exponential backoff per model, globally configurable or per-agent |
| Full Observability | Lifecycle hooks on every agent event (`before_call`, `after_call`, `retry`, `failure`) to log, stream, or act |
| Real-time Streaming | SSE-ready token streaming from any agent into your Rails response |
| Execution Time Tracking | Per-agent and per-pipeline timing built in |
| Token Cost Tracking | Know exactly what each call cost in tokens and dollars |
| Rails-native DSL | Clean file structure, Railtie integration, generator support |
| Event Tracing | OpenTelemetry integration for distributed tracing of agents, tribunals, and pipelines |
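The fallback and retry rows combine naturally: retry one model with exponential backoff, then move to the next model in the chain. The following is a minimal plain-Ruby sketch of that behavior under assumed defaults (three attempts, 0.5 s base delay). `call_with_fallbacks`, its parameters, and the model names are hypothetical, not ActiveHarness configuration.

```ruby
# Hypothetical sketch: fallback chain with per-model exponential backoff.
class AllModelsFailed < StandardError; end

def call_with_fallbacks(models, prompt, max_attempts: 3, base_delay: 0.5, sleeper: ->(s) { sleep(s) })
  errors = []
  models.each do |name, client|
    max_attempts.times do |attempt|
      begin
        return { model: name, output: client.call(prompt), attempts: attempt + 1 }
      rescue StandardError => e
        errors << "#{name}: #{e.message}"
        # Back off 0.5s, 1s, 2s, ... but do not wait after the final attempt.
        sleeper.call(base_delay * (2**attempt)) if attempt < max_attempts - 1
      end
    end
  end
  raise AllModelsFailed, errors.join("; ")
end

calls = Hash.new(0)
# Stand-ins for model clients: the first always fails, the second always succeeds.
flaky = lambda do |_prompt|
  calls[:primary] += 1
  raise "timeout"
end
backup = lambda do |prompt|
  calls[:backup] += 1
  "echo: #{prompt}"
end

delays = []
result = call_with_fallbacks(
  { "primary-model" => flaky, "backup-model" => backup },
  "ping",
  sleeper: ->(seconds) { delays << seconds } # record delays instead of sleeping
)
```

Injecting the `sleeper` keeps the example fast and testable. In this run the primary model is tried three times with recorded delays of 0.5 and 1.0 seconds, then the backup answers on its first attempt.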

File Structure

File structure for Ruby and Ruby on Rails applications:

Place all of your AI-related code in app/ai to keep it organized and separate from your core application logic. You can further organize it into subdirectories for prompts, agents, tribunals, pipelines, and memory.

app/
├── models/
├── controllers/
├── views/
└── ai/
    ├── prompts/      # system prompt classes
    ├── agents/       # agent classes
    ├── tribunals/    # parallel verdict panels
    ├── pipelines/    # multi-step pipelines
    └── memory/       # custom memory classes
