21 Jun 05:34

6e8ae8a

Latest

RAI v2.1.0 Release Notes

What's New

Subagents v2 — Async Background Agent System

Agents can now spawn, monitor, and coordinate parallel background agents from within a single session. The new SubagentService manages full lifecycle including HITL, output discipline, and completion notifications injected back into the parent agent's context.

10 new LLM tools available to the agent:

Tool	Description
`Subagent`	Launch a single background agent; returns `task_id` immediately
`MultiSubagents`	Launch multiple agents in parallel
`Pipeline`	DAG of tasks with dependency ordering
`SubagentStatus`	Get current status + output of a task
`SubagentCancel`	Cancel a running task
`SubagentUpdate`	Send a follow-up message to a completed task
`SubagentResponse`	Block and wait for task completion
`SubagentProgress`	Read live checkpoint of a running task
`ListSubagents`	List available subagent types
`ListTasks`	List all tasks for the current run

Output discipline: every subagent writes structured output to /tmp/rai_{task_id}_output.md. The parent agent receives a ≤400-char preview + file path when the task completes — subagent context never pollutes the main agent's checkpoint.

Claude SDK Mode — No API Key Required

RAI now includes a full Claude Code subscription-backed mode that requires no ANTHROPIC_API_KEY.

rai claude                        # Claude TUI (HITL on by default)
rai claude --no-hitl              # autonomous mode
rai claude --agent pentest        # named agent
rai claude --model claude-opus-4-8
rai claude-serve                  # FastAPI SSE proxy at :8001
rai claude --url http://host:8001 # connect TUI to remote claude-serve

The Textual TUI (rai claude) streams tokens live with tool call cards, subagent progress, thinking blocks, and inline HITL approval panels.

Python SDK:

from rai.sdk import ClaudeAgent

async with ClaudeAgent.builder().agent_name("rai").without_hitl().build() as agent:
    result = await agent.run("scan example.com for vulnerabilities")

Remote TUI (`--url`)

Connect the Claude TUI to a remote rai claude-serve without spawning a local subprocess:

rai claude --url http://remote-server:8001 --api-key <key>

Health-checks the remote server before launching; exits with a clear error if unreachable.

Docker Container Routing (`--docker`)

All bash and execute tool calls can be routed to a persistent Docker container instead of the host. Bring any image — Kali, Parrot, or a custom pentest toolkit.

rai chat --docker kalilinux/kali-rolling
rai chat --docker parrotsec/security

A shared workspace is mounted at /workspace inside the container. The container is started on server boot and stopped on exit. No default image is provided — users supply their own.

Container capabilities: NET_RAW, NET_ADMIN, SYS_PTRACE, SETUID, CHOWN, DAC_OVERRIDE, NET_BIND_SERVICE, KILL

API Auth Hardening

Fixed: /docs, /openapi.json, and /redoc were bypassing the X-API-Key guard on the HTTP harness when --server-key was set — an attacker on the network could enumerate all API endpoints and their full schemas without a key. Now only /ok (health check) is exempt.
Both servers (rai http serve and rai claude-serve) now log a startup warning when launched without an API key.

New Middleware (7 layers added)

Middleware	Role
`AllowedToolsMiddleware`	Hard-enforce per-run tool whitelist via `ContextVar`
`ReadFileDedupMiddleware`	Stub unchanged file re-reads via mtime check
`SubagentRestrictMiddleware`	Block `task` tool for subagents (no recursive nesting)
`DockerRouterMiddleware`	Route bash/execute to Docker container when `--docker` is active
`RetryMiddleware`	Auto-retry 429/502/503/timeouts with exponential backoff
`WriteTruncateMiddleware`	Truncate file-write args in old messages; last 5 writes kept verbatim
`CostTrackingMiddleware`	Accumulate per-run cost in `ContextVar`

Assets 2

03 Jun 05:42

th3sanjai

v2.0.2

e618510

V2.0.2 Version Release

What's New

6–8× Cost Reduction via Prompt Caching Parity with Claude Code

RAI now sends requests using the Anthropic wire format (POST /v1/messages) instead of the OpenAI format (POST /chat/completions). This single change unlocks full prompt caching — the same strategy used by Claude Code — saving 60–90% on input token costs for long sessions.

Before v2.0.2: $40–60 for a full 6-step VAPT session
After v2.0.2: $5–7 for the same session

What changed under the hood

Change	Impact
`ChatAnthropic` replaces `ChatLiteLLM` for Claude models	`cache_control` preserved through proxy
System prompt (70k chars) cached with `ephemeral`	~28k tokens saved every turn after first
Tool definitions (90 tools) cached	~35k tokens saved every turn after first
Last human message cached	Full history served from cache on next turn
Cache TTL removed (was 5 min, now 1h default)	12× longer cache lifetime

Automatic upgrade

All existing Claude model configs (litellm:openai/bedrock-claude-*, anthropic:claude-*) are automatically upgraded to ChatAnthropic routing at runtime. No config changes required.

To explicitly use the new routing:

rai agents config-set rai \
  --model "chatanthropic:bedrock-claude-sonnet-4.6-(US)" \
  --api-key "sk-..." \
  --base-url "https://your-litellm-proxy.example.com"

Extended Thinking Enabled by Default

RAI now sends thinking: {type: enabled, budget_tokens: 31999} on every call — matching Claude Code's behavior. This improves reasoning quality for complex security assessments, reducing mistakes and re-runs.

⚠ Temperature override: Anthropic requires temperature=1.0 when extended thinking is enabled. RAI enforces this automatically for all Claude models. Your config.toml temperature setting is ignored while thinking is active. Non-Claude models (OpenAI, Gemini, Ollama) are unaffected.

To disable thinking and restore your configured temperature:

RAI_THINKING=0 rai chat          # per-run
export RAI_THINKING=0            # permanent

Mode	Temperature used	Notes
`RAI_THINKING=1` (default)	`1.0` (forced by Anthropic)	Best reasoning quality
`RAI_THINKING=0`	Your `config.toml` value (default `0.7`)	Standard mode, lower cost

MITM Proxy Support for Debugging

Capture every LLM request in Burp Suite or mitmproxy:

RAI_INSPECT=1 RAI_INSPECT_PROXY=http://127.0.0.1:8080 rai chat

Works correctly with macOS system proxies (WARP, VPN) — those are bypassed automatically.

Bug Fixes

Fixed StaticSystemPromptCacheBreakpointMiddleware not tagging system[0] due to _should_apply_caching returning False for ChatAnthropic in deepagents
Fixed AnthropicPromptCachingMiddleware stamping ttl: "5m" on all cache blocks (now defaults to Anthropic's 1h)
Fixed RequestInspectorMiddleware failing on macOS when WARP/VPN SOCKS proxy is active

Assets 2

02 Jun 16:12

th3sanjai

v2.0.1

90cec8f

V2.0.1 Release

RAI v2.0.1 Release Notes

What's New

Token Cost Reduction (60–80% on long sessions)

RAI now runs a 3-layer compression pipeline before every model call, keeping costs flat as sessions grow longer:

History trim — clips conversation to a token budget before the model sees it
Tool result compression — old bash/grep/file outputs are truncated; recent results stay verbatim
Summarization — only fires when the first two layers aren't enough

Combined with a configurable cheap model for summarization, a typical VAPT or SAST session now costs significantly less than before.

Configure a cheaper summarization model:

rai agents config-set --compact-model "litellm:openai/bedrock-claude-haiku-4.5"

# With explicit credentials if different from main
rai agents config-set \
  --compact-model "litellm:openai/bedrock-claude-haiku-4.5 \
  --compact-api-key "sk-..." \
  --compact-base-url "https://llmproxy.example.com"

# Clear it (inherit main model)
rai agents config-set --compact-model ""

Also configurable via RAI_COMPACT_MODEL env var.

Loop Detection

The agent no longer gets stuck re-executing the same command. When identical tool calls are detected, RAI returns the cached result with a warning instead of executing again:

⚠ DUPLICATE CALL BLOCKED: 'bash' was already executed 5 times and returned:
0 matches for shell_exec|proc_open|...
This result is final. Accept it and proceed to the next step.

Configurable: RAI_LOOP_WINDOW=10 (default 10 recent calls tracked).

Plan Mode Improvements

Plan completion no longer repeats the full step history in the tool result — agent reads its own notes directly
list_plan_steps returns compact summaries for completed steps, full detail for pending ones
Memory phase at plan exit now explicitly guides the agent to write target-specific methodology to scope='target'

TUI Improvements

Thread resume now shows the most recent messages instead of the oldest
Internal control messages no longer appear as user messages in history

Model Call Diagnostics

Enable detailed per-call logging to verify token consumption:

RAI_DEBUG_LOG_CALLS=1 rai chat
# Logs to ~/.rai/debug/model-calls.jsonl

Each entry shows: message count, total chars, estimated tokens, truncated count, message type breakdown.

Bug Fixes

pip install revolt-rai now works — previous wheel was empty due to packaging misconfiguration
Session approval (approve_for_session) now persists correctly across tool calls
Server shuts down cleanly when TUI exits or crashes
Thread resume loads with the correct agent graph when multiple agents are registered
Audit log no longer blocks the event loop on busy servers
content_file in memory_write now restricted to safe paths only

Assets 2

31 May 15:58

0xPugal

v2.0.0

f3c0f44

v2.0.0

RAI v2.0.0 Release Notes

RAI v2 is a full rewrite and release refresh of the open-source AI security operator. This version focuses on making the main workflow faster to start, easier to package, and broader in capability across autonomous security work.

Highlights

rai chat is now part of the default install and starts the HTTP-backed TUI experience out of the box.
The HTTP server stack is included in the base package, so the main interactive flow works without extra dependencies.
The Docker image is now supported for local testing and GHCR publishing.
The project now ships with a single release commit history for the v2 baseline.
Versioning is aligned across the package, CLI, and runtime as 2.0.0.

Top Features in v2

Interactive Security Operator

RAI is built to operate as a terminal-native security assistant, not a generic chat client. It can run autonomous tasks, manage approvals, and coordinate work through the CLI and TUI.

Plan Mode

Complex work can be broken into structured steps, reviewed before execution, and tracked live during the run. This is the main control point for high-trust autonomous workflows.

Persistent Memory

RAI stores user, agent, and target memory across sessions so it can preserve methodology, findings, preferences, and target context over time.

Multi-Agent Execution

RAI can dispatch specialized subagents for recon, research, code analysis, cloud work, reversing, Android analysis, and other focused tasks.

HTTP Streaming API

The FastAPI-backed server exposes runs, threads, tasks, subagents, HITL approval, and SSE event streams for remote control and TUI integration.

Textual TUI

The built-in TUI gives a rich operator interface for approvals, plan review, findings, threads, model selection, and background runs.

MCP and Skills

RAI integrates with MCP tools and Markdown-based skills so capabilities can be extended without changing the core agent runtime.

Security Tooling

The toolkit covers bash execution, findings management, memory, references, web security, cloud, container, Active Directory, Android, and reversing workflows.

Docker and GHCR

The project now includes a container build path and a release workflow that publishes Docker images to GitHub Container Registry.

Packaging Notes

Base install: pip install revolt-rai
No extra is required for rai chat
Optional provider extras remain available for alternate model backends

What Changed at a Glance

Unified release version: 2.0.0
Default interactive path now works without optional HTTP extras
Docker build and publish workflow added
Source tree and release metadata aligned for the v2 baseline

Compatibility

This release is intended as the new v2 baseline. If you are upgrading from an older branch, review agent configs, custom prompts, and any local workflows that assumed the older packaging layout.

Assets 2

23 Jul 16:10

th3gokul

v1.0.1

e91e042

V1.0.1

🛠️ RAI v1.0.1 – Release

✨ New Features & Enhancements

🧠 GUI-Based Agent & Team Creation

Introduced a full-featured GUI for building YAML configurations.
Enables no-code creation of agents and teams, avoiding the need for manual YAML editing.
Ideal for faster prototyping and collaboration with less technical users.

🌐 Web & API Integration Interface

Added built-in Web and REST API interfaces for extending LLM agents with external tools and services.
Allows seamless communication with reconnaissance tools, exploit chains, or third-party automation systems.

🔌 Enhanced MCP Tool Integration

Improved compatibility and deeper integration with Model Context Protocol (MCP) infrastructure.
Supports enhanced tool orchestration and model routing across agents and teams.

🧬 New LLM Model Support

Added support for multiple new LLM models for use in agent and team configurations.
Enhances flexibility in selecting language models optimized for different tasks (recon, exploit writing, reasoning, etc).

🧠 Memory Context Configuration

Introduced user memory context configuration support.
Agents can now recall past interactions, preferences, and task-specific memory to enable smarter and more personalized responses.

💬 Chat Session & State Management

Implemented persistent chat state tracking for both individual agents and teams.
Agents now remember previous tasks, enabling multi-step reasoning and long-term coordination across sessions.

Contributors:

Assets 2

01 May 17:38

RevoltSecurities

v1.0.0

5c6496c

V1.0.0

RAI (Revolt AI Agent) Version V1.0.0 Release Notes

We are proud to announce the release of RAI (Revolt AI Agent) V1.0.0! This marks the first stable version of the framework, packed with exciting features designed to help cybersecurity professionals automate penetration testing, red teaming, and offensive security tasks using advanced LLM-based agents and teams.

New Features in V1.0.0

Interactive Shell Mode
Switch between LLM Teams and Agents seamlessly within an advanced shell interface. Engage in real-time conversations with selected agents and teams, optimizing workflow and enhancing interaction.
Low-Code YAML-Based Agent & Team Building
Easily build and configure agents and teams using simple YAML templates, making automation quick and accessible without the need for traditional coding.
Multiple Agent & Team Support
Manage and run multiple agents or teams in parallel, each with full isolation to ensure efficient, controlled, and secure operations.
Tool Integration (SSE & stdio)
Integrate third-party tools effortlessly using Server-Sent Events (SSE) or standard I/O for live interaction with agents and teams.
Dynamic Team Allocation
Flexibly assign and reassign agents across teams in real-time, ensuring the most effective use of resources based on evolving tasks.
MCP-Compatible Infrastructure
Built with modularity in mind, RAI is ready for future integration with Model Context Protocols (MCP) tools, providing scalability for advanced use cases.
Fast & Flexible Configuration
With intelligent defaults and customizable configurations, setting up RAI is a breeze, providing flexibility for both novice and advanced users.
Built-In Reasoning Engine
Agents are equipped with reasoning capabilities, allowing them to think, analyze, and respond intelligently to complex scenarios before executing tasks.
Cybersecurity Automation Focus
Tailored specifically for offensive security activities such as reconnaissance, exploitation, and team coordination, making it a powerful tool for red teamers and penetration testers.
Agent-to-Agent Communication
Agents within teams can now communicate with one another, ensuring optimal task delegation and efficient collaboration across specialized agents.

What's Included in V1.0.0:

Full Interactive CLI Shell for agent and team interaction.
Pre-configured YAML templates to quickly build agents and teams.
A reasoning engine for smarter, context-aware decision-making.
Support for multi-agent/team orchestration with real-time agent interaction.
Flexible tool integration (SSE, stdio) to enhance agent capabilities.
Real-time agent reconfiguration to adapt to dynamic task requirements.
Cybersecurity-focused tools for offensive security tasks.

Assets 2

Releases: RevoltSecurities/RAI

V2.1. 0 Version Release

RAI v2.1.0 Release Notes

What's New

Subagents v2 — Async Background Agent System

Claude SDK Mode — No API Key Required

Remote TUI (--url)

Docker Container Routing (--docker)

API Auth Hardening

New Middleware (7 layers added)

Uh oh!

V2.0.2 Version Release

What's New

6–8× Cost Reduction via Prompt Caching Parity with Claude Code

What changed under the hood

Automatic upgrade

Extended Thinking Enabled by Default

MITM Proxy Support for Debugging

Bug Fixes

Uh oh!

V2.0.1 Release

RAI v2.0.1 Release Notes

What's New

Token Cost Reduction (60–80% on long sessions)

Loop Detection

Plan Mode Improvements

TUI Improvements

Model Call Diagnostics

Bug Fixes

Uh oh!

v2.0.0

RAI v2.0.0 Release Notes

Highlights

Top Features in v2

Interactive Security Operator

Plan Mode

Persistent Memory

Multi-Agent Execution

HTTP Streaming API

Textual TUI

MCP and Skills

Security Tooling

Docker and GHCR

Packaging Notes

What Changed at a Glance

Compatibility

Uh oh!

V1.0.1

🛠️ RAI v1.0.1 – Release

✨ New Features & Enhancements

🧠 GUI-Based Agent & Team Creation

🌐 Web & API Integration Interface

🔌 Enhanced MCP Tool Integration

🧬 New LLM Model Support

🧠 Memory Context Configuration

💬 Chat Session & State Management

Contributors:

Uh oh!

V1.0.0

RAI (Revolt AI Agent) Version V1.0.0 Release Notes

New Features in V1.0.0

What's Included in V1.0.0:

Uh oh!

Remote TUI (`--url`)

Docker Container Routing (`--docker`)