Skip to content

Releases: RevoltSecurities/RAI

V2.1. 0 Version Release

21 Jun 05:34
6e8ae8a

Choose a tag to compare

RAI v2.1.0 Release Notes

What's New

Subagents v2 — Async Background Agent System

Agents can now spawn, monitor, and coordinate parallel background agents from within a single session. The new SubagentService manages full lifecycle including HITL, output discipline, and completion notifications injected back into the parent agent's context.

10 new LLM tools available to the agent:

Tool Description
Subagent Launch a single background agent; returns task_id immediately
MultiSubagents Launch multiple agents in parallel
Pipeline DAG of tasks with dependency ordering
SubagentStatus Get current status + output of a task
SubagentCancel Cancel a running task
SubagentUpdate Send a follow-up message to a completed task
SubagentResponse Block and wait for task completion
SubagentProgress Read live checkpoint of a running task
ListSubagents List available subagent types
ListTasks List all tasks for the current run

Output discipline: every subagent writes structured output to /tmp/rai_{task_id}_output.md. The parent agent receives a ≤400-char preview + file path when the task completes — subagent context never pollutes the main agent's checkpoint.


Claude SDK Mode — No API Key Required

RAI now includes a full Claude Code subscription-backed mode that requires no ANTHROPIC_API_KEY.

rai claude                        # Claude TUI (HITL on by default)
rai claude --no-hitl              # autonomous mode
rai claude --agent pentest        # named agent
rai claude --model claude-opus-4-8
rai claude-serve                  # FastAPI SSE proxy at :8001
rai claude --url http://host:8001 # connect TUI to remote claude-serve

The Textual TUI (rai claude) streams tokens live with tool call cards, subagent progress, thinking blocks, and inline HITL approval panels.

Python SDK:

from rai.sdk import ClaudeAgent

async with ClaudeAgent.builder().agent_name("rai").without_hitl().build() as agent:
    result = await agent.run("scan example.com for vulnerabilities")

Remote TUI (--url)

Connect the Claude TUI to a remote rai claude-serve without spawning a local subprocess:

rai claude --url http://remote-server:8001 --api-key <key>

Health-checks the remote server before launching; exits with a clear error if unreachable.


Docker Container Routing (--docker)

All bash and execute tool calls can be routed to a persistent Docker container instead of the host. Bring any image — Kali, Parrot, or a custom pentest toolkit.

rai chat --docker kalilinux/kali-rolling
rai chat --docker parrotsec/security

A shared workspace is mounted at /workspace inside the container. The container is started on server boot and stopped on exit. No default image is provided — users supply their own.

Container capabilities: NET_RAW, NET_ADMIN, SYS_PTRACE, SETUID, CHOWN, DAC_OVERRIDE, NET_BIND_SERVICE, KILL


API Auth Hardening

  • Fixed: /docs, /openapi.json, and /redoc were bypassing the X-API-Key guard on the HTTP harness when --server-key was set — an attacker on the network could enumerate all API endpoints and their full schemas without a key. Now only /ok (health check) is exempt.
  • Both servers (rai http serve and rai claude-serve) now log a startup warning when launched without an API key.

New Middleware (7 layers added)

Middleware Role
AllowedToolsMiddleware Hard-enforce per-run tool whitelist via ContextVar
ReadFileDedupMiddleware Stub unchanged file re-reads via mtime check
SubagentRestrictMiddleware Block task tool for subagents (no recursive nesting)
DockerRouterMiddleware Route bash/execute to Docker container when --docker is active
RetryMiddleware Auto-retry 429/502/503/timeouts with exponential backoff
WriteTruncateMiddleware Truncate file-write args in old messages; last 5 writes kept verbatim
CostTrackingMiddleware Accumulate per-run cost in ContextVar

V2.0.2 Version Release

03 Jun 05:42
e618510

Choose a tag to compare

What's New

6–8× Cost Reduction via Prompt Caching Parity with Claude Code

RAI now sends requests using the Anthropic wire format (POST /v1/messages) instead of the OpenAI format (POST /chat/completions). This single change unlocks full prompt caching — the same strategy used by Claude Code — saving 60–90% on input token costs for long sessions.

Before v2.0.2: $40–60 for a full 6-step VAPT session
After v2.0.2: $5–7 for the same session

What changed under the hood

Change Impact
ChatAnthropic replaces ChatLiteLLM for Claude models cache_control preserved through proxy
System prompt (70k chars) cached with ephemeral ~28k tokens saved every turn after first
Tool definitions (90 tools) cached ~35k tokens saved every turn after first
Last human message cached Full history served from cache on next turn
Cache TTL removed (was 5 min, now 1h default) 12× longer cache lifetime

Automatic upgrade

All existing Claude model configs (litellm:openai/bedrock-claude-*, anthropic:claude-*) are automatically upgraded to ChatAnthropic routing at runtime. No config changes required.

To explicitly use the new routing:

rai agents config-set rai \
  --model "chatanthropic:bedrock-claude-sonnet-4.6-(US)" \
  --api-key "sk-..." \
  --base-url "https://your-litellm-proxy.example.com"

Extended Thinking Enabled by Default

RAI now sends thinking: {type: enabled, budget_tokens: 31999} on every call — matching Claude Code's behavior. This improves reasoning quality for complex security assessments, reducing mistakes and re-runs.

Temperature override: Anthropic requires temperature=1.0 when extended thinking is enabled. RAI enforces this automatically for all Claude models. Your config.toml temperature setting is ignored while thinking is active. Non-Claude models (OpenAI, Gemini, Ollama) are unaffected.

To disable thinking and restore your configured temperature:

RAI_THINKING=0 rai chat          # per-run
export RAI_THINKING=0            # permanent
Mode Temperature used Notes
RAI_THINKING=1 (default) 1.0 (forced by Anthropic) Best reasoning quality
RAI_THINKING=0 Your config.toml value (default 0.7) Standard mode, lower cost

MITM Proxy Support for Debugging

Capture every LLM request in Burp Suite or mitmproxy:

RAI_INSPECT=1 RAI_INSPECT_PROXY=http://127.0.0.1:8080 rai chat

Works correctly with macOS system proxies (WARP, VPN) — those are bypassed automatically.


Bug Fixes

  • Fixed StaticSystemPromptCacheBreakpointMiddleware not tagging system[0] due to _should_apply_caching returning False for ChatAnthropic in deepagents
  • Fixed AnthropicPromptCachingMiddleware stamping ttl: "5m" on all cache blocks (now defaults to Anthropic's 1h)
  • Fixed RequestInspectorMiddleware failing on macOS when WARP/VPN SOCKS proxy is active

V2.0.1 Release

02 Jun 16:12
90cec8f

Choose a tag to compare

RAI v2.0.1 Release Notes

What's New

Token Cost Reduction (60–80% on long sessions)

RAI now runs a 3-layer compression pipeline before every model call, keeping costs flat as sessions grow longer:

  1. History trim — clips conversation to a token budget before the model sees it
  2. Tool result compression — old bash/grep/file outputs are truncated; recent results stay verbatim
  3. Summarization — only fires when the first two layers aren't enough

Combined with a configurable cheap model for summarization, a typical VAPT or SAST session now costs significantly less than before.

Configure a cheaper summarization model:

rai agents config-set --compact-model "litellm:openai/bedrock-claude-haiku-4.5"

# With explicit credentials if different from main
rai agents config-set \
  --compact-model "litellm:openai/bedrock-claude-haiku-4.5 \
  --compact-api-key "sk-..." \
  --compact-base-url "https://llmproxy.example.com"

# Clear it (inherit main model)
rai agents config-set --compact-model ""

Also configurable via RAI_COMPACT_MODEL env var.


Loop Detection

The agent no longer gets stuck re-executing the same command. When identical tool calls are detected, RAI returns the cached result with a warning instead of executing again:

⚠ DUPLICATE CALL BLOCKED: 'bash' was already executed 5 times and returned:
0 matches for shell_exec|proc_open|...
This result is final. Accept it and proceed to the next step.

Configurable: RAI_LOOP_WINDOW=10 (default 10 recent calls tracked).


Plan Mode Improvements

  • Plan completion no longer repeats the full step history in the tool result — agent reads its own notes directly
  • list_plan_steps returns compact summaries for completed steps, full detail for pending ones
  • Memory phase at plan exit now explicitly guides the agent to write target-specific methodology to scope='target'

TUI Improvements

  • Thread resume now shows the most recent messages instead of the oldest
  • Internal control messages no longer appear as user messages in history

Model Call Diagnostics

Enable detailed per-call logging to verify token consumption:

RAI_DEBUG_LOG_CALLS=1 rai chat
# Logs to ~/.rai/debug/model-calls.jsonl

Each entry shows: message count, total chars, estimated tokens, truncated count, message type breakdown.


Bug Fixes

  • pip install revolt-rai now works — previous wheel was empty due to packaging misconfiguration
  • Session approval (approve_for_session) now persists correctly across tool calls
  • Server shuts down cleanly when TUI exits or crashes
  • Thread resume loads with the correct agent graph when multiple agents are registered
  • Audit log no longer blocks the event loop on busy servers
  • content_file in memory_write now restricted to safe paths only

v2.0.0

31 May 15:58
f3c0f44

Choose a tag to compare

RAI v2.0.0 Release Notes

RAI v2 is a full rewrite and release refresh of the open-source AI security operator. This version focuses on making the main workflow faster to start, easier to package, and broader in capability across autonomous security work.

Highlights

  • rai chat is now part of the default install and starts the HTTP-backed TUI experience out of the box.
  • The HTTP server stack is included in the base package, so the main interactive flow works without extra dependencies.
  • The Docker image is now supported for local testing and GHCR publishing.
  • The project now ships with a single release commit history for the v2 baseline.
  • Versioning is aligned across the package, CLI, and runtime as 2.0.0.

Top Features in v2

Interactive Security Operator

RAI is built to operate as a terminal-native security assistant, not a generic chat client. It can run autonomous tasks, manage approvals, and coordinate work through the CLI and TUI.

Plan Mode

Complex work can be broken into structured steps, reviewed before execution, and tracked live during the run. This is the main control point for high-trust autonomous workflows.

Persistent Memory

RAI stores user, agent, and target memory across sessions so it can preserve methodology, findings, preferences, and target context over time.

Multi-Agent Execution

RAI can dispatch specialized subagents for recon, research, code analysis, cloud work, reversing, Android analysis, and other focused tasks.

HTTP Streaming API

The FastAPI-backed server exposes runs, threads, tasks, subagents, HITL approval, and SSE event streams for remote control and TUI integration.

Textual TUI

The built-in TUI gives a rich operator interface for approvals, plan review, findings, threads, model selection, and background runs.

MCP and Skills

RAI integrates with MCP tools and Markdown-based skills so capabilities can be extended without changing the core agent runtime.

Security Tooling

The toolkit covers bash execution, findings management, memory, references, web security, cloud, container, Active Directory, Android, and reversing workflows.

Docker and GHCR

The project now includes a container build path and a release workflow that publishes Docker images to GitHub Container Registry.

Packaging Notes

  • Base install: pip install revolt-rai
  • No extra is required for rai chat
  • Optional provider extras remain available for alternate model backends

What Changed at a Glance

  • Unified release version: 2.0.0
  • Default interactive path now works without optional HTTP extras
  • Docker build and publish workflow added
  • Source tree and release metadata aligned for the v2 baseline

Compatibility

This release is intended as the new v2 baseline. If you are upgrading from an older branch, review agent configs, custom prompts, and any local workflows that assumed the older packaging layout.

V1.0.1

23 Jul 16:10
e91e042

Choose a tag to compare

🛠️ RAI v1.0.1 – Release

✨ New Features & Enhancements

🧠 GUI-Based Agent & Team Creation

  • Introduced a full-featured GUI for building YAML configurations.
  • Enables no-code creation of agents and teams, avoiding the need for manual YAML editing.
  • Ideal for faster prototyping and collaboration with less technical users.

🌐 Web & API Integration Interface

  • Added built-in Web and REST API interfaces for extending LLM agents with external tools and services.
  • Allows seamless communication with reconnaissance tools, exploit chains, or third-party automation systems.

🔌 Enhanced MCP Tool Integration

  • Improved compatibility and deeper integration with Model Context Protocol (MCP) infrastructure.
  • Supports enhanced tool orchestration and model routing across agents and teams.

🧬 New LLM Model Support

  • Added support for multiple new LLM models for use in agent and team configurations.
  • Enhances flexibility in selecting language models optimized for different tasks (recon, exploit writing, reasoning, etc).

🧠 Memory Context Configuration

  • Introduced user memory context configuration support.
  • Agents can now recall past interactions, preferences, and task-specific memory to enable smarter and more personalized responses.

💬 Chat Session & State Management

  • Implemented persistent chat state tracking for both individual agents and teams.
  • Agents now remember previous tasks, enabling multi-step reasoning and long-term coordination across sessions.

Contributors:

V1.0.0

01 May 17:38
5c6496c

Choose a tag to compare

RAI (Revolt AI Agent) Version V1.0.0 Release Notes

We are proud to announce the release of RAI (Revolt AI Agent) V1.0.0! This marks the first stable version of the framework, packed with exciting features designed to help cybersecurity professionals automate penetration testing, red teaming, and offensive security tasks using advanced LLM-based agents and teams.


New Features in V1.0.0

  • Interactive Shell Mode
    Switch between LLM Teams and Agents seamlessly within an advanced shell interface. Engage in real-time conversations with selected agents and teams, optimizing workflow and enhancing interaction.

  • Low-Code YAML-Based Agent & Team Building
    Easily build and configure agents and teams using simple YAML templates, making automation quick and accessible without the need for traditional coding.

  • Multiple Agent & Team Support
    Manage and run multiple agents or teams in parallel, each with full isolation to ensure efficient, controlled, and secure operations.

  • Tool Integration (SSE & stdio)
    Integrate third-party tools effortlessly using Server-Sent Events (SSE) or standard I/O for live interaction with agents and teams.

  • Dynamic Team Allocation
    Flexibly assign and reassign agents across teams in real-time, ensuring the most effective use of resources based on evolving tasks.

  • MCP-Compatible Infrastructure
    Built with modularity in mind, RAI is ready for future integration with Model Context Protocols (MCP) tools, providing scalability for advanced use cases.

  • Fast & Flexible Configuration
    With intelligent defaults and customizable configurations, setting up RAI is a breeze, providing flexibility for both novice and advanced users.

  • Built-In Reasoning Engine
    Agents are equipped with reasoning capabilities, allowing them to think, analyze, and respond intelligently to complex scenarios before executing tasks.

  • Cybersecurity Automation Focus
    Tailored specifically for offensive security activities such as reconnaissance, exploitation, and team coordination, making it a powerful tool for red teamers and penetration testers.

  • Agent-to-Agent Communication
    Agents within teams can now communicate with one another, ensuring optimal task delegation and efficient collaboration across specialized agents.


What's Included in V1.0.0:

  • Full Interactive CLI Shell for agent and team interaction.
  • Pre-configured YAML templates to quickly build agents and teams.
  • A reasoning engine for smarter, context-aware decision-making.
  • Support for multi-agent/team orchestration with real-time agent interaction.
  • Flexible tool integration (SSE, stdio) to enhance agent capabilities.
  • Real-time agent reconfiguration to adapt to dynamic task requirements.
  • Cybersecurity-focused tools for offensive security tasks.