A3S Code

Production-Ready AI Coding Agent Framework

Application layer — high-performance gRPC service for AI-powered code generation, tool execution, and multi-session management

Features • Quick Start • Architecture • SDK • API Reference • Development

Overview

A3S Code is a high-performance Rust framework for building AI coding agents. It provides a complete gRPC-based service with 78 RPCs for tool execution, multi-session management, and extensible integrations.

Basic Usage

import { A3sClient } from '@a3s-lab/code';

const client = new A3sClient({ address: 'localhost:4088' });

// Create session and generate response
const { sessionId } = await client.createSession({ workspace: '/project' });

for await (const event of client.streamGenerate(sessionId, [
  { role: 'user', content: 'Explain this codebase' }
])) {
  if (event.content) process.stdout.write(event.content);
}

await client.destroySession(sessionId);

Features

Multi-Session Management: Run multiple independent AI conversations with isolated context and permissions
10 Built-in Tools: bash, read, write, edit, patch, grep, glob, ls, web_fetch, web_search
Permission System: Allow/Deny/Ask rules for fine-grained tool access control
Human-in-the-Loop (HITL): Require user confirmation before sensitive operations
Skills System: Extend the agent with custom tools defined in Markdown files
Subagent System: Delegate specialized tasks to focused child agents (explore, general, plan)
LSP Integration: Code intelligence via Language Server Protocol (hover, definition, references, symbols, diagnostics)
MCP Support: Extend with external tools via Model Context Protocol
Cron Scheduling: Schedule recurring tasks with cron expressions or natural language
Context Compaction: Automatically summarize long conversations to stay within context limits, with auto-compact triggered at a configurable usage threshold (default 80%)
Streaming Responses: Real-time event streaming for responsive UI updates
Planning & Goal Tracking: Create execution plans and track goal achievement
Memory System: Episodic, semantic, procedural, and working memory for persistent knowledge
Provider Configuration: Multi-provider LLM support with per-model API key and base URL overrides
API Retry with Backoff: Automatic retry with exponential backoff and jitter for transient LLM API errors (429, 500, 502, 503, 529), with Retry-After header support
File Version History: Automatic file snapshots before write/edit/patch operations with diff generation and version restore
Per-Session Token Cost Tracking: Automatic cost calculation per session using model-specific pricing (input/output/cache tokens)
Session Export to Markdown: Export session conversations to readable Markdown with metadata, tool calls, and usage statistics
Session Fork: Fork existing sessions with full history, configuration, and state copied to a new independent session
Auto Title Generation: LLM-powered automatic session title generation from conversation content
Todo Tracking: Task management within sessions
OpenAI Compatibility: OpenAI-compatible message format and chat completion API
Tool Execution Metrics: Per-session tool call tracking with duration, success/failure rate, and per-tool aggregated statistics
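
The retry behaviour listed under API Retry with Backoff can be pictured as a small delay calculator. This is a minimal sketch, not the framework's implementation: the function name `retryDelayMs`, the 500 ms base delay, the 30 s cap, and the "full jitter" strategy are all assumptions. Only the retryable status codes and the Retry-After support come from the list above.

```typescript
// Hypothetical sketch: names and constants are illustrative, not taken from A3S Code.
const RETRYABLE_STATUS = new Set<number>([429, 500, 502, 503, 529]); // codes listed above

interface RetryOptions {
  baseMs: number; // assumed delay ceiling for the first retry
  maxMs: number;  // assumed upper bound on any single delay
}

/**
 * Returns the delay before the next attempt, or null if the status is not retryable.
 * `attempt` is 0 for the first retry. `retryAfterSeconds` is the parsed Retry-After
 * header, if the server sent one. `random` is injectable so results are reproducible.
 */
function retryDelayMs(
  status: number,
  attempt: number,
  retryAfterSeconds: number | undefined,
  opts: RetryOptions = { baseMs: 500, maxMs: 30000 },
  random: () => number = Math.random,
): number | null {
  if (!RETRYABLE_STATUS.has(status)) return null;

  // A server-provided Retry-After takes priority over the computed backoff.
  if (retryAfterSeconds !== undefined && retryAfterSeconds >= 0) {
    return Math.min(retryAfterSeconds * 1000, opts.maxMs);
  }

  // Exponential growth: base * 2^attempt, capped at maxMs.
  const ceiling = Math.min(opts.baseMs * 2 ** attempt, opts.maxMs);
  // Full jitter: pick uniformly in [0, ceiling) so concurrent clients spread out.
  return Math.floor(random() * ceiling);
}
```

Injecting `random` keeps the function pure, which makes the jitter easy to pin down in tests.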

Quality Metrics

Test Coverage

1,767 unit tests, with 73.95% line coverage and 70.28% function coverage:

Module	Lines	Line Coverage	Functions	Function Coverage
`prompts`	24	100.00%	5	100.00%
`todo`	130	100.00%	18	100.00%
`hitl`	607	98.85%	80	100.00%
`context`	420	98.81%	67	97.01%
`hooks`	1035	96.04%	136	97.06%
`queue`	176	96.02%	27	92.59%
`store`	274	95.26%	54	79.63%
`subagent`	435	93.56%	60	80.00%
`security`	1261	93.42%	146	94.52%
`permissions`	545	91.56%	76	90.79%
`lane_integration`	359	89.42%	61	81.97%
`planning`	412	88.83%	52	88.46%
`tools`	1998	88.29%	217	90.32%
`config`	514	86.77%	58	86.21%
`telemetry`	285	82.11%	33	90.91%
`session`	2327	80.53%	289	79.93%
`memory`	513	79.92%	103	69.90%
`agent`	1414	75.74%	139	70.50%
`reflection`	350	74.86%	47	74.47%
`session_lane_queue`	449	69.04%	63	74.60%
`mcp`	742	49.87%	112	42.86%
`service`	1186	36.42%	264	15.91%
`convert`	717	33.33%	48	39.58%
`lsp`	1011	32.64%	148	31.76%
`llm`	942	31.74%	69	47.83%
TOTAL	18126	73.95%	2372	70.28%

Run coverage report:

cargo llvm-cov --lib --summary-only

Architecture

System Architecture

┌─────────────────────────────────────────────────────────────┐
│  Your Application (SDK Client)                              │
│  - TypeScript / Python                                      │
├─────────────────────────────────────────────────────────────┤
│  A3S Code (gRPC Server :4088)                               │
│  ┌─────────────┬─────────────┬─────────────┬──────────────┐ │
│  │   Session   │    Tool     │ Permission  │     LLM      │ │
│  │   Manager   │  Executor   │   System    │   Provider   │ │
│  └─────────────┴─────────────┴─────────────┴──────────────┘ │
│  ┌─────────────┬─────────────┬─────────────┬──────────────┐ │
│  │   Skills    │  Subagent   │     LSP     │     MCP      │ │
│  │   System    │   System    │   Support   │   Support    │ │
│  └─────────────┴─────────────┴─────────────┴──────────────┘ │
│  ┌─────────────┬─────────────┬─────────────┬──────────────┐ │
│  │    Cron     │   Memory    │  Planning   │   Context    │ │
│  │  Scheduler  │   System    │   & Goals   │  Compaction  │ │
│  └─────────────┴─────────────┴─────────────┴──────────────┘ │
└─────────────────────────────────────────────────────────────┘

Built-in Tools

Tool	Purpose	Example
`bash`	Execute shell commands	`git status`, `npm install`
`read`	Read files with line numbers	View source code
`write`	Create/overwrite files	Create new files
`edit`	String replacement editing	Modify existing code
`patch`	Apply unified diff patches	Complex multi-line edits
`grep`	Search file contents (ripgrep)	Find function definitions
`glob`	Find files by pattern	`**/*.ts`, `src/**/*.rs`
`ls`	List directory contents	Explore project structure
`web_fetch`	Fetch web content	Download documentation
`web_search`	Search the web	Query multiple search engines

Subagent Types

Agent	Permissions	Use Case
`explore`	read, grep, glob, ls	Find code, understand structure
`general`	all except task	Complex multi-step tasks
`plan`	read, grep, glob, ls	Design implementation approach
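
The table above can be read as a simple tool filter per subagent type. The sketch below encodes it directly; the function name `subagentMayUse` is hypothetical, and how A3S Code enforces these permissions internally is not described here.

```typescript
// Tool sets copied from the Subagent Types table; the helper itself is illustrative.
type SubagentType = 'explore' | 'general' | 'plan';

const READ_ONLY_TOOLS: string[] = ['read', 'grep', 'glob', 'ls'];

function subagentMayUse(agent: SubagentType, tool: string): boolean {
  // `general` may use every tool except `task`.
  if (agent === 'general') return tool !== 'task';
  // `explore` and `plan` are limited to the read-only tools.
  return READ_ONLY_TOOLS.includes(tool);
}
```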

Quick Start

Installation

# Homebrew (macOS / Linux)
brew tap a3s-lab/tap
brew install a3s-code

# Cargo
cargo install a3s-code

# From source
cargo build --release

Run

a3s-code --config ~/.a3s/config.json

Configuration

Create ~/.a3s/config.json:

{
  "defaultProvider": "anthropic",
  "defaultModel": "claude-sonnet-4-20250514",
  "providers": [{
    "name": "anthropic",
    "apiKey": "sk-ant-...",
    "models": [{
      "id": "claude-sonnet-4-20250514",
      "name": "Claude Sonnet 4",
      "toolCall": true
    }]
  }]
}
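
A quick consistency check on this file can catch a `defaultModel` that no provider defines. The types below are inferred from the JSON example only, and `validateConfig` is a hypothetical helper; whether the server performs an equivalent check at startup is not stated here.

```typescript
// Shapes inferred from the example config above; other fields may exist.
interface ModelConfig { id: string; name: string; toolCall: boolean; }
interface ProviderConfig { name: string; apiKey: string; models: ModelConfig[]; }
interface A3sConfig { defaultProvider: string; defaultModel: string; providers: ProviderConfig[]; }

// Hypothetical helper: returns a list of problems; an empty list means the config is consistent.
function validateConfig(config: A3sConfig): string[] {
  const provider = config.providers.find((p) => p.name === config.defaultProvider);
  if (!provider) {
    return [`defaultProvider "${config.defaultProvider}" is not listed in providers`];
  }
  if (!provider.models.some((m) => m.id === config.defaultModel)) {
    return [`defaultModel "${config.defaultModel}" is not defined for provider "${provider.name}"`];
  }
  return [];
}

const exampleConfig: A3sConfig = {
  defaultProvider: 'anthropic',
  defaultModel: 'claude-sonnet-4-20250514',
  providers: [{
    name: 'anthropic',
    apiKey: 'sk-ant-...',
    models: [{ id: 'claude-sonnet-4-20250514', name: 'Claude Sonnet 4', toolCall: true }],
  }],
};
```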

SDK

TypeScript

npm install @a3s-lab/code

import { A3sClient } from '@a3s-lab/code';

const client = new A3sClient({ address: 'localhost:4088' });

const { sessionId } = await client.createSession({ workspace: '/project' });

for await (const event of client.streamGenerate(sessionId, [
  { role: 'user', content: 'Explain this codebase' }
])) {
  if (event.content) process.stdout.write(event.content);
}

await client.destroySession(sessionId);
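
When the full response is needed as one string instead of incremental output, the stream can be drained by a small helper. This is a sketch under assumptions: `collectContent` and `fakeStream` are hypothetical names, and the only event field relied on is the optional `content` shown in the example above.

```typescript
// Minimal event shape: only `content` is taken from the example above.
interface ContentEvent { content?: string; }

// Gathers all text chunks from any async event stream into one string.
async function collectContent(events: AsyncIterable<ContentEvent>): Promise<string> {
  let text = '';
  for await (const event of events) {
    if (event.content) text += event.content;
  }
  return text;
}

// Stand-in stream for illustration; a real call would pass the result of
// client.streamGenerate(sessionId, messages) instead.
async function* fakeStream(): AsyncGenerator<ContentEvent> {
  yield { content: 'Hello, ' };
  yield {}; // events without content are skipped
  yield { content: 'world' };
}
```

Because the helper accepts any `AsyncIterable`, it can be exercised without a running server.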

Python

pip install a3s-code

import asyncio

from a3s_code import A3sClient


async def main():
    async with A3sClient(address="localhost:4088") as client:
        result = await client.create_session(workspace="/project")
        session_id = result["session_id"]

        # Stream the response and print each chunk as it arrives
        async for event in client.stream_generate(session_id, [
            {"role": "user", "content": "Explain this codebase"}
        ]):
            if event.get("content"):
                print(event["content"], end="", flush=True)

        await client.destroy_session(session_id)


asyncio.run(main())

Permission System

// Set permission policy for a session
await client.setPermissionPolicy(sessionId, {
  enabled: true,
  deny: [{ rule: 'Bash(rm -rf:*)' }],
  allow: [{ rule: 'Read(src/**)' }, { rule: 'Grep(src/**)' }],
  ask: [{ rule: 'Bash(*)' }, { rule: 'Write(*)' }],
  defaultDecision: 'PERMISSION_DECISION_ASK',
});
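
To make the policy above concrete, here is one way such rules could be evaluated. Everything in this sketch is an assumption for illustration: the precedence (deny, then allow, then ask, then the default), the pattern semantics (`*`, `prefix:*`, `dir/**`), and the names `parseRule`, `patternMatches`, and `decide`. The actual matching logic in A3S Code may differ.

```typescript
type Decision = 'allow' | 'deny' | 'ask';

interface Policy {
  deny: string[];
  allow: string[];
  ask: string[];
  defaultDecision: Decision;
}

// Splits "Tool(pattern)" into its parts; returns null for malformed rules.
function parseRule(rule: string): { tool: string; pattern: string } | null {
  const match = /^(\w+)\((.*)\)$/.exec(rule);
  return match ? { tool: match[1].toLowerCase(), pattern: match[2] } : null;
}

// Assumed semantics: "*" matches anything, "prefix:*" matches arguments that
// start with prefix, "dir/**" matches anything under dir/, else exact match.
function patternMatches(pattern: string, arg: string): boolean {
  if (pattern === '*') return true;
  if (pattern.endsWith(':*')) return arg.startsWith(pattern.slice(0, -2));
  if (pattern.endsWith('/**')) return arg.startsWith(pattern.slice(0, -2));
  return pattern === arg;
}

// Assumed precedence: deny wins over allow, allow over ask, then the default.
function decide(policy: Policy, tool: string, arg: string): Decision {
  const hits = (rules: string[]): boolean =>
    rules.some((rule) => {
      const parsed = parseRule(rule);
      return parsed !== null
        && parsed.tool === tool.toLowerCase()
        && patternMatches(parsed.pattern, arg);
    });
  if (hits(policy.deny)) return 'deny';
  if (hits(policy.allow)) return 'allow';
  if (hits(policy.ask)) return 'ask';
  return policy.defaultDecision;
}
```

Under these assumptions, `rm -rf /tmp` is denied even though `Bash(*)` would otherwise ask, because deny rules are checked first.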

Human-in-the-Loop (HITL)

// Set confirmation policy
await client.setConfirmationPolicy(sessionId, {
  enabled: true,
  autoApproveTools: ['read', 'grep', 'glob', 'ls'],
  requireConfirmTools: ['bash', 'write', 'edit'],
  defaultTimeoutMs: 30000,
  timeoutAction: 'TIMEOUT_ACTION_REJECT',
});

// Confirm or reject tool execution
await client.confirmToolExecution(sessionId, toolId, true, 'Approved');
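
The confirmation policy above implies a small decision procedure for each tool call. The sketch below is one plausible reading, not the server's logic: `resolveToolCall` is a hypothetical name, a disabled policy is assumed to approve everything, and tools outside `autoApproveTools` are assumed to need confirmation. Only `TIMEOUT_ACTION_REJECT` appears in this document, so other timeout actions are deliberately left unhandled.

```typescript
interface ConfirmationPolicy {
  enabled: boolean;
  autoApproveTools: string[];
  requireConfirmTools: string[];
  defaultTimeoutMs: number;
  timeoutAction: string; // only 'TIMEOUT_ACTION_REJECT' is shown in the example above
}

type Outcome = 'approved' | 'rejected';

/**
 * `userReply` is true/false when the user answered within the timeout,
 * and undefined when the timeout elapsed without an answer.
 */
function resolveToolCall(
  policy: ConfirmationPolicy,
  tool: string,
  userReply: boolean | undefined,
): Outcome {
  // Assumption: a disabled policy or an auto-approved tool needs no confirmation.
  if (!policy.enabled || policy.autoApproveTools.includes(tool)) return 'approved';
  // The user answered in time: follow the answer.
  if (userReply !== undefined) return userReply ? 'approved' : 'rejected';
  // No answer: fall back to the configured timeout action.
  if (policy.timeoutAction === 'TIMEOUT_ACTION_REJECT') return 'rejected';
  throw new Error(`unhandled timeout action: ${policy.timeoutAction}`);
}
```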

Skills System

# ~/.a3s/skills/deploy.md
---
name: deploy
description: Deploy to production
allowed_tools: Bash(kubectl:*)
---
Run kubectl apply to deploy the application.

await client.loadSkill(sessionId, 'deploy');
const skills = await client.listSkills();
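
A skill file like `deploy.md` is front matter followed by a Markdown body. The sketch below shows how such a file could be split into fields; `parseSkill` and the `Skill` interface are hypothetical, only the three keys from the example are read, and the real loader may accept more keys or a fuller YAML syntax.

```typescript
interface Skill {
  name: string;
  description: string;
  allowedTools: string;
  body: string;
}

// Parses "key: value" front matter between two '---' lines, then the body.
function parseSkill(markdown: string): Skill {
  const lines = markdown.split('\n');
  if (lines[0].trim() !== '---') throw new Error('missing opening front-matter delimiter');
  const end = lines.findIndex((line, i) => i > 0 && line.trim() === '---');
  if (end === -1) throw new Error('missing closing front-matter delimiter');

  const fields = new Map<string, string>();
  for (const line of lines.slice(1, end)) {
    // Split at the first colon only, so values such as "Bash(kubectl:*)" survive.
    const colon = line.indexOf(':');
    if (colon === -1) continue; // skip lines that are not "key: value"
    fields.set(line.slice(0, colon).trim(), line.slice(colon + 1).trim());
  }

  return {
    name: fields.get('name') ?? '',
    description: fields.get('description') ?? '',
    allowedTools: fields.get('allowed_tools') ?? '',
    body: lines.slice(end + 1).join('\n').trim(),
  };
}
```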

LSP Integration

await client.startLspServer('rust', 'file:///path/to/project');

const hover = await client.lspHover('/path/to/file.rs', 10, 5);
const defs = await client.lspDefinition('/path/to/file.rs', 15, 10);
const refs = await client.lspReferences('/path/to/file.rs', 20, 8);
const symbols = await client.lspSymbols('main');
const diags = await client.lspDiagnostics('/path/to/file.rs');

Supported: rust-analyzer, gopls, typescript-language-server, pyright, clangd

MCP Support

await client.registerMcpServer({
  name: 'filesystem',
  transport: { stdio: { command: 'npx', args: ['-y', '@modelcontextprotocol/server-filesystem'] } },
  enabled: true,
  env: {},
});

await client.connectMcpServer('filesystem');
const tools = await client.getMcpTools();

Cron Scheduling

const result = await client.createCronJob('backup', 'every day at 2am', 'backup.sh');
const jobs = await client.listCronJobs();
await client.runCronJob(result.job.id);
const history = await client.getCronHistory(result.job.id);
await client.pauseCronJob(result.job.id);
await client.resumeCronJob(result.job.id);

// Parse natural language to cron expression
const parsed = await client.parseCronSchedule('every 5 minutes');
// { cronExpression: '*/5 * * * *', description: 'every 5 minutes' }

Supported natural language formats:

English: every 5 minutes, daily at 2am, every monday at 9:30
Chinese: 每5分钟 (every 5 minutes), 每天凌晨2点 (daily at 2am), 每周一上午9点30分 (every Monday at 9:30am)
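
To illustrate the mapping from phrases to cron expressions, here is a toy converter for the three English formats listed above. It is a sketch, not the parser behind `parseCronSchedule`: the names `toCron` and `parseTime` are hypothetical, it emits standard five-field expressions (minute, hour, day of month, month, day of week), and it ignores the Chinese formats.

```typescript
const WEEKDAYS = ['sunday', 'monday', 'tuesday', 'wednesday', 'thursday', 'friday', 'saturday'];

// Parses times such as "2am", "9:30", or "11:15pm" into 24-hour parts.
function parseTime(text: string): { hour: number; minute: number } | null {
  const m = /^(\d{1,2})(?::(\d{2}))?\s*(am|pm)?$/.exec(text.trim());
  if (!m) return null;
  let hour = Number(m[1]);
  const minute = m[2] ? Number(m[2]) : 0;
  if (m[3] === 'pm' && hour < 12) hour += 12;
  if (m[3] === 'am' && hour === 12) hour = 0;
  if (hour > 23 || minute > 59) return null;
  return { hour, minute };
}

// Returns a five-field cron expression, or null for unsupported input.
function toCron(input: string): string | null {
  const text = input.trim().toLowerCase();

  const everyN = /^every (\d+) minutes?$/.exec(text);
  if (everyN) {
    const n = Number(everyN[1]);
    return n >= 1 && n <= 59 ? `*/${n} * * * *` : null;
  }

  const daily = /^(?:daily|every day) at (.+)$/.exec(text);
  if (daily) {
    const t = parseTime(daily[1]);
    return t ? `${t.minute} ${t.hour} * * *` : null;
  }

  const weekly = /^every (\w+) at (.+)$/.exec(text);
  if (weekly) {
    const dayOfWeek = WEEKDAYS.indexOf(weekly[1]);
    const t = parseTime(weekly[2]);
    return dayOfWeek >= 0 && t ? `${t.minute} ${t.hour} * * ${dayOfWeek}` : null;
  }

  return null;
}
```

With this sketch, `toCron('every 5 minutes')` yields `*/5 * * * *`, matching the parsed example above.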

Memory System

// Store a memory
await client.storeMemory(sessionId, {
  content: 'User prefers TypeScript over JavaScript',
  importance: 0.8,
  tags: ['preference', 'language'],
  memoryType: 'MEMORY_TYPE_SEMANTIC',
});

// Search memories
const results = await client.searchMemories(sessionId, 'TypeScript', ['preference'], 10);

// Get memory stats
const stats = await client.getMemoryStats(sessionId);
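
The search call above takes a query, a tag filter, and a limit. A local stand-in shows how those three parameters could interact. The matching and ranking rules here are assumptions (case-insensitive substring match, all tags required, highest importance first), and `searchLocal` is a hypothetical name; the server-side search may rank differently.

```typescript
interface MemoryItem {
  content: string;
  importance: number; // 0..1, as in the storeMemory example
  tags: string[];
}

// Filters by query text and tags, then returns the most important matches.
function searchLocal(
  items: MemoryItem[],
  query: string,
  tags: string[],
  limit: number,
): MemoryItem[] {
  const needle = query.toLowerCase();
  return items
    .filter((item) => item.content.toLowerCase().includes(needle))
    .filter((item) => tags.every((tag) => item.tags.includes(tag)))
    .sort((a, b) => b.importance - a.importance) // filter() returned a copy, so the input is untouched
    .slice(0, limit);
}
```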

Planning & Goal Tracking

const plan = await client.createPlan(sessionId, 'Refactor auth module');
const goal = await client.extractGoal(sessionId, 'Improve test coverage to 90%');
const check = await client.checkGoalAchievement(sessionId, goal, 'Current coverage: 85%');

API Reference

Lifecycle (4 RPCs)

Method	Description
`healthCheck()`	Check agent health status
`getCapabilities()`	Get agent capabilities, tools, and models
`initialize(workspace, env)`	Initialize agent with workspace
`shutdown()`	Graceful shutdown

Session Management (6 RPCs)

Method	Description
`createSession(config, sessionId, initialContext)`	Create a new session
`destroySession(sessionId)`	Destroy a session
`listSessions()`	List all sessions
`getSession(sessionId)`	Get session details
`configureSession(sessionId, config)`	Update session configuration
`getMessages(sessionId, limit, offset)`	Get conversation history

Code Generation (4 RPCs)

Method	Description
`generate(sessionId, messages)`	Generate response (unary)
`streamGenerate(sessionId, messages)`	Generate response (streaming)
`generateStructured(sessionId, messages, schema)`	Generate structured output (unary)
`streamGenerateStructured(sessionId, messages, schema)`	Generate structured output (streaming)

Skill Management (4 RPCs)

Method	Description
`loadSkill(sessionId, skillName, skillContent)`	Load a skill into session
`unloadSkill(sessionId, skillName)`	Unload a skill
`listSkills(sessionId)`	List available skills
`getClaudeCodeSkills(name)`	Get Claude Code compatible skills

Context Management (3 RPCs)

Method	Description
`getContextUsage(sessionId)`	Get token usage
`compactContext(sessionId)`	Compact conversation context (also auto-triggered when `auto_compact` is enabled and usage exceeds `auto_compact_threshold`)
`clearContext(sessionId)`	Clear all context
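
The auto-compact trigger reduces to a ratio check. A minimal sketch, assuming usage is reported as used and maximum token counts: the `ContextUsage` field names and the `shouldAutoCompact` helper are hypothetical, while the 0.8 default mirrors the 80% threshold mentioned under Features.

```typescript
// Hypothetical shape; the actual getContextUsage response fields are not shown in this document.
interface ContextUsage {
  usedTokens: number;
  maxTokens: number;
}

// True when auto-compaction is enabled and usage exceeds the threshold fraction.
function shouldAutoCompact(
  usage: ContextUsage,
  autoCompact: boolean,
  threshold: number = 0.8,
): boolean {
  if (!autoCompact || usage.maxTokens <= 0) return false;
  return usage.usedTokens / usage.maxTokens > threshold;
}
```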

Event Streaming (1 RPC)

Method	Description
`subscribeEvents(sessionId, eventTypes)`	Subscribe to agent events

Control Operations (3 RPCs)

Method	Description
`cancel(sessionId, operationId)`	Cancel an operation
`pause(sessionId)`	Pause a session
`resume(sessionId)`	Resume a session

Human-in-the-Loop (3 RPCs)

Method	Description
`confirmToolExecution(sessionId, toolId, approved, reason)`	Confirm or reject tool execution
`setConfirmationPolicy(sessionId, policy)`	Set HITL confirmation policy
`getConfirmationPolicy(sessionId)`	Get current confirmation policy

External Task Handling (4 RPCs)

Method	Description
`setLaneHandler(sessionId, lane, config)`	Set lane handler mode
`getLaneHandler(sessionId, lane)`	Get lane handler config
`completeExternalTask(sessionId, taskId, success, result, error)`	Complete an external task
`listPendingExternalTasks(sessionId)`	List pending external tasks

Permission System (4 RPCs)

Method	Description
`setPermissionPolicy(sessionId, policy)`	Set permission policy
`getPermissionPolicy(sessionId)`	Get permission policy
`checkPermission(sessionId, toolName, args)`	Check tool permission
`addPermissionRule(sessionId, ruleType, rule)`	Add a permission rule

Todo Tracking (2 RPCs)

Method	Description
`getTodos(sessionId)`	Get todos for session
`setTodos(sessionId, todos)`	Set todos for session

Provider Configuration (7 RPCs)

Method	Description
`listProviders()`	List all providers
`getProvider(name)`	Get provider details
`addProvider(provider)`	Add a new provider
`updateProvider(provider)`	Update a provider
`removeProvider(name)`	Remove a provider
`setDefaultModel(provider, model)`	Set default model
`getDefaultModel()`	Get default model

Planning & Goal Tracking (4 RPCs)

Method	Description
`createPlan(sessionId, prompt, context)`	Create an execution plan
`getPlan(sessionId, planId)`	Get an existing plan
`extractGoal(sessionId, prompt)`	Extract goal from prompt
`checkGoalAchievement(sessionId, goal, currentState)`	Check goal progress

Memory System (5 RPCs)

Method	Description
`storeMemory(sessionId, memory)`	Store a memory item
`retrieveMemory(sessionId, memoryId)`	Retrieve memory by ID
`searchMemories(sessionId, query, tags, limit)`	Search memories
`getMemoryStats(sessionId)`	Get memory statistics
`clearMemories(sessionId, clearLongTerm, clearShortTerm, clearWorking)`	Clear memories

MCP - Model Context Protocol (5 RPCs)

Method	Description
`registerMcpServer(config)`	Register an MCP server
`connectMcpServer(name)`	Connect to MCP server
`disconnectMcpServer(name)`	Disconnect from MCP server
`listMcpServers()`	List all MCP servers
`getMcpTools(serverName)`	Get available MCP tools

LSP - Language Server Protocol (8 RPCs)

Method	Description
`startLspServer(language, rootUri)`	Start language server
`stopLspServer(language)`	Stop language server
`listLspServers()`	List running servers
`lspHover(filePath, line, column)`	Get hover information
`lspDefinition(filePath, line, column)`	Go to definition
`lspReferences(filePath, line, column)`	Find all references
`lspSymbols(query, limit)`	Search workspace symbols
`lspDiagnostics(filePath)`	Get diagnostics

Cron - Scheduled Tasks (10 RPCs)

Method	Description
`listCronJobs()`	List all cron jobs
`createCronJob(name, schedule, command, timeoutMs)`	Create a cron job
`getCronJob(id, name)`	Get cron job by ID or name
`updateCronJob(id, schedule, command, timeoutMs)`	Update a cron job
`pauseCronJob(id)`	Pause a cron job
`resumeCronJob(id)`	Resume a cron job
`deleteCronJob(id)`	Delete a cron job
`getCronHistory(id, limit)`	Get execution history
`runCronJob(id)`	Manually trigger a job
`parseCronSchedule(input)`	Parse natural language to cron

Observability (1 RPC)

Method	Description
`getToolMetrics(sessionId, toolName)`	Get per-tool execution metrics (calls, duration, success/failure rate)
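
The metrics described above (calls, duration, success/failure rate) are straightforward aggregates over individual tool calls. The sketch below shows the arithmetic on a local list of records; the record and stats shapes and the name `aggregateToolMetrics` are hypothetical and do not reflect the actual `getToolMetrics` response.

```typescript
interface ToolCallRecord {
  tool: string;
  durationMs: number;
  success: boolean;
}

interface ToolStats {
  calls: number;
  successRate: number;   // fraction of calls that succeeded, 0..1
  avgDurationMs: number;
}

// Groups call records by tool name and derives per-tool statistics.
function aggregateToolMetrics(records: ToolCallRecord[]): Map<string, ToolStats> {
  const totals = new Map<string, { calls: number; failures: number; totalMs: number }>();
  for (const record of records) {
    const entry = totals.get(record.tool) ?? { calls: 0, failures: 0, totalMs: 0 };
    entry.calls += 1;
    if (!record.success) entry.failures += 1;
    entry.totalMs += record.durationMs;
    totals.set(record.tool, entry);
  }

  const stats = new Map<string, ToolStats>();
  totals.forEach((entry, tool) => {
    stats.set(tool, {
      calls: entry.calls,
      successRate: (entry.calls - entry.failures) / entry.calls,
      avgDurationMs: entry.totalMs / entry.calls,
    });
  });
  return stats;
}
```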

Development

Dependencies

Dependency	Install	Purpose
`just`	`cargo install just`	Task runner
`cargo-llvm-cov`	`cargo install cargo-llvm-cov`	Code coverage (optional)

Build Commands

# Build
just build                   # Debug build
just release                 # Release build

# Test (with colored progress display)
just test                    # All tests with pretty output
just test-v                  # Verbose output

# Format & Lint
just fmt                     # Format code
just lint                    # Clippy lint
just ci                      # Full CI checks (fmt + lint + test)

# Utilities
just check                   # Fast compile check
just doc                     # Generate and open docs
just clean                   # Clean build artifacts

Project Structure

code/
├── Cargo.toml
├── justfile
├── README.md
├── CLAUDE.md
├── proto/
│   └── code_agent.proto     # gRPC service definition (78 RPCs)
└── src/
    ├── lib.rs               # Library entry point
    ├── service.rs           # gRPC service implementation
    ├── session/             # Session management
    ├── tools/               # Built-in tools (bash, read, write, edit, grep, glob, ls, web)
    ├── skills/              # Skills system (Markdown-based custom tools)
    ├── subagent/            # Subagent system (explore, general, plan)
    ├── provider/            # LLM provider management
    ├── permission/          # Permission system (allow/deny/ask rules)
    ├── hitl/                # Human-in-the-loop confirmation
    ├── lsp/                 # Language Server Protocol integration
    ├── mcp/                 # Model Context Protocol support
    ├── cron/                # Cron scheduling
    ├── memory/              # Memory system
    ├── planning/            # Planning & goal tracking
    └── context/             # Context compaction

A3S Ecosystem

A3S Code is the application layer of the A3S ecosystem — the AI coding agent that runs inside A3S Box.

┌──────────────────────────────────────────────────────────┐
│                    A3S Ecosystem                         │
│                                                          │
│  Infrastructure:  a3s-box     (MicroVM sandbox runtime)  │
│                      │                                   │
│  Application:     a3s-code    (AI coding agent)          │
│                    /   \        ▲                        │
│  Utilities:   a3s-lane  a3s-context  You are here        │
│              (priority   (memory/                        │
│               queue)     knowledge)                      │
└──────────────────────────────────────────────────────────┘

Project	Package	Relationship
box	`a3s-box-*`	Sandbox runtime that hosts `a3s-code`
code	`a3s-code`	AI coding agent (this project)
lane	`a3s-lane`	Priority queue used by `a3s-code` for command scheduling
context	`a3s-context`	Context management used by `a3s-code` for memory

Roadmap

Phase 1: Core ✅ (Complete)

Phase 2: Extensibility ✅ (Complete)

Skills system (Markdown-based custom tools with binary, HTTP, script backends)
Subagent system (explore, general, plan agents with isolated permissions)
Lane integration for priority-based command scheduling
Todo/task tracking within sessions
Provider configuration (multi-provider, per-model overrides)
OpenAI-compatible message format

Phase 3: Ecosystem ✅ (Complete)

LSP integration (hover, definition, references, symbols, diagnostics)
MCP support (register, connect, disconnect, tool discovery)
Cron scheduling (natural language + cron expressions, execution history)
Planning & goal tracking (execution plans, goal extraction, achievement checking)
Memory system (episodic, semantic, procedural, working memory)
Web search with multiple engine support
Claude Code skills compatibility

Phase 4: SDK & API ✅ (Complete)

TypeScript SDK with full 78 RPC coverage (@a3s-lab/code)
Python SDK with full 78 RPC coverage (a3s-code on PyPI)
OpenAI-compatible chat completion API
Comprehensive type exports and documentation
Proto file synchronization across all SDKs

Phase 5: Observability 🚧

End-to-end distributed tracing across the agent lifecycle:

OpenTelemetry Spans: Instrument agent loop with structured spans ✅
- a3s.agent.execute → a3s.agent.turn → a3s.llm.completion / a3s.tool.execute
- Span attributes: session_id, turn_number, model, tool_name, token counts, stop_reason, exit_code, duration_ms, permission
Per-Session Cost Tracking: Per-call recording of model / input_tokens / output_tokens / cost
- Aggregate by: agent, session, day, model
- Export to Prometheus / OTLP for Cost Dashboard
Tool Execution Metrics: Per-session tool call tracking — duration, success/failure rate, per-tool aggregated stats (GetToolMetrics RPC)
Multi-Agent Trace Propagation: Trace context forwarded across subagent calls
SigNoz Dashboard Template: Pre-built dashboard for A3S Code metrics
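
The cost-tracking item above records model, input tokens, output tokens, and cost per call, then aggregates along several dimensions. A minimal sketch of that arithmetic follows. The prices are placeholders, not real provider rates; the names `callCost` and `totalCostBy` are hypothetical; and cache tokens, which the Features list mentions, are left out for brevity.

```typescript
interface UsageRecord {
  sessionId: string;
  model: string;
  day: string; // e.g. '2025-01-31'
  inputTokens: number;
  outputTokens: number;
}

interface Price {
  inputPerMTok: number;  // cost per million input tokens
  outputPerMTok: number; // cost per million output tokens
}

// Placeholder prices for illustration only; these are not real provider rates.
const PRICES: Record<string, Price> = {
  'model-a': { inputPerMTok: 2, outputPerMTok: 10 },
  'model-b': { inputPerMTok: 1, outputPerMTok: 4 },
};

// Cost of a single LLM call under the placeholder price table.
function callCost(record: UsageRecord): number {
  const price = PRICES[record.model];
  if (!price) throw new Error(`no pricing for model ${record.model}`);
  return (record.inputTokens * price.inputPerMTok
    + record.outputTokens * price.outputPerMTok) / 1000000;
}

// Sums call costs grouped by one of the dimensions named in the roadmap item.
function totalCostBy(
  records: UsageRecord[],
  key: 'sessionId' | 'model' | 'day',
): Map<string, number> {
  const totals = new Map<string, number>();
  for (const record of records) {
    totals.set(record[key], (totals.get(record[key]) ?? 0) + callCost(record));
  }
  return totals;
}
```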

Phase 6: Production 📋

WebSocket transport (in addition to gRPC)
Redis/PostgreSQL session persistence
Rate limiting per session/user
Prometheus metrics endpoint
Health check endpoint for load balancers

Phase 7: Security Guards (Architecture Redesign) ✅

Generic security module for TEE-based execution. The src/security/ module provides general-purpose security features independent of any specific crate.

See Known Architecture Issues in the SafeClaw crate for the full design review.

Phase 8: Distributed TEE 📋

Support for SafeClaw's split-process-merge security model:

Coordinator Role: Task decomposition and result aggregation in TEE
Secure Worker Role: Partial sensitive data access in TEE
General Worker Role: Sanitized data only in REE
Validator Role: Independent output verification in TEE
Inter-Agent Communication: Secure channels via a3s-transport with data minimization

License

MIT

Built by A3S Lab
