Agent Runner

CLI workflow orchestrator for AI agents written in Go. Runs multi-step workflows by spawning separate agent sessions for each step, keeping orchestration deterministic and outside the agent.

Why

Agents are good at execution, bad at orchestration. When given a complex multi-step workflow, they lose track of sequence, skip steps, accumulate stale context, and ignore instructions buried deep in prompts. Agent Runner solves this by moving orchestration out of the agent entirely. Each step gets a fresh or resumed session, a focused prompt in the highest-attention position, and a single responsibility.

Why not use an existing workflow tool?

There are many YAML-based workflow engines (Argo, Kestra, Step Functions) and CLI task runners (Taskfile, Just, Make). The cloud/server orchestrators have rich control flow but can't run local CLI processes. The CLI task runners can run shell commands but collapse into bash scripts the moment you need loop-until with multi-step bodies, mid-pipeline output capture, or conditional branching. None of them have the concepts that agent orchestration requires: session management across steps, interactive/headless mode switching, prompt-based agent steps, or signal-based advancement. Agent Runner borrows proven workflow primitives (for-each, loop-until, sub-workflows, output capture) from these systems and adds a purpose-built runtime for orchestrating stateful conversational agents. See docs/WHY-AGENT-RUNNER.md for the full comparison.

Features

Multi-CLI support: invoke Claude, Codex, or other agent backends through a uniform adapter interface
Three step modes: interactive (collaborative), headless (autonomous), shell (CLI commands)
Session management: new, resume, or inherit sessions across steps and sub-workflows
Loops: counted loops (loop: { max: N }) and for-each loops (loop: { over, as }) with break_if conditions
Sub-workflows: compose workflows from reusable workflow files with parameter passing
Output capture: capture shell stdout into variables for use in subsequent steps (capture field with tee behavior)
Flow control: continue_on_failure, skip_if: previous_success, break_if: success|failure
Per-step model override: specify which model an agent step should use (model field)
State and resumption: state.json persists after each step for resume on interruption
Audit logging: structured log of every execution event (step start/end, iterations, sub-workflows) for post-failure troubleshooting
Engines: pluggable lifecycle hooks for prompt enrichment, step validation, and state management
PTY support: improved terminal I/O for interactive agent sessions (via Go's creack/pty)

Install

Homebrew (macOS / Linux)

brew tap Codagent-AI/tap
brew install agent-runner

From source

Requires Go 1.23+.

go install github.com/codagent/agent-runner/cmd/agent-runner@latest

Or clone and build:

make build       # compiles to bin/agent-runner
make test        # run tests
make lint        # run golangci-lint

Quick start

# Validate a workflow
./agent-runner -validate flokay

# Run a workflow with parameters
./agent-runner flokay my-change-name

# Resume an interrupted workflow (most recent session)
./agent-runner -resume

# Resume a specific session
./agent-runner --session <session-id>

How it works

Agent Runner reads a YAML workflow file and executes steps sequentially. Each step is one of several types:

Type	What happens	Use case
interactive	Agent runs with full stdin. User works with it, types `/continue` to advance.	Collaborative steps (proposal, specs, design)
headless	Agent runs with `-p` flag. Output streams to terminal. Auto-advances on exit.	Autonomous steps (tasks, review, implementation)
shell	Runs a shell command directly, no agent.	CLI operations (`openspec new`, `git commit`)
loop	Repeats child steps (counted or for-each).	Iterating over tasks, retry loops
sub-workflow	Invokes another workflow file.	Reusable workflow composition

agent-runner (harness)
  |
  +-- step 1: shell        -> sh -c "openspec new change my-feature"
  +-- step 2: interactive  -> claude "Write the proposal..."
  +-- step 3: headless     -> claude -p "Generate specs..."
  +-- step 4: loop (per-task)
  |     +-- step 4a: headless  -> claude -p "Implement {{task_file}}"
  |     +-- step 4b: sub-workflow -> workflows/core/run-validator.yaml
  +-- step 5: headless     -> claude -p "Finalize..."

Session management

Each agent step declares a session strategy:

session: new -- Fresh session, no prior context. Agent reads what it needs from disk.
session: resume -- Continues the most recent session within the current workflow. Also picks up a session seeded via --session.
session: inherit -- Crosses sub-workflow boundaries to resume the parent workflow's most recent session.

State and resumption

Agent Runner writes state.json after each step. If a workflow is interrupted, agent-runner resume picks up from where it left off, including persisted session IDs, captured variables, and parameters. State is recursive -- nested loops and sub-workflows track their own position.

Engines

Workflows can declare an engine that hooks into the execution lifecycle:

enrichPrompt -- Append context (templates, output paths, dependencies) to step prompts
validateStep -- Verify expected output was created after a step
validateWorkflow -- Check workflow structure at load time
getStateDir -- Control where the state file lives

The built-in openspec engine integrates with the OpenSpec CLI to inject artifact context and validate artifact completion.

Workflow format

name: my-workflow
description: "What this workflow does"

params:
  - name: change_name
    required: true

engine:                          # optional
  type: openspec
  change_param: change_name

steps:
  - id: create
    mode: shell
    command: openspec new change "{{change_name}}"

  - id: proposal
    mode: interactive
    session: new
    prompt: /codagent:propose "{{change_name}}"

  - id: implement
    workflow: implement-change.yaml
    params:
      change_name: "{{change_name}}"

  - id: verify
    mode: headless
    session: new
    model: sonnet
    prompt: "Verify the implementation"

  - id: codex-review
    mode: headless
    cli: codex
    model: o3
    prompt: "Review the implementation"

Step fields

Field	Required	Description
`id`	yes	Unique step identifier. Used for `--from`, state tracking, and engine matching.
`mode`	agent/shell	`interactive`, `headless`, or `shell`
`prompt`	agent steps	Prompt passed to the agent. Supports `{{param}}` interpolation.
`command`	shell steps	Shell command to execute. Supports `{{param}}` interpolation.
`session`	no	`new` (default), `resume`, or `inherit`. Only applies to agent steps.
`cli`	no	CLI backend for agent steps: `claude` (default) or `codex`.
`model`	no	Model override for agent steps. Passed through the CLI adapter.
`capture`	no	Variable name to capture shell stdout into. Shell steps only.
`continue_on_failure`	no	If `true`, workflow continues even if this step fails.
`skip_if`	no	`previous_success` (skip on prior success) or `sh: <cmd>` (skip when interpolated shell command exits 0).
`break_if`	no	`success` or `failure` -- break out of enclosing loop on this condition.
`loop`	no	`{ max: N }` for counted loops, `{ over: glob, as: var }` for for-each.
`steps`	loop/group	Nested child steps (required for loops, optional for groups).
`workflow`	sub-workflow	Path to another workflow YAML file.
`params`	sub-workflow	Parameters to pass to the sub-workflow.

Parameter interpolation

Parameters declared in params: are passed as positional arguments:

./agent-runner my-workflow value1 value2

Referenced in prompts and commands as {{param_name}}. Captured variables from shell steps are also available via {{var_name}}.

CLI reference

agent-runner [flags] <workflow-name> [params...]
agent-runner -validate <workflow-name>
agent-runner -resume [--session <id>]
agent-runner --session <id>

-session

Seeds the resume with a specific session ID. Implies --resume.

# Resume a specific session
./agent-runner --session abc-123-def

The seed propagates through sub-workflows and loop iterations, so it works regardless of nesting depth. If no step uses session: resume, the seeded session is ignored.

Configuration

Configuration resolves in layers, each overriding the previous:

Global (user-level) — default_model
Project-level — project-specific defaults
Step-level — cli and model fields on individual steps

Steps default to the claude CLI adapter. Use cli: codex on a step to invoke Codex instead. The model field is passed through the CLI adapter.

Planned: Workflow extensibility

Users will be able to extend base workflows without redefining the entire pipeline:

Extend a base workflow and inherit its steps
Override specific steps (agent, prompt, mode) while keeping the rest
Add new steps at specific positions

Design goal: modify behavior without rewriting entire workflows.

Architecture

.
├── cmd/
│   └── agent-runner/
│       ├── main.go            # CLI entry (flag-based)
│       └── helpers.go         # process runner, glob expander
│
├── internal/
│   ├── model/
│   │   ├── step.go            # Step, Loop, Param, Workflow structs
│   │   ├── context.go         # ExecutionContext, nesting
│   │   └── state.go           # RunState serialization
│   │
│   ├── loader/
│   │   └── loader.go          # YAML loading, param interpolation
│   │
│   ├── runner/
│   │   ├── runner.go          # Workflow execution loop
│   │   └── resume.go          # State restoration
│   │
│   ├── cli/
│   │   ├── adapter.go         # Adapter interface & registry
│   │   ├── claude.go          # Claude CLI adapter
│   │   └── codex.go           # Codex CLI adapter
│   │
│   ├── exec/                  # Step executors
│   │   ├── agent.go           # Agent step executor
│   │   ├── shell.go           # Shell step executor
│   │   ├── loop.go            # Loop executor
│   │   ├── subworkflow.go     # Sub-workflow executor
│   │   ├── dispatch.go        # Step type routing
│   │   └── interfaces.go      # ProcessRunner, GlobExpander, Logger
│   │
│   ├── engine/
│   │   ├── engine.go          # Engine interface & registry
│   │   └── openspec/
│   │       └── openspec.go    # OpenSpec engine implementation
│   │
│   ├── session/
│   │   └── session.go         # Session resolution (new, resume, inherit)
│   │
│   ├── flowctl/
│   │   └── flowctl.go         # skip_if, break_if evaluation
│   │
│   ├── textfmt/
│   │   ├── interpolation.go   # {{variable}} interpolation
│   │   └── format.go          # Formatting utilities
│   │
│   ├── stateio/
│   │   └── stateio.go         # State file read/write
│   │
│   ├── audit/
│   │   ├── types.go           # Event types
│   │   └── logger.go          # AuditLogger
│   │
│   └── validate/
│       └── workflow.go        # Workflow constraint validation
│
├── go.mod
├── go.sum
└── Makefile

Development

make build        # compile to agent-runner binary
make test         # run all tests
make test-verbose # run tests with output
make test-cover   # run tests with coverage
make lint         # run golangci-lint
make fmt          # format code (goimports)

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 319 Commits
.agent-runner		.agent-runner
.claude		.claude
.config		.config
.github		.github
.validator		.validator
cmd/agent-runner		cmd/agent-runner
docs		docs
internal		internal
openspec		openspec
testdata		testdata
workflows		workflows
.gitignore		.gitignore
.golangci.yml		.golangci.yml
.goreleaser.yaml		.goreleaser.yaml
AGENTS.md		AGENTS.md
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
codex-pty-poc.log		codex-pty-poc.log
dev.sh		dev.sh
go.mod		go.mod
go.sum		go.sum
pty-poc		pty-poc

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Agent Runner

Why

Why not use an existing workflow tool?

Features

Install

Homebrew (macOS / Linux)

From source

Quick start

How it works

Session management

State and resumption

Engines

Workflow format

Step fields

Parameter interpolation

CLI reference

-session

Configuration

Planned: Workflow extensibility

Architecture

Development

License

About

Uh oh!

Releases 2

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Agent Runner

Why

Why not use an existing workflow tool?

Features

Install

Homebrew (macOS / Linux)

From source

Quick start

How it works

Session management

State and resumption

Engines

Workflow format

Step fields

Parameter interpolation

CLI reference

-session

Configuration

Planned: Workflow extensibility

Architecture

Development

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 2

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages