Helmsman ⚓

A framework for building AI agents in Elixir.

Install it from Hex.pm and browse the guides on HexDocs.

Helmsman treats AI agents as first-class OTP processes that can reason, use tools, and orchestrate complex workflows. Built on Erlang/OTP primitives for reliability and concurrency.

Motivation 💡

Helmsman grew out of practical work building agentic workflows. We needed a framework that:

Integrates naturally with OTP supervision trees
Supports streaming for responsive user experiences
Works with multiple LLM providers without vendor lock-in
Provides extensible tooling for domain-specific capabilities

Rather than wrapping JavaScript agent frameworks, we built Helmsman from scratch using idiomatic Elixir patterns. We are sharing it because Elixir is an excellent fit for building reliable AI agents.

Features ✨

OTP-native: Agents are GenServers that integrate naturally with supervision trees
Streaming: Real-time event streaming for responsive UIs
Tool System: Extensible tools for file operations, shell commands, and more
Multi-Provider: 18+ LLM providers via ReqLLM (Anthropic, OpenAI, Google, etc.)
Telemetry: Built-in observability with :telemetry events

Installation 📦

Add helmsman to your dependencies in mix.exs:

def deps do
  [
    {:helmsman, "~> 0.1.0"}
  ]
end

Quick Start 🚀

1. Define an Agent

defmodule MyApp.CodingAgent do
  use Helmsman

  @impl true
  def tools do
    Helmsman.Tools.coding_tools()
  end
end

2. Start and Use the Agent

# Start the agent with an explicit system prompt override
{:ok, agent} = MyApp.CodingAgent.start_link(
  api_key: System.get_env("ANTHROPIC_API_KEY"),
  system_prompt: """
  You are an expert software engineer.
  Write clean, well-documented code.
  Always run tests after making changes.
  """
)

# Run a prompt
{:ok, response} = Helmsman.run(agent, "Create a GenServer that manages a counter")

# Stream responses for real-time output
Helmsman.stream(agent, "Add documentation to the counter module")
|> Stream.each(fn
  {:text, chunk} -> IO.write(chunk)
  {:tool_call, name, _id, _args} -> IO.puts("\n📦 Using tool: #{name}")
  {:tool_result, _id, result} -> IO.puts("   Result: #{inspect(result)}")
  :done -> IO.puts("\n✅ Done!")
  _ -> :ok
end)
|> Stream.run()

3. Add to Supervision Tree

defmodule MyApp.Application do
  use Application

  def start(_type, _args) do
    children = [
      {MyApp.CodingAgent,
        api_key: System.get_env("ANTHROPIC_API_KEY"),
        system_prompt: "You are a helpful coding assistant."}
    ]

    Supervisor.start_link(children, strategy: :one_for_one)
  end
end

LiveBook 📓

Helmsman works well in LiveBook notebooks with Mix.install/1:

Mix.install([
  {:helmsman, "~> 0.1.0"}
])

Application.put_env(:helmsman, :api_key, System.fetch_env!("ANTHROPIC_API_KEY"))

defmodule NotebookAgent do
  use Helmsman

  @impl true
  def tools do
    Helmsman.Tools.read_only_tools()
  end
end

{:ok, agent} =
  NotebookAgent.start_link(
    system_prompt: "You are a helpful LiveBook assistant."
  )

{:ok, response} =
  Helmsman.run(agent, "Summarize the current notebook context.")

response

For richer notebook output, you can stream events and render them with LiveBook/Kino cells as they arrive.

Configuration ⚙️

API Keys

Set your API key via environment variable, application config, or option:

# Environment variable (recommended) - ReqLLM auto-discovers these
export ANTHROPIC_API_KEY="sk-ant-..."
export OPENAI_API_KEY="sk-..."

# Application config
config :helmsman,
  api_key: "sk-ant-...",
  system_prompt: "You are a helpful coding assistant."

# Per-agent option
MyApp.CodingAgent.start_link(api_key: "sk-ant-...")

Values passed to start_link/1 take precedence over config :helmsman, which takes precedence over agent module defaults.

Agent Options

MyApp.CodingAgent.start_link(
  api_key: "sk-ant-...",                        # Overrides config :helmsman, :api_key
  model: "anthropic:claude-sonnet-4-20250514",  # Overrides config/module default
  system_prompt: "You are helpful.",            # Overrides config/module default
  thinking_level: :medium,                      # Overrides config/module default
  cwd: "/path/to/project",                      # Overrides config/default cwd
  name: MyApp.CodingAgent                       # GenServer name
)

Supported Providers

Thanks to ReqLLM, Helmsman supports 18+ providers:

Provider	Model Format
Anthropic	`anthropic:claude-sonnet-4-20250514`
OpenAI	`openai:gpt-4o`
Google Gemini	`google:gemini-2.0-flash`
Groq	`groq:llama-3.3-70b-versatile`
OpenRouter	`openrouter:anthropic/claude-3.5-sonnet`
xAI	`xai:grok-3`
And 12+ more...	See ReqLLM docs

Built-in Tools 🧰

Default Tool Sets

# Full coding tools: Read, Bash, Edit, Write
def tools, do: Helmsman.Tools.coding_tools()

# Read-only: Read, Bash
def tools, do: Helmsman.Tools.read_only_tools()

Individual Tools

Tool	Description
`Helmsman.Tools.Read`	Read file contents, supports images
`Helmsman.Tools.Bash`	Execute shell commands
`Helmsman.Tools.Edit`	Surgical file edits (find & replace)
`Helmsman.Tools.Write`	Create or overwrite files

Custom Tools 🛠️

Define custom tools by implementing the Helmsman.Tool behaviour:

defmodule MyApp.Tools.Weather do
  use Helmsman.Tool

  @impl true
  def name, do: "get_weather"

  @impl true
  def description, do: "Gets the current weather for a location"

  @impl true
  def parameters do
    %{
      type: "object",
      properties: %{
        location: %{type: "string", description: "City name"}
      },
      required: ["location"]
    }
  end

  @impl true
  def call(%{"location" => location}, _context) do
    case WeatherAPI.get(location) do
      {:ok, data} -> {:ok, "Temperature: #{data.temp}°F"}
      {:error, reason} -> {:error, reason}
    end
  end
end

Events and Callbacks 📡

Handle events during agent execution:

defmodule MyApp.LoggingAgent do
  use Helmsman

  @impl true
  def handle_event({:tool_call, name, _id, _args}, state) do
    Logger.info("Agent calling tool: #{name}")
    {:noreply, state}
  end

  @impl true
  def handle_event({:text, chunk}, state) do
    # Stream to WebSocket, etc.
    {:noreply, state}
  end

  @impl true
  def handle_event(_event, state), do: {:noreply, state}
end

Telemetry

Helmsman emits telemetry events for observability:

:telemetry.attach_many(
  "my-handler",
  [
    [:helmsman, :agent, :start],
    [:helmsman, :agent, :stop],
    [:helmsman, :tool_call, :start],
    [:helmsman, :tool_call, :stop]
  ],
  fn event, measurements, metadata, _config ->
    Logger.info("#{inspect(event)}: #{inspect(measurements)}")
  end,
  nil
)

Streaming API

The streaming API returns an enumerable of events:

Helmsman.stream(agent, "Hello")
|> Enum.each(fn event ->
  case event do
    {:text, chunk} -> IO.write(chunk)
    {:thinking, chunk} -> IO.write(IO.ANSI.faint() <> chunk <> IO.ANSI.reset())
    {:tool_call, name, id, args} -> IO.inspect({name, args})
    {:tool_result, id, result} -> IO.inspect(result)
    {:error, reason} -> IO.puts("Error: #{inspect(reason)}")
    :agent_start -> IO.puts("Agent started")
    :agent_end -> IO.puts("Agent finished")
    :turn_start -> nil
    :turn_end -> nil
    :done -> IO.puts("\nDone")
  end
end)

License

MIT License - see LICENSE for details.

Name		Name	Last commit message	Last commit date
Latest commit History 33 Commits
.github/workflows		.github/workflows
lib		lib
livebooks		livebooks
scripts		scripts
test		test
.formatter.exs		.formatter.exs
.gitignore		.gitignore
AGENTS.md		AGENTS.md
CHANGELOG.md		CHANGELOG.md
LICENSE		LICENSE
MIT.md		MIT.md
README.md		README.md
cliff.toml		cliff.toml
mise.toml		mise.toml
mix.exs		mix.exs
mix.lock		mix.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Helmsman ⚓

Motivation 💡

Features ✨

Installation 📦

Quick Start 🚀

1. Define an Agent

2. Start and Use the Agent

3. Add to Supervision Tree

LiveBook 📓

Configuration ⚙️

API Keys

Agent Options

Supported Providers

Built-in Tools 🧰

Default Tool Sets

Individual Tools

Custom Tools 🛠️

Events and Callbacks 📡

Telemetry

Streaming API

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Helmsman ⚓

Motivation 💡

Features ✨

Installation 📦

Quick Start 🚀

1. Define an Agent

2. Start and Use the Agent

3. Add to Supervision Tree

LiveBook 📓

Configuration ⚙️

API Keys

Agent Options

Supported Providers

Built-in Tools 🧰

Default Tool Sets

Individual Tools

Custom Tools 🛠️

Events and Callbacks 📡

Telemetry

Streaming API

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages