GitHub - mrobinson2/AzureAgentForge: Turnkey, self-hostable multi-agent AI platform on Azure — PaperClip orchestration + Hermes runtime + Honcho memory + OpenAI-compatible model router, deployed via Terraform to Container Apps.

An open-source Azure foundation for running AI agent teams with private memory, tool control, cost guardrails, voice/chat interfaces, and observability.

Most agent demos look amazing for five minutes.

Then you try to run one for real work and the uncomfortable questions show up:

Where does memory live?
Who is allowed to call which tools?
How do you stop one agent from burning the expensive model budget?
What happened at 2 a.m. when the agent made that decision?
How do people use this from Teams, voice, web, or chat without creating four more side projects?
Can this run in Azure without becoming a weekend science project?

AzureAgentForge is built for that gap.

It brings together leading open-source agent tools — PaperClip for orchestration and UI, Hermes for agent execution, and Honcho for private memory — then wraps them in an Azure-ready foundation with Terraform, Key Vault, Container Apps, PostgreSQL, budget-aware model routing, and centralized logging.

Most LLM calls are routed through Azure AI Foundry, which keeps model integration simple while aligning with the Azure security model. Where supported, the platform is designed to favor Microsoft Entra ID and managed identity patterns over long-lived API keys.

It is not trying to be another cute local demo.

It is a practical starting point for people who want agent teams they can deploy, monitor, constrain, talk to, and improve.

The mental model

Think of AzureAgentForge as a small operations center for agent teams.

PaperClip is the front desk and work dashboard.
Hermes is where the agents actually do the work.
OpenClaw support is planned as an additional agent execution option.
Honcho is the memory layer that keeps context private.
Model Router is the spending guardrail.
Azure AI Foundry is the preferred model gateway.
Azure is the secure building everything runs inside.

You bring the goals.

The platform gives your agents a place to work, remember, call tools, stay within budget, talk to people, and leave an audit trail.

What's under the hood

                                  +---------------------------+
                                  |        Human Users        |
                                  | Browser / Teams / Voice   |
                                  | Telegram / Discord / Web  |
                                  +-------------+-------------+
                                                |
                                                v
                    +---------------------------------------------------+
                    |                    PaperClip                      |
                    |         Orchestrator, UI, task dashboard          |
                    |     Optional Telegram / Discord chat bridges      |
                    |     Microsoft Teams integration planned in v1.1   |
                    +----------------------+----------------------------+
                                           |
                                           | dispatches work
                                           v
                    +---------------------------------------------------+
                    |                Agent Runtime Layer                |
                    |                                                   |
                    |   Hermes today                                    |
                    |   OpenClaw support planned                        |
                    |                                                   |
                    |   Role profiles, toolsets, skills, delegation     |
                    +-----------+---------------------------+-----------+
                                |                           |
                                | model calls               | memory calls
                                v                           v
          +--------------------------------+       +-------------------------------+
          |          Model Router          |       |            Honcho             |
          | OpenAI-compatible API facade   |       | Self-hosted agent memory      |
          | Tier budgets, auth, fallback   |       | PostgreSQL + pgvector         |
          +---------------+----------------+       +---------------+---------------+
                          |                                |
                          v                                v
          +--------------------------------+       +-------------------------------+
          |       Azure AI Foundry         |       |      PostgreSQL Flexible      |
          | Preferred model gateway        |       |      Server with pgvector     |
          | Managed identity where possible|       |      Private network path     |
          | OpenAI-compatible fallback     |       +-------------------------------+
          +--------------------------------+


      +-----------------------------------------------------------------------+
      |                              Azure                                    |
      |                                                                       |
      |    Container Apps | Private VNet | Key Vault | ACR | Log Analytics    |
      |                                                                       |
      |  Terraform provisions the foundation. Key Vault provides secrets.     |
      |  Logs flow into Azure observability. Services run inside one VNet.    |
      +-----------------------------------------------------------------------+

For the full architecture, diagrams, component details, and data flow, see docs/architecture.md.

Why AzureAgentForge?

Because real agent systems need boring things that are easy to ignore in a demo:

private memory
scoped tool access
budget limits
deployment repeatability
identity-aware model access
secrets management
logs and traces
fallback providers
human review points
chat and voice channels
infrastructure that can be rebuilt

The boring stuff is the production stuff.

AzureAgentForge gives you a practical Azure-native base so you can focus on what the agents should do instead of rebuilding the platform plumbing from scratch.

What it solves

Memory that stays in your network

Honcho stores per-session and per-user memory in PostgreSQL with pgvector. Agent memory stays inside your Azure network instead of disappearing into someone else's hosted black box.
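To make the scoping idea concrete, here is a minimal in-memory sketch of per-user, per-session recall ranked by cosine distance, the same metric pgvector exposes through its `<=>` operator. It is not Honcho's API or schema: `MemoryRow`, `recall`, and the field names are hypothetical, and a real deployment would run the equivalent filter and ordering inside PostgreSQL.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class MemoryRow:
    """Hypothetical stand-in for one stored memory (not Honcho's schema)."""
    user_id: str
    session_id: str
    text: str
    embedding: tuple[float, ...]


def cosine_distance(a, b):
    """1 - cosine similarity; assumes both vectors are non-zero."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / norm


def recall(rows, user_id, session_id, query, k=2):
    """Return the k nearest memories, restricted to one user and session."""
    # Scope first, so another user's rows can never be ranked or returned.
    scoped = [r for r in rows
              if r.user_id == user_id and r.session_id == session_id]
    return sorted(scoped, key=lambda r: cosine_distance(r.embedding, query))[:k]
```

The point of the sketch is the order of operations: the user and session filter is applied before any similarity ranking.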

Model access through Azure AI Foundry

Most LLM integrations are designed to go through Azure AI Foundry first.

That gives the platform a cleaner model gateway, simpler model management, and a better security posture for Azure-native environments. The goal is to reduce one-off API key sprawl and make model access feel like part of the platform, not an afterthought.

OpenAI-compatible endpoints remain supported as fallback options.
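The Foundry-first, fallback-second ordering can be sketched as a loop over backends in priority order. This is an illustration of the pattern only, not the router's actual code: `BackendError`, `complete_with_fallback`, and the backend labels are hypothetical names.

```python
class BackendError(Exception):
    """Raised by a backend callable when it cannot serve the request."""


def complete_with_fallback(prompt, backends):
    """Try each (name, callable) pair in order; return the first success.

    `backends` is a priority-ordered list, for example Azure AI Foundry
    first and an OpenAI-compatible endpoint second.
    """
    errors = []
    for name, call in backends:
        try:
            return name, call(prompt)
        except BackendError as exc:
            # Remember why this backend failed, then try the next one.
            errors.append(f"{name}: {exc}")
    raise BackendError("all backends failed: " + "; ".join(errors))
```

Returning the backend name alongside the response keeps it possible to log which provider actually served a call.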

Cost control with real budget caps

The model router enforces per-tier daily budgets. Agents on the economy tier cannot accidentally burn through the frontier model budget.
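As a rough sketch of what a per-tier daily cap involves, the snippet below tracks spend per tier and refuses a charge that would cross the limit. It assumes costs are known in USD before the call is made; `TierBudget`, `BudgetExceeded`, and the dollar figures are hypothetical and do not describe the router's real implementation.

```python
from dataclasses import dataclass, field
from datetime import date


class BudgetExceeded(Exception):
    """Raised when a charge would push a tier past its daily limit."""


@dataclass
class TierBudget:
    daily_limit_usd: float
    spent_usd: float = 0.0
    day: date = field(default_factory=date.today)

    def charge(self, cost_usd, today=None):
        today = today or date.today()
        if today != self.day:
            # A new day started: reset the running total.
            self.day, self.spent_usd = today, 0.0
        if self.spent_usd + cost_usd > self.daily_limit_usd:
            raise BudgetExceeded(
                f"limit {self.daily_limit_usd} USD would be exceeded"
            )
        self.spent_usd += cost_usd


# Each tier has its own counter, so economy spend never touches frontier.
budgets = {"economy": TierBudget(5.0), "frontier": TierBudget(50.0)}
```

Because each tier owns its counter, an agent pinned to `economy` can exhaust only the economy allowance.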

Two Terraform cost profiles are included:

cost-optimized — targets under $150/month in Azure infrastructure
hardened — zone-redundant posture, private endpoints, and longer log retention

LLM token costs are separate and depend on your provider usage.

Safe tool use across defined roles

The agent team uses role profiles that define model tier and allowed toolsets. Roles do not get broad capability by accident.

A dedicated CostGuardian role exists specifically to watch spend.
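A role profile of this kind can be pictured as a small immutable record plus an allowlist check. The sketch below is an assumption about shape, not the repository's role schema: `RoleProfile`, `authorize`, and the `billing-read` toolset name are hypothetical, and the tier assigned to CostGuardian here is illustrative.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RoleProfile:
    """Hypothetical role record: a model tier plus an explicit tool allowlist."""
    name: str
    model_tier: str
    allowed_toolsets: frozenset[str]


def authorize(role, toolset):
    """Deny by default: a toolset must be listed on the role to be usable."""
    if toolset not in role.allowed_toolsets:
        raise PermissionError(
            f"role {role.name!r} may not use toolset {toolset!r}"
        )


# Illustrative values only; the real CostGuardian profile may differ.
cost_guardian = RoleProfile("CostGuardian", "economy", frozenset({"billing-read"}))
```

Calling `authorize(cost_guardian, "billing-read")` returns quietly, while any toolset not on the allowlist raises `PermissionError`.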

Deployable on Azure, not just runnable on a laptop

Terraform provisions the Azure foundation, including:

Azure Container Apps
Azure Database for PostgreSQL Flexible Server
Azure Container Registry
Azure Key Vault
Log Analytics
private networking

CI validates the Terraform configuration and checks that it plans cleanly on every commit.

Observable without container archaeology

Every service logs to a shared Log Analytics workspace. Application Insights is optional.

You should be able to understand what happened without SSHing into containers and spelunking through logs like it is 2007.
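One common way to make shared logs queryable is to emit one JSON object per line on stdout, which Azure Container Apps collects as console logs and forwards to the environment's Log Analytics workspace. The sketch below shows that pattern with the standard library; it is not taken from the repository, and `JsonFormatter`, `build_logger`, and the `fields` convention are hypothetical.

```python
import json
import logging
import sys


class JsonFormatter(logging.Formatter):
    """Render each record as a single JSON line."""

    def format(self, record):
        payload = {
            "service": record.name,
            "level": record.levelname,
            "message": record.getMessage(),
        }
        # Optional structured context passed via extra={"fields": {...}}.
        payload.update(getattr(record, "fields", {}))
        return json.dumps(payload)


def build_logger(service, stream=sys.stdout):
    """Return a logger that writes JSON lines to the given stream."""
    logger = logging.getLogger(service)
    logger.handlers.clear()
    handler = logging.StreamHandler(stream)
    handler.setFormatter(JsonFormatter())
    logger.addHandler(handler)
    logger.setLevel(logging.INFO)
    logger.propagate = False  # avoid duplicate lines via the root logger
    return logger
```

A call such as `build_logger("model-router").info("tier registered", extra={"fields": {"tier": "economy"}})` then produces a line that can be filtered by service or tier instead of grepped as free text.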

Built for where people already work

Telegram and Discord can be enabled through Terraform variables:

telegram_enabled = true
discord_enabled  = true

Both are off by default.

Full Microsoft Teams integration is planned for v1.1 so agent teams can move closer to the place many organizations already coordinate work.

Voice as a first-class interface

A future release will include first-class integration with Microsoft Voice Live for low-cost, low-latency speech-to-text and text-to-speech.

The goal is simple: agents should not be trapped behind a text box. You should be able to talk to them naturally, interrupt them, hear responses, and use voice where voice makes sense.

AzureAgentForge is for you if...

You want agents that do more than chat.

You want to give agents goals, tools, memory, and budgets — then see what they did, what they spent, and where they got stuck.

You want to run that system on Azure instead of duct-taping together a laptop demo, a hosted memory service, a mystery dashboard, and a pile of API keys.

You may be building:

a private agent platform for your own projects
a small "AI company" made of specialized agents
an automation lab
an internal enterprise prototype
a safer way to experiment with long-running autonomous workflows
a voice-enabled assistant stack
an Azure-native agent stack you can actually reason about

AzureAgentForge gives you the starting foundation: orchestration, runtime, memory, model routing, Terraform, secrets, logs, cost controls, and human-facing channels.

What is included today

AzureAgentForge v1.0 includes the working foundation of the platform:

Full Terraform IaC for the Azure foundation
Two infrastructure cost profiles
13 predefined agent roles with tests
Azure AI Foundry-first model routing pattern
Model router that runs locally
OpenAI-compatible fallback support
Sanitized Dockerfiles and service configuration
Key Vault-based secret loading
Log Analytics integration
Private VNet design
Local working slice with PostgreSQL and the model router
Optional Telegram and Discord surfaces
Multi-tenant architecture design and early scaffolding

The current local quickstart brings up PostgreSQL and the model router. The full one-command local stack and full end-to-end Azure installer are planned for v1.1.

What's not finished yet

AzureAgentForge is not a one-click SaaS product.

The v1.0 release gives you the architecture, Terraform, model router, role schema, Docker/config scaffolding, and a working local slice. Standing up the full Azure deployment still takes real setup work: Azure subscription, GitHub-to-Azure IAM, image build and push, Key Vault seeding, and environment configuration.

The v1.1 CLI installer is intended to automate that full path.

If you want magic, this is not magic.

If you want a serious foundation you can inspect, fork, improve, and run in your own Azure environment, you are in the right place.

AI-assisted setup

Until the v1.1 CLI installer is available, AzureAgentForge includes an AI-assisted setup path.

Use AI-ASSISTED-SETUP.md with Claude Code, Codex, or another coding agent that can inspect your local repo. The prompt walks the agent through repo discovery, local setup, Azure prerequisites, Terraform deployment, container image build and push, Key Vault configuration, Azure AI Foundry model routing, optional integrations, and post-deployment smoke testing.

This is not a replacement for the installer. It is a guided setup assistant for developers who want help understanding and deploying the repo today.

Quickstart

Prefer a guided walkthrough before running commands? Start with AI-ASSISTED-SETUP.md. It gives Claude Code or Codex a structured prompt to inspect the repo and guide you through setup step by step.

Local

cp .env.example .env

# Fill in one of the following:
# - AZURE_FOUNDRY_ENDPOINT + AZURE_FOUNDRY_API_KEY
# - OPENAI_COMPAT_BASE_URL

docker compose up

This starts PostgreSQL and the model router.

The router registers an LLM tier on boot if credentials are present. Without credentials, it starts with no tiers.
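The boot behavior described above can be sketched as a pure function over the environment. The variable names come from `.env.example` as quoted in this README; everything else (`discover_tiers`, the `default` tier name, the returned dictionary shape, and preferring Foundry when both are set) is an assumption for illustration.

```python
import os


def discover_tiers(env):
    """Return the tiers that can be registered from the given environment.

    Assumes Azure AI Foundry is preferred when both credential sets exist.
    """
    tiers = {}
    endpoint = env.get("AZURE_FOUNDRY_ENDPOINT")
    api_key = env.get("AZURE_FOUNDRY_API_KEY")
    compat_url = env.get("OPENAI_COMPAT_BASE_URL")
    if endpoint and api_key:
        tiers["default"] = {"backend": "azure-foundry", "base_url": endpoint}
    elif compat_url:
        tiers["default"] = {"backend": "openai-compatible", "base_url": compat_url}
    # With no usable credentials the router still starts, with no tiers.
    return tiers


if __name__ == "__main__":
    print(discover_tiers(os.environ))
```

Keeping the discovery step free of side effects makes the "no credentials, no tiers" case easy to test without touching a real provider.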

PaperClip and Honcho are not part of that default slice. Starting them currently requires the upstream sources for both projects and the full profile:

docker compose --profile full up

A one-command full local stack is planned for v1.1.

See docs/getting-started.md for the full local walkthrough.

Azure

# Initialize Terraform
terraform -chdir=infrastructure/environments/dev init

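Create terraform.tfvars in infrastructure/environments/dev (the directory passed to -chdir, which is where Terraform resolves relative -var-file paths) with your subscription and environment values, including: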

subscription_id             = "..."
location                    = "..."
keyvault_admin_object_ids   = ["..."]
container_registry_name     = "..."

Apply with the cost-optimized profile:

terraform -chdir=infrastructure/environments/dev apply \
  -var-file=../../profiles/cost-optimized.tfvars \
  -var-file=terraform.tfvars

This provisions infrastructure. It does not yet build and push all service images or seed every runtime secret.

A complete end-to-end deploy flow is planned for the v1.1 CLI installer.

See docs/getting-started.md for the full Azure walkthrough, including Key Vault secret seeding.

Roadmap

Available now

✅ Azure-hosted production stack open-sourced
✅ Full Terraform IaC
✅ Azure Container Apps foundation
✅ PostgreSQL Flexible Server with pgvector
✅ Private VNet architecture
✅ Azure Key Vault secret pattern
✅ Log Analytics integration
✅ Two infrastructure cost profiles
✅ 13 predefined agent roles
✅ Agent role schema with automated tests
✅ Model router running locally
✅ Azure AI Foundry-first LLM integration pattern
✅ OpenAI-compatible model router API
✅ Per-tier daily budget caps
✅ Azure AI Foundry as primary model backend
✅ OpenAI-compatible fallback backend
✅ Dockerfiles and service configuration
✅ Local working slice with PostgreSQL and model router
✅ Optional Telegram bridge
✅ Optional Discord bridge
✅ Multi-tenant architecture designed
✅ Early multi-tenant scaffolding

Coming in v1.1

⬜ CLI installer
✅ AI-assisted setup documentation using Claude Code or Codex (already available; see AI-ASSISTED-SETUP.md)
⬜ ANSI terminal UI for guided setup
⬜ Preflight checks
⬜ Azure configuration wizard
⬜ Automated Terraform provision flow
⬜ Image build and push automation
⬜ Key Vault secret seeding
⬜ Full service deployment automation
⬜ Smoke tests after deployment
⬜ One-command full local stack
⬜ Full Microsoft Teams integration
⬜ Measured Azure cost figures based on real bills
⬜ First fully validated end-to-end Azure deployment from a clean subscription

Future releases

⬜ OpenClaw support as an additional agent runtime
⬜ First-class Microsoft Voice Live integration
⬜ Low-latency STT/TTS voice interface
⬜ Complete multi-tenant implementation
⬜ Multiple human users
⬜ Agent reviews and approvals
⬜ Work queues
⬜ Scheduled routines
⬜ Skills manager
⬜ Artifacts and work products
⬜ Cloud sandbox agents
⬜ Deeper observability pipeline
⬜ Agent inventory and operational dashboard patterns
⬜ Private enterprise RAG patterns with Azure AI Search
⬜ Azure Container Apps Sandboxes exploration
⬜ Microsoft Foundry Agent Service alignment
⬜ Microsoft 365 and Agent 365 publishing exploration
⬜ Work IQ API exploration
⬜ More chat surfaces
⬜ Desktop app exploration
⬜ CEO chat experience
⬜ Automatic organizational learning
⬜ Self-organization experiments
⬜ Deep planning workflows
⬜ Enforced outcomes
⬜ MAXIMIZER MODE

See ROADMAP.md for the full roadmap.

Microsoft ecosystem alignment

AzureAgentForge is intentionally aligned with where Microsoft is moving the agent platform:

Azure AI Foundry for model access and agent development
Foundry Agent Service as a managed path for hosted agents and scale-out runtime patterns
Microsoft Teams as a primary collaboration surface
Microsoft Voice Live for real-time voice agents
Azure Container Apps for containerized agent services
Azure Container Apps Sandboxes as a future path for safer agentic workload execution
Azure AI Search for private RAG over enterprise data
Log Analytics and Application Insights for operations and troubleshooting
Microsoft 365 and Agent 365 as future distribution points where agents can meet users where they already work

The goal is not to chase every new service.

The goal is to make AzureAgentForge a practical bridge between open-source agent tooling and the Microsoft cloud capabilities that make agent systems safer, easier to operate, and easier to adopt inside real organizations.

Cost

The cost-optimized profile targets under $150/month in Azure infrastructure spend.

LLM token costs are billed separately by your provider.

Cost estimates are modeled, not guaranteed. Real cost depends on region, usage, log volume, database sizing, redundancy choices, and model consumption.

See docs/cost.md for the per-service breakdown.

Platform status

This repository is the open, sanitized version of a multi-agent platform running on Azure.

The architecture, IaC, and components are tested through day-to-day use. The repo CI validates Terraform, Docker Compose configuration, agent role definitions, and model router behavior.

The single-tenant stack is the current working path.

Multi-tenant support is designed and partially scaffolded.

Documentation

| Doc | What's in it |
| --- | --- |
| `docs/architecture.md` | System context, Azure architecture, components, data flow, maturity |
| `docs/getting-started.md` | Fork, configure, deploy locally, deploy to Azure |
| `AI-ASSISTED-SETUP.md` | Claude Code / Codex prompt for guided repo analysis, deployment, and usage |
| `docs/cost.md` | Per-service infrastructure estimates for both profiles |
| `docs/security.md` | Secrets, network posture, and pre-production checklist |
| `docs/why-azure.md` | The case for building agents on Azure |
| `docs/agents.md` | The 13-role model and how to add your own |

Built on

AzureAgentForge is intentionally built on strong open-source projects instead of reinventing every layer.

| Project | Role |
| --- | --- |
| PaperClip | Orchestrator UI and agent coordination layer |
| Hermes | Agent runtime |
| OpenClaw | Planned additional agent runtime support |
| Honcho | Self-hosted agent memory |
| Cloudflared | Optional tunnel ingress |

Security notes

AzureAgentForge is designed with a private-by-default posture:

services run inside a private VNet
secrets are loaded from Key Vault
model routing is designed around Azure AI Foundry first
managed identity and Entra ID patterns are preferred where supported
memory is stored in PostgreSQL with pgvector
chat bridges are disabled by default
Application Insights is opt-in
the hardened profile supports a stronger production posture

Before using this for sensitive workloads, review docs/security.md, validate your own Azure policies, and complete your own threat model.

This is infrastructure you control, not a security guarantee you outsource.

Contributing

Issues, ideas, and pull requests are welcome.

Good contributions include:

cleaner setup paths
better Azure deployment automation
cost tuning
additional observability
safer default policies
documentation fixes
agent role improvements
Microsoft Teams integration
Microsoft Voice Live integration
OpenClaw runtime support
Azure AI Foundry integration improvements
AI-assisted setup prompt improvements
tested integrations

Please keep contributions practical. The goal is not to win buzzword bingo. The goal is to help people run useful agent systems with less chaos.

License

MIT
