Building blocks for local agents in C++.
> [!NOTE]
> This library is designed for running small language models locally using llama.cpp. If you want to call external LLM APIs, this is not the right fit.
- Context Engineering - Use callbacks to manipulate the context between iterations of the agent loop.
- Memory - Use tools that allow an agent to store and retrieve relevant information across conversations.
- Multi-Agent - Build a multi-agent system with weight sharing where a main agent delegates to specialized sub-agents.
- Shell - Allow an agent to write shell scripts to perform multiple actions at once. Demonstrates human-in-the-loop interactions via callbacks.
- Tracing - Use callbacks to collect a record of the steps of the agent loop with OpenTelemetry.
To run the examples, you need to download a GGUF model. The default model configuration is set for granite-4.0-micro:
```sh
wget https://huggingface.co/ibm-granite/granite-4.0-micro-GGUF/resolve/main/granite-4.0-micro-Q8_0.gguf
```

> [!IMPORTANT]
> The examples use default `ModelConfig` values optimized for granite-4.0-micro. If you use a different model, you should adapt these values (context size, temperature, sampling parameters, etc.) to your specific use case.
We define an agent with the following building blocks:
In the current LLM (Large Language Model) world, an agent is usually a simple loop that intersperses Model Calls and Tool Executions until a stop condition is met.
> [!IMPORTANT]
> There are different ways to implement stop conditions. By default, we let the agent decide when to end the loop by generating an output without tool executions. You can implement additional stop conditions via callbacks.
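To make the loop concrete, here is a minimal sketch of the control flow. Everything in it is illustrative: the `Message`/`ToolCall` types and the three helper functions stand in for library internals and are not the actual agent.cpp API.

```cpp
// Illustrative sketch of the agent loop; NOT the actual agent.cpp API.
#include <string>
#include <vector>

struct Message  { std::string role; std::string content; };
struct ToolCall { std::string name; std::string arguments; };

// Hypothetical helpers, assumed to be implemented elsewhere.
std::string           llm_call(const std::vector<Message>& messages);
std::vector<ToolCall> parse_tool_calls(const std::string& output);
std::string           execute_tool(const ToolCall& call);

std::string run_agent_loop(std::vector<Message> messages) {
    while (true) {
        // Model call: generate the next assistant message.
        const std::string output = llm_call(messages);
        messages.push_back({"assistant", output});

        // Default stop condition: the model produced no tool calls.
        const auto calls = parse_tool_calls(output);
        if (calls.empty()) return output;

        // Tool executions: feed each result back into the conversation.
        for (const auto& call : calls) {
            messages.push_back({"tool", execute_tool(call)});
        }
    }
}
```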
Callbacks allow you to hook into the agent lifecycle at specific points:
- `before_agent_loop` / `after_agent_loop` - Run logic at the start/end of the agent loop
- `before_llm_call` / `after_llm_call` - Intercept or modify messages before/after model inference
- `before_tool_execution` / `after_tool_execution` - Validate, skip, or handle tool calls and their results
Use callbacks for logging, context manipulation, human-in-the-loop approval, or error recovery.
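As a sketch of how such a hook might be wired up (the `Callbacks` struct and its registration below are assumptions, not the verified agent.cpp interface; check the library headers for the real signatures), a simple logging callback could look like this:

```cpp
// Illustrative sketch: a before_llm_call hook that logs the message
// count before each inference. The Callbacks struct is hypothetical.
#include <cstdio>
#include <functional>
#include <string>
#include <vector>

struct Message { std::string role; std::string content; };

// Hypothetical callback table mirroring the hook names listed above.
struct Callbacks {
    std::function<void(std::vector<Message>&)> before_llm_call;
    std::function<void(std::vector<Message>&)> after_llm_call;
};

int main() {
    Callbacks callbacks;
    callbacks.before_llm_call = [](std::vector<Message>& messages) {
        std::printf("calling model with %zu messages\n", messages.size());
    };
    // In real code, these callbacks would be registered with the Agent.
    std::vector<Message> messages{{"user", "Hello"}};
    if (callbacks.before_llm_call) callbacks.before_llm_call(messages);
    return 0;
}
```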
A system prompt that defines the agent's behavior and capabilities. Passed to the Agent constructor and automatically prepended to conversations.
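Conceptually, that behavior boils down to the following (a hypothetical simplification, not the real `Agent` class):

```cpp
// Sketch: the system prompt is stored at construction and prepended
// to every conversation. The real Agent class is more involved.
#include <string>
#include <vector>

struct Message { std::string role; std::string content; };

class Agent {
public:
    explicit Agent(std::string system_prompt)
        : system_prompt_(std::move(system_prompt)) {}

    // The system prompt always becomes the first message.
    std::vector<Message> begin_conversation(const std::string& user_input) const {
        return {{"system", system_prompt_}, {"user", user_input}};
    }

private:
    std::string system_prompt_;
};
```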
Encapsulates local LLM initialization and inference using llama.cpp. This is tightly coupled to llama.cpp and requires models in GGUF format.
Handles:
- Loading GGUF model files (quantized models recommended for efficiency)
- Chat template application and tokenization
- Text generation with configurable sampling (temperature, top_p, top_k, etc.)
- KV cache management for efficient prompt caching
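To illustrate, configuring a model for local inference might look roughly like this. `ModelConfig` is the config type named in this README, but the field names below are assumptions drawn from the parameters listed above:

```cpp
// Sketch: tuning a model configuration. The struct definition is an
// assumed simplification; the real agent.cpp ModelConfig may differ.
#include <string>

struct ModelConfig {
    std::string model_path;    // path to a GGUF file
    int n_ctx = 4096;          // context size (tokens kept in the KV cache)
    float temperature = 0.7f;  // sampling temperature
    float top_p = 0.95f;       // nucleus sampling cutoff
    int top_k = 40;            // top-k sampling cutoff
};

int main() {
    ModelConfig config;
    config.model_path = "granite-4.0-micro-Q8_0.gguf";
    config.temperature = 0.2f; // lower temperature for more predictable tool calls
    return 0;
}
```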
Tools extend the agent's capabilities beyond text generation. Each tool defines:
- Name and description - Help the model understand when to use the tool
- Parameters schema - JSON Schema defining expected arguments
- Execute function - The actual implementation
When the model decides to use a tool, the agent parses the tool call, executes it, and feeds the result back into the conversation.
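A tool definition, in spirit, bundles those three pieces together. The following is an illustrative sketch with a hypothetical `Tool` struct, not the library's verified interface:

```cpp
// Sketch of a tool: name, description, JSON Schema for parameters,
// and an execute function. The Tool struct here is hypothetical.
#include <functional>
#include <string>

struct Tool {
    std::string name;
    std::string description;       // helps the model decide when to call it
    std::string parameters_schema; // JSON Schema for the arguments
    std::function<std::string(const std::string& args_json)> execute;
};

Tool make_weather_tool() {
    return {
        "get_weather",
        "Get the current weather for a city.",
        R"({"type":"object","properties":{"city":{"type":"string"}},"required":["city"]})",
        [](const std::string& args_json) {
            // Real code would parse args_json and query a data source.
            return std::string(R"({"temperature_c": 21})");
        },
    };
}
```

A specific, action-oriented description matters more with small local models, since it is the main signal the model has for choosing the right tool.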
C++ Standard: Requires C++17 or higher.
The easiest way to integrate agent.cpp into your CMake project is via FetchContent:
```cmake
include(FetchContent)

FetchContent_Declare(
  agent-cpp
  GIT_REPOSITORY https://github.com/mozilla-ai/agent.cpp
  GIT_TAG main # or a specific release tag like v0.1.0
)
FetchContent_MakeAvailable(agent-cpp)

add_executable(my_app main.cpp)
target_link_libraries(my_app PRIVATE agent-cpp::agent)
```

Alternatively, build and install agent.cpp, then use `find_package`:
```sh
# Clone and build
git clone --recursive https://github.com/mozilla-ai/agent.cpp
cd agent.cpp
cmake -B build -DAGENT_CPP_INSTALL=ON -DCMAKE_BUILD_TYPE=Release
cmake --build build

# Install (use --prefix for custom location)
cmake --install build --prefix ~/.local/agent-cpp
```

Then in your project:
```cmake
# If installed to a custom prefix, tell CMake where to find it
list(APPEND CMAKE_PREFIX_PATH "~/.local/agent-cpp")

find_package(agent-cpp REQUIRED)

add_executable(my_app main.cpp)
target_link_libraries(my_app PRIVATE agent-cpp::agent)
```

You can also add agent.cpp as a submodule and include it directly:
```sh
git submodule add https://github.com/mozilla-ai/agent.cpp agent.cpp
git submodule update --init --recursive
```

```cmake
add_subdirectory(agent.cpp)
target_link_libraries(my_app PRIVATE agent-cpp::agent)
```

This project uses llama.cpp as a submodule. You can enable hardware-specific acceleration by passing the appropriate CMake flags when building. For example:
```sh
# CUDA (NVIDIA GPUs)
cmake -B build -DGGML_CUDA=ON

# OpenBLAS (CPU)
cmake -B build -DGGML_BLAS=ON -DGGML_BLAS_VENDOR=OpenBLAS
```

For a complete list of build options and backend-specific instructions, see the llama.cpp build documentation.