A custom LiteLLM provider that exposes Agno agents through an OpenAI-compatible API, enabling seamless integration with Open WebUI and other OpenAI-compatible clients.
Note: This project uses LiteLLM's official `CustomLLM` extension mechanism with dynamic registration via `custom_provider_map`. No forking or monkey patching required!
Get the full stack running in under 5 minutes:
Prerequisites: Podman and a Gemini API key
# 1. Clone and navigate
git clone https://github.com/durandom/agentllm
cd agentllm
# 2. Configure environment
cp .env.secrets.template .env.secrets
# Edit .env.secrets and add:
# - GEMINI_API_KEY (get from https://aistudio.google.com/apikey)
# - AGENTLLM_TOKEN_ENCRYPTION_KEY (generate with command below)
# Generate encryption key for token storage
python -c "from cryptography.fernet import Fernet; print(Fernet.generate_key().decode())"
# Copy the output and add to .env.secrets as AGENTLLM_TOKEN_ENCRYPTION_KEY
# 3. Start everything (easiest way!)
podman compose up

Access Open WebUI: http://localhost:9500
Available Agents:
- `agno/release-manager` - RHDH release management assistant
- `agno/sprint-reviewer` - Sprint review assistant
- `agno/demo-agent` - Example agent with color tools
Note: Agent data (session history, credentials) is stored in the `tmp/` directory, which persists across restarts.
For development with hot reload (code changes reflected instantly):
# Install just command runner (one-time setup)
brew install just # macOS
# or see https://github.com/casey/just#installation for other platforms
# Start development environment with source code mounted
just dev
# After making code changes, restart to pick them up (no rebuild needed!)
just dev-restart
# View all available commands
just

The `just dev` command uses a development overlay that mounts your source code into containers, enabling hot reload without rebuilding images.
AgentLLM encrypts all sensitive tokens (Jira, GitHub, Google Drive, RHCP) at rest using industry-standard Fernet encryption. You must set an encryption key:
# Generate a new key
python -c "from cryptography.fernet import Fernet; print(Fernet.generate_key().decode())"
# Add to .env.secrets
echo "AGENTLLM_TOKEN_ENCRYPTION_KEY=<your-generated-key>" >> .env.secrets

Important:
- The application will fail to start without an encryption key
- Keep your key secure and backed up
- If you lose the key, all stored tokens become unrecoverable (users must re-enter)
- Use different keys for development and production
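These guarantees follow directly from how Fernet works; a quick round-trip sketch (illustrative only, assuming the `cryptography` package is installed; the project's actual storage logic lives in `token_storage.py` and may differ):

```python
from cryptography.fernet import Fernet, InvalidToken

# Generate a key the same way the setup step above does
key = Fernet.generate_key()
f = Fernet(key)

# Encrypt a token at rest, then decrypt it on read
token = b"jira-api-token-example"
ciphertext = f.encrypt(token)
assert f.decrypt(ciphertext) == token

# A different key cannot decrypt the ciphertext, which is why a lost
# key makes stored tokens unrecoverable
try:
    Fernet(Fernet.generate_key()).decrypt(ciphertext)
except InvalidToken:
    pass  # expected: wrong key raises InvalidToken
```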
📖 For production deployment, see docs/deployment-encryption.md
flowchart LR;
Client-->LiteLLM_Proxy;
LiteLLM_Proxy-->Agno_Provider;
Agno_Provider-->Agno_Agent;
Agno_Agent-->LLM_APIs;
Components:
- LiteLLM Proxy: OpenAI-compatible API gateway with authentication
- Agno Provider: Custom LiteLLM handler (`litellm.CustomLLM`) for Agno agents
- Agno Agents: Intelligent agents with tools (Google Drive, Jira, etc.)
- Gemini API: Underlying LLM (Google Gemini 2.5 Flash)
The provider uses LiteLLM's official `custom_provider_map` for dynamic registration:
litellm_settings:
custom_provider_map:
- provider: "agno"
      custom_handler: custom_handler.agno_handler

See CLAUDE.md for detailed architecture documentation.
- Python 3.11+
- uv package manager (install: `curl -LsSf https://astral.sh/uv/install.sh | sh`)
- just command runner (install: `brew install just` on macOS)
- Podman (install)
- Google Gemini API key (get here)
Choose based on your workflow:
| Mode | Command | Use When | Proxy | OAuth | Hot Reload |
|---|---|---|---|---|---|
| Quick Start ⭐ | `podman compose up` | Easiest - just running the system | Container | - | ❌ |
| Containerized Dev | `just dev` | Development with hot reload | Container | Container | ✅ |
| Local Services | Terminal 1: `just proxy`<br>Terminal 2: `just oauth-callback` | Maximum flexibility | Local | Local | ✅ |
Notes:
- `just dev` uses a dev overlay (compose.dev.yaml) that mounts your source code for hot reload
- After code changes with `just dev`, simply run `just dev-restart` to pick up changes (no rebuild needed!)
- Agent data is stored in the `tmp/` directory and persists across restarts
- OAuth callback server is always available at http://localhost:9502 in dev mode
Port reference:
- Open WebUI: http://localhost:9500 (external) → container port 8080 (internal)
- LiteLLM Proxy: http://localhost:9501 (external) → container port 8890 (internal)
- OAuth Callback: http://localhost:9502 (external) → container port 8501 (internal)
# Quick reference
just # Show all available commands
just info # Show project info and service URLs
# Development (containerized with hot reload)
just dev # Start all services with source mounted
just dev --build # Rebuild images and start
just dev-restart # Restart services after code changes
just dev-logs -f # Follow all logs
just dev-logs proxy -f # Follow proxy logs only
just dev-stop # Stop all services (preserve data)
just dev-reset # Remove containers and volumes
# Development (local services)
just proxy # Run proxy locally (foreground)
just oauth-callback # Run OAuth callback server locally
# Container management
just container-build # Build container images
just container-rebuild # Rebuild without cache
just container-update --proxy # Rebuild and restart specific services
# Testing
just test # Run unit tests
just test *ARGS # Run tests with custom args
just test-integration # Run integration tests
just test-eval # Run accuracy evaluations
just hello # Test running proxy
uv run pytest tests/test_custom_handler.py -v # Run specific test
# Code quality
just format # Format code with ruff
just lint # Run linting (via make)
just clean # Clean build artifacts
# Examples
just example-rhai-releases USER_ID # Run RHAI releases example
# Utilities
just sync # Sync dependencies with uv
just tokens # List users and tokens
just clean-test-dbs   # Clean up test databases

# Start proxy locally
just proxy
# Or start containerized with dev overlay
just dev
# Make a test request
curl -X POST http://localhost:9501/v1/chat/completions \
-H "Authorization: Bearer sk-agno-test-key-12345" \
-H "Content-Type: application/json" \
-H "X-OpenWebUI-User-Id: test-user" \
-d '{
"model": "agno/demo-agent",
"messages": [{"role": "user", "content": "Hey"}]
  }'

Agno Agents (powered by Gemini 2.5 Flash):
- `agno/release-manager` - RHDH release management assistant (Google Drive, Jira integration)
- `agno/sprint-reviewer` - Sprint review assistant (Google Drive, Jira integration)
- `agno/demo-agent` - Example agent with color tools (educational reference)
Direct Gemini Models:
- `gemini-2.5-pro` - Most capable model
- `gemini-2.5-flash` - Fast and efficient (used by Agno agents)
List models from running proxy:
curl -X GET http://localhost:9501/v1/models \
  -H "Authorization: Bearer sk-agno-test-key-12345"

Note: All models require `GEMINI_API_KEY` in your `.env.secrets` file.
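The curl example above can also be issued from Python's standard library. The `chat_request` helper below is hypothetical (not part of the project); it just builds the same request:

```python
import json
from urllib import request

def chat_request(base_url, api_key, user_id, model, content):
    """Build a chat-completions request matching the curl example.

    Hypothetical helper for illustration; not part of the project.
    """
    payload = {"model": model, "messages": [{"role": "user", "content": content}]}
    return request.Request(
        f"{base_url}/v1/chat/completions",
        data=json.dumps(payload).encode(),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
            "X-OpenWebUI-User-Id": user_id,  # identifies the per-user agent session
        },
        method="POST",
    )

req = chat_request("http://localhost:9501", "sk-agno-test-key-12345",
                   "test-user", "agno/demo-agent", "Hey")
# Send with request.urlopen(req) once the proxy is running
```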
See docs/agents/creating-agents.md for a complete guide. AgentLLM now supports a plugin-based architecture with automatic agent discovery.
- Create configurator in `src/agentllm/agents/my_agent_configurator.py`:
from agentllm.agents.base import AgentConfigurator, BaseToolkitConfig
class MyAgentConfigurator(AgentConfigurator):
    def _initialize_toolkit_configs(self) -> list[BaseToolkitConfig]:
        return []  # Add toolkit configs here

    def _build_agent_instructions(self) -> list[str]:
        return ["You are my agent.", "Your purpose is..."]

    def _get_agent_name(self) -> str:
        return "my-agent"

    def _get_agent_description(self) -> str:
        return "My agent description"

- Create agent wrapper in `src/agentllm/agents/my_agent.py`:
from agentllm.agents.base import BaseAgentWrapper, AgentFactory
from .my_agent_configurator import MyAgentConfigurator
class MyAgent(BaseAgentWrapper):
    def _create_configurator(self, user_id, session_id, shared_db, **kwargs):
        return MyAgentConfigurator(
            user_id=user_id,
            session_id=session_id,
            shared_db=shared_db,
            **kwargs
        )

class MyAgentFactory(AgentFactory):
    @staticmethod
    def create_agent(shared_db, token_storage, user_id, session_id=None,
                     temperature=None, max_tokens=None, **kwargs):
        return MyAgent(shared_db=shared_db, user_id=user_id,
                       session_id=session_id, **kwargs)

    @staticmethod
    def get_metadata():
        return {
            "name": "my-agent",
            "description": "My agent description",
            "mode": "chat",
        }

- Register in `pyproject.toml`:
pyproject.toml:
[project.entry-points."agentllm.agents"]
my-agent = "agentllm.agents.my_agent:MyAgentFactory"

- Add to proxy config in `proxy_config.yaml`:
- model_name: agno/my-agent
  litellm_params:
    model: agno/my-agent
    custom_llm_provider: agno

- Restart the proxy with `just proxy` or `just dev-restart`, and your agent will be auto-discovered!
Required:
- `GEMINI_API_KEY` - Google Gemini API key (get here)
- `LITELLM_MASTER_KEY` - Proxy API key (default: `sk-agno-test-key-12345`)
Optional:
- `OPENAI_API_BASE_URL` - LiteLLM proxy URL for Open WebUI
  - Containerized mode (`just dev`): `http://litellm-proxy:8890/v1` (recommended)
  - Local proxy mode (`just proxy`): `http://host.docker.internal:9501/v1`
  - Production (Kubernetes): `http://litellm-proxy-service:8890/v1`
  - Note: Internal container port is 8890, external host port is 9501
- `GDRIVE_CLIENT_ID`, `GDRIVE_CLIENT_SECRET` - For Google Drive integration
- `RELEASE_MANAGER_WORKBOOK_GDRIVE_URL` - Release Manager workbook URL (Google Sheets)
See .env.secrets.template for all available configuration options.
Edit proxy_config.yaml to:
- Add/remove models (Agno agents, Gemini, or other LLM providers)
- Configure authentication
- Adjust logging and server settings
AgentLLM stores all agent data in the tmp/ directory:
tmp/
├── agno_sessions.db # Session history and conversation context
├── agno_credentials.db # User credentials (OAuth tokens, API keys)
└── agno_handler.log # Application logs
Key Points:
- Data persists across container restarts (volume-mounted in `compose.yaml`)
- Delete `tmp/` to reset all agents and clear credentials
- Session history is per-user and per-agent
- Credentials are encrypted and stored securely per-user
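Since these are plain SQLite files, they can be inspected with the standard library. A generic sketch (no assumptions about the actual schema):

```python
import os
import sqlite3
import tempfile

def list_tables(db_path):
    """Return the table names in an SQLite file such as tmp/agno_sessions.db."""
    with sqlite3.connect(db_path) as conn:
        rows = conn.execute(
            "SELECT name FROM sqlite_master WHERE type='table' ORDER BY name"
        ).fetchall()
    return [name for (name,) in rows]

# Demonstrated against a fresh database here; point it at
# tmp/agno_sessions.db on a real deployment instead
path = os.path.join(tempfile.mkdtemp(), "demo.db")
sqlite3.connect(path).close()
print(list_tables(path))  # [] for a brand-new file
```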
The Agno provider extends `litellm.CustomLLM` and implements:
- `completion()` - Synchronous completions with full agent execution
- `streaming()` - Synchronous streaming (GenericStreamingChunk format)
- `acompletion()` - Async completions using `agent.arun()`
- `astreaming()` - True real-time streaming with `agent.arun(stream=True)` ⚡
Key Benefits:
- No LiteLLM modifications required
- Parameter pass-through (`temperature`, `max_tokens` → agent model)
- Conversation context preserved in agent sessions
- Per-user agent isolation
AgentLLM uses a plugin system with automatic agent discovery:
- AgentFactory: Entry point registration via `pyproject.toml`
- AgentConfigurator: Separates configuration management from execution
- BaseAgentWrapper: Provides common execution interface
- AgentRegistry: Automatically discovers and registers agents at runtime
Benefits:
- Agents can be distributed as separate packages
- No manual imports needed - automatic discovery
- Clean separation between configuration and execution
- Metadata system for agent capabilities
LiteLLM's `CustomLLM` requires the GenericStreamingChunk format:
{
"text": "content here", # Use "text", not "content"
"finish_reason": "stop" or None,
"index": 0,
"is_finished": True or False,
"tool_use": None,
"usage": {...}
}

Async streaming provides true real-time token-by-token streaming using Agno's native `async for` with `agent.arun(stream=True)`.
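The chunk contract is easy to mimic without LiteLLM at all; a toy generator producing that shape, useful for testing downstream consumers:

```python
def fake_astreaming(text):
    """Yield GenericStreamingChunk-shaped dicts one word at a time.

    Toy stand-in for what astreaming() emits; no LiteLLM dependency.
    """
    words = text.split()
    for i, word in enumerate(words):
        last = i == len(words) - 1
        yield {
            "text": word if last else word + " ",  # "text", not "content"
            "finish_reason": "stop" if last else None,
            "index": 0,
            "is_finished": last,
            "tool_use": None,
            "usage": None,
        }

chunks = list(fake_astreaming("hello from agno"))
assert "".join(c["text"] for c in chunks) == "hello from agno"
assert chunks[-1]["finish_reason"] == "stop"
```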
agentllm/
├── src/agentllm/
│ ├── custom_handler.py # LiteLLM CustomLLM implementation
│ ├── proxy_config.yaml # LiteLLM proxy configuration
│ ├── agents/
│ │ ├── base/ # Plugin system base classes
│ │ │ ├── factory.py # AgentFactory ABC
│ │ │ ├── registry.py # AgentRegistry (auto-discovery)
│ │ │ ├── configurator.py # AgentConfigurator (config mgmt)
│ │ │ ├── wrapper.py # BaseAgentWrapper (execution)
│ │ │ └── toolkit_config.py # BaseToolkitConfig (re-export)
│ │ ├── release_manager.py # Production agent wrapper
│ │ ├── release_manager_configurator.py # Release manager config
│ │ ├── demo_agent.py # Demo agent (reference impl)
│ │ ├── demo_agent_configurator.py # Demo agent config
│ │ ├── sprint_reviewer.py # SprintReviewer wrapper class
│ │ ├── sprint_reviewer_configurator.py # SprintReviewer config
│ │ └── toolkit_configs/
│ │ ├── gdrive_config.py # Google Drive OAuth
│ │ ├── jira_config.py # Jira API token
│ │ └── favorite_color_config.py # Demo config
│ ├── tools/
│ │ ├── gdrive_toolkit.py # Google Drive tools
│ │ ├── jira_toolkit.py # Jira tools
│ │ └── color_toolkit.py # Demo color tools
│ └── db/
│ └── token_storage.py # SQLite credential storage
├── tests/
│ ├── test_custom_handler.py # Provider tests
│ ├── test_release_manager.py # ReleaseManager tests
│ ├── test_sprint_reviewer.py # SprintReviewer tests
│ └── test_demo_agent.py # Demo agent tests
├── docs/
│ ├── agents/
│   │   └── creating-agents.md # Complete agent creation guide
│   └── templates/ # Documentation templates
├── justfile # Task automation (just command runner)
├── just/
│ ├── dev.just # Development commands
│ └── container.just # Container management
├── proxy_config.yaml # Proxy config (symlink to src/)
├── AGENTS.md # Architecture patterns & developer guide
└── CLAUDE.md # Reference to AGENTS.md
- Creating Agents - Complete guide to building custom agents with tools and configuration
- AGENTS.md - Architecture patterns, plugin system, and developer guide for contributors
- Release Manager - RHDH release management assistant
uv pip install -e .

Ensure GEMINI_API_KEY is set in .env.secrets. Get your key from Google AI Studio.
Check that port 9501 is available:
lsof -i :9501

- Verify container is running: `podman ps` or `just dev-status`
- Check port mapping: should see `0.0.0.0:9500->8080/tcp`
- Try http://localhost:9500 (external port, not 8080)
- Check container logs: `just dev-logs` or `just dev-logs open-webui -f`
To reset all agent sessions and credentials:
# Stop containers first
just dev-stop
# Remove agent data
rm -rf tmp/
# Restart
just dev

This clears:
- All conversation history
- Stored OAuth tokens and API keys
- Agent session state
- Write tests for new features (TDD workflow)
- Run tests: `just test`
- Format code: `just format`
- Run linting: `just lint`
- Update documentation
See all available commands with `just` or `just --list`.
GPL-3.0-only