Your Central Hub for Local AI Intelligence.
The ConductorAPI is a powerful organization layer for your local AI API traffic. Think of it as the air traffic controller for your local AI runtimes (Ollama, LM Studio, llama.cpp, and more). It provides a single, unified OpenAI-compatible endpoint that intelligently routes your requests to the right model, spinning runtimes and models up and down automatically to ensure you never run out of VRAM.
Why you want this:
- Run Multiple Automation Tasks: Execute coding agents, chat bots, and summarizers simultaneously without conflict. The orchestrator queues them and switches models instantly.
- Intelligent Resource Management: Only one VRAM-heavy model runs at a time. The system automatically unloads idle models and loads the next one required.
- Unified Front Door: Point all your apps to http://127.0.0.1:8000. No more juggling port numbers (11434, 1234, 8080...).
- Route Aliases: Use stable names like route:coding or route:chat. If your primary model is down or overloaded, the system can automatically fall back to another model (local or cloud). See the client sketch after this list.
- Plug-and-Play Extensibility: Add any OpenAI-compatible provider (Groq, weak-to-strong-generalization rigs, custom RAG APIs) just by dropping a YAML file in the providers/ folder.
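Because the front door is OpenAI-compatible, any standard client can use it by overriding the base URL and passing a route alias as the model name. The sketch below uses the openai Python package; the route:chat alias and the /v1 path suffix are assumptions about a typical setup, so adjust them to match your routes.yaml and server configuration.

```python
# Minimal sketch: send a request through the orchestrator using a route alias.
# Assumes the server runs on the default port, a "route:chat" alias exists in
# routes.yaml, and the usual OpenAI-compatible "/v1" path layout is exposed.
from openai import OpenAI

client = OpenAI(
    base_url="http://127.0.0.1:8000/v1",
    api_key="not-needed-for-local",  # local runtimes generally ignore the key
)

response = client.chat.completions.create(
    model="route:chat",  # stable alias; the orchestrator picks the backing model
    messages=[{"role": "user", "content": "Give me a one-line status summary."}],
)
print(response.choices[0].message.content)
```

Swapping route:chat for route:coding (or any other alias) is the only change an app needs when a route is re-pointed at a different model.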
- Install Python 3.11+
- Create Virtual Environment:
python -m venv venv
.\venv\Scripts\activate
- Install Dependencies:
pip install -r requirements.txt
- Run the Server:
python conductorAPI.py
The server starts on http://127.0.0.1:8000.
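To confirm the server is reachable, you can ask it for the models it currently exposes. The snippet below assumes the standard OpenAI-compatible /v1/models listing path; adjust it if your configuration differs.

```python
# Quick smoke test, assuming an OpenAI-compatible /v1/models listing endpoint.
import requests

resp = requests.get("http://127.0.0.1:8000/v1/models", timeout=5)
resp.raise_for_status()
for model in resp.json().get("data", []):
    print(model.get("id"))
```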
The ConductorAPI includes a simple web-based user interface to monitor and manage your AI models and routes. To access the UI:
- Start the server by running:
python conductorAPI.py
- Open your browser and navigate to http://127.0.0.1:8000.
From the UI, you can:
- View active routes and their statuses.
- Monitor running models and their resource usage.
- Configure settings dynamically without restarting the server.
- config.yaml: Main settings (port, logging, etc.).
- routes.yaml: Define route aliases (e.g., route:planner -> gemini-2.0-flash).
- models.yaml: Override detailed scoring/priority for specific models.
- providers/*.yaml: Define your backend providers.
"Just a YAML file away."
The system is designed to be fully extensible. Each "provider" is defined by a config file in the providers/ directory. You can share these files like extensions.
- Ollama: Auto-starts ollama serve.
- llama.cpp: Auto-starts its internal server.
- LM Studio: Can auto-start via the lms CLI (see providers/lmstudio.yaml).
Create a file providers/my_custom_api.yaml:
provider_id: "my_rag_api"
provider_type: "openai_compat"
api:
base_url: "http://localhost:9090"
health:
method: "GET"
path: "/health"
success_codes: [200]
start:
enabled: true
command: "python"
args: ["run_my_rag.py"]The Orchestrator will now manage this process and can route requests to it! Set it up like you would an OpenAI endpoint
