An OpenAI-compatible API layer for AI Horde that enables opencode and any OpenAI-compatible client to use AI Horde's distributed GPU network for text generation.
AI Horde is a crowdsourced distributed cluster of text and image generation workers. This interposer translates OpenAI chat completion requests to AI Horde's native async format and handles the polling workflow.
```
┌─────────────┐    ┌──────────────────┐    ┌─────────────┐
│  opencode   │───▶│    Interposer    │───▶│  AI Horde   │
│   or any    │    │  (this project)  │    │     API     │
│ OpenAI SDK  │◀───│                  │◀───│             │
└─────────────┘    └──────────────────┘    └─────────────┘
                            │
                            ▼
                   ┌───────────────┐
                   │ Model Registry│
                   │ (from workers)│
                   └───────────────┘
```
- OpenAI-compatible endpoints: `/v1/chat/completions` and `/v1/models`
- Automatic request translation: converts OpenAI format to AI Horde format
- Async polling: handles the submit/poll/retrieve workflow automatically
- Model discovery: fetches model capabilities from `/v2/workers?type=text`
- Instruct format support: ChatML, Mistral, and Alpaca prompt formats
- OpenCode integration: auto-updating `opencode.json` with available models
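To make the instruct-format support concrete, here is a minimal sketch of how OpenAI-style messages map onto a ChatML prompt. The function name is hypothetical and the details are an illustration, not the project's actual `translate.py` implementation:

```python
# Illustrative sketch: OpenAI chat messages -> ChatML prompt string.
# The helper name and exact formatting are assumptions for this example.

def to_chatml(messages: list[dict]) -> str:
    """Render OpenAI-style chat messages as a ChatML prompt."""
    parts = []
    for msg in messages:
        # Each turn is wrapped in <|im_start|>role ... <|im_end|> markers.
        parts.append(f"<|im_start|>{msg['role']}\n{msg['content']}<|im_end|>")
    # Leave an open assistant turn for the model to complete.
    parts.append("<|im_start|>assistant\n")
    return "\n".join(parts)

prompt = to_chatml([
    {"role": "system", "content": "You are helpful."},
    {"role": "user", "content": "Tell me a joke."},
])
print(prompt)
```

The Mistral and Alpaca formats follow the same idea with different turn delimiters.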
Install the package:

```
pip install -e .
```

Set your API key (requests fall back to the low-priority anonymous queue if it is left empty):

```
SET AI_HORDE_API_KEY=your_api_key
```

Start the server:

```
uvicorn horde_openai.server:app --host 0.0.0.0 --port 8080
```

You WILL randomly hit 403 errors if the API key is not set; this is expected behavior, not a bug.
Send a chat completion request:

```
curl -X POST http://localhost:8080/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "awesome_engine/splendid_model",
    "messages": [{"role": "user", "content": "Hello! How are you?"}],
    "max_tokens": 50
  }'
```

List the available models:

```
curl http://localhost:8080/v1/models
```

Generate the OpenCode configuration once:

```
python update_opencode_models.py --once
```

This creates an `opencode.json` with:
- All available AI Horde text models
- Proper OpenCode provider format
- Model-specific context/output limits
- Default model set to most available
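As a rough picture of what the updater produces, the sketch below builds a provider-style config from worker data. The field names and schema here are assumptions for illustration; the real `opencode.json` format is defined by OpenCode, and the worker entry would come from `/v2/workers?type=text`:

```python
# Sketch only: hypothetical opencode.json shape, NOT the verified schema.
import json

horde_models = [  # in practice, fetched from /v2/workers?type=text
    {"name": "koboldcpp/Fimbulvetr-11B-v2",
     "max_context_length": 4096, "max_length": 512},
]

config = {
    "provider": {
        "aihorde": {  # provider id is an assumption
            "options": {"baseURL": "http://localhost:8080/v1"},
            "models": {
                m["name"]: {
                    # per-model context/output limits from worker data
                    "limit": {"context": m["max_context_length"],
                              "output": m["max_length"]},
                }
                for m in horde_models
            },
        }
    }
}
print(json.dumps(config, indent=2))
```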
```
# Run continuously with 5-minute refresh (default)
python update_opencode_models.py

# Custom refresh interval
python update_opencode_models.py --interval 600
```

```
HordeStreaming/
├── src/horde_openai/
│   ├── __init__.py              # Package exports
│   ├── client.py                # AI Horde HTTP client with async polling
│   ├── models.py                # Model registry from /v2/workers
│   ├── translate.py             # Request/response translation
│   └── server.py                # FastAPI server with OpenAI endpoints
├── tests/
│   └── test_interposer.py       # 26 unit tests
├── docs/
│   └── INTERPOSER_SPEC.md       # API specification
├── opencode.json                # OpenCode provider config (auto-generated)
├── pyproject.toml               # Package configuration
└── update_opencode_models.py    # Model updater script
```
POST `/v1/chat/completions`

```
{
  "model": "koboldcpp/Fimbulvetr-11B-v2",
  "messages": [
    {"role": "system", "content": "You are helpful."},
    {"role": "user", "content": "Tell me a joke."}
  ],
  "temperature": 0.7,
  "max_tokens": 100
}
```

GET `/v1/models`
Returns all available text generation models with their capabilities.
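The response follows the standard OpenAI model-list shape (`{"object": "list", "data": [...]}`), so any OpenAI client can consume it. A small sketch of picking a model out of such a response; the sample entries below are made up:

```python
# Extracting model ids from a /v1/models-style response.
# The entries in `sample` are illustrative, not real Horde output.
sample = {
    "object": "list",
    "data": [
        {"id": "koboldcpp/Fimbulvetr-11B-v2", "object": "model",
         "owned_by": "ai-horde"},
        {"id": "aphrodite/splendid_model", "object": "model",
         "owned_by": "ai-horde"},
    ],
}

model_ids = [m["id"] for m in sample["data"]]
print(model_ids)
```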
- Request received at `/v1/chat/completions`
- Translate OpenAI messages to the AI Horde prompt format
- Submit to `/api/v2/generate/text/async` → get a job ID
- Poll `/api/v2/generate/text/status/{id}` until done
- Translate the response back to OpenAI format
- Return the completion response
Run the tests:

```
pytest tests/ -v
```

| Variable | Description | Default |
|---|---|---|
| `AI_HORDE_API_KEY` | AI Horde API key | `0000000000` (anonymous) |
Even though the anonymous API key works, it is still suggested that you obtain your own key (and contribute back to the Horde).
- No true streaming: AI Horde's async API doesn't support real-time streaming
- Latency: 2-30 seconds depending on queue
- Token limit: Maximum 4096 tokens per generation
- Availability: Depends on volunteer workers
MIT