Super Agent Backend

A multi-model AI agent with semantic routing: each request is automatically sent to the cheapest model that can handle it adequately.

| Model | Used for |
| --- | --- |
| Gemini Flash | Classification, extraction, translation, short Q&A |
| DeepSeek Chat | Coding, debugging, math, structured reasoning |
| Claude Sonnet | Writing, summarization, email drafting, nuanced tasks |

Quick Start (local, no Docker)

# 1. Clone the repository (substitute your own repository URL) and enter the project
git clone <repo-url> super-agent
cd super-agent

# 2. Create virtual environment
python -m venv .venv
source .venv/bin/activate      # macOS/Linux
# .venv\Scripts\activate       # Windows

# 3. Install dependencies
pip install -r requirements.txt

# 4. Set your API keys
cp .env.example .env
# Edit .env and fill in ANTHROPIC_API_KEY, GEMINI_API_KEY, DEEPSEEK_API_KEY

# 5. Run the server
uvicorn app.main:app --reload --port 8000

Open http://localhost:8000/docs for the interactive Swagger UI.
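Before starting the server it can help to confirm that `.env` actually contains the three keys named in step 4. The following is a minimal sketch using only the standard library; `parse_env` and `missing_keys` are hypothetical helpers and not part of this project, and the parser handles only plain `KEY=VALUE` lines.

```python
from pathlib import Path

# Key names come from step 4 above; everything else here is illustrative.
REQUIRED_KEYS = ("ANTHROPIC_API_KEY", "GEMINI_API_KEY", "DEEPSEEK_API_KEY")


def parse_env(text):
    """Parse simple KEY=VALUE lines, skipping blanks and # comments."""
    values = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def missing_keys(values):
    """Return the required keys that are absent or empty, in a fixed order."""
    return [key for key in REQUIRED_KEYS if not values.get(key, "").strip()]


def check_env_file(path=".env"):
    """Read the file at `path` and report which required keys still need a value."""
    return missing_keys(parse_env(Path(path).read_text(encoding="utf-8")))
```

Calling `check_env_file()` from the project root returns an empty list when all three keys are filled in.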

Quick Start (Docker)

cp .env.example .env   # fill in your keys
docker compose up --build

API Endpoints

`POST /chat`

Routes the request to the cheapest adequate model, as chosen by the semantic classifier.

curl -X POST http://localhost:8000/chat \
  -H "Content-Type: application/json" \
  -d '{"message": "Draft a follow-up email after a sales call.", "session_id": "user_123"}'

Response:

{
  "response": "...",
  "model_used": "CLAUDE",
  "routed_by": "classifier",
  "session_id": "user_123"
}
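The same call can be made from Python. This is a sketch using only the standard library, assuming the server is running on the Quick Start port; `build_chat_request` and `send` are hypothetical helper names, and the response fields are the ones shown in the example above.

```python
import json
import urllib.request

BASE_URL = "http://localhost:8000"  # port used in the Quick Start


def build_chat_request(message, session_id, base_url=BASE_URL):
    """Build (but do not send) a POST /chat request with a JSON body."""
    body = json.dumps({"message": message, "session_id": session_id}).encode("utf-8")
    return urllib.request.Request(
        f"{base_url}/chat",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def send(request, timeout=30):
    """Send a prepared request and decode the JSON reply into a dict."""
    with urllib.request.urlopen(request, timeout=timeout) as reply:
        return json.loads(reply.read().decode("utf-8"))
```

With the server up, `send(build_chat_request("Draft a follow-up email after a sales call.", "user_123"))` should return a dict whose `model_used` field names the model the classifier picked.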

`POST /chat/direct`

Forces a specific model and skips the classifier.

curl -X POST http://localhost:8000/chat/direct \
  -H "Content-Type: application/json" \
  -d '{"message": "Write a Python quicksort", "model": "DEEPSEEK", "session_id": "dev"}'
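A client may want to reject a misspelled model name before the request leaves the machine. The sketch below builds the `/chat/direct` body and validates the `model` field. `"CLAUDE"` and `"DEEPSEEK"` appear in this README; `"GEMINI"` is an assumed name for the third model and should be checked against the server's schema. `direct_chat_body` is a hypothetical helper.

```python
import json

# "GEMINI" is an assumption; only "CLAUDE" and "DEEPSEEK" appear in the README.
KNOWN_MODELS = {"CLAUDE", "DEEPSEEK", "GEMINI"}


def direct_chat_body(message, model, session_id):
    """Return the JSON body for POST /chat/direct, rejecting unknown models early."""
    name = model.strip().upper()
    if name not in KNOWN_MODELS:
        raise ValueError(
            f"unknown model {model!r}; expected one of {sorted(KNOWN_MODELS)}"
        )
    return json.dumps({"message": message, "model": name, "session_id": session_id})
```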

`GET /history/{session_id}`

Retrieve conversation history for a session.

`DELETE /history/{session_id}`

Clear conversation history for a session.
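Session IDs that come from messaging platforms often contain characters such as `@`, so a client should percent-encode the ID before placing it in the path. A minimal sketch for both history endpoints follows; `history_request` is a hypothetical helper, and how the server treats encoded characters is not documented here.

```python
import urllib.parse
import urllib.request

BASE_URL = "http://localhost:8000"  # port used in the Quick Start


def history_request(session_id, method="GET", base_url=BASE_URL):
    """Build a GET or DELETE request for /history/{session_id}."""
    if method not in ("GET", "DELETE"):
        raise ValueError("method must be GET or DELETE")
    # safe="" also encodes "/" so the ID stays a single path segment.
    encoded = urllib.parse.quote(session_id, safe="")
    return urllib.request.Request(f"{base_url}/history/{encoded}", method=method)
```

Pass the result to `urllib.request.urlopen` to fetch or clear the history of a running server.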

`GET /health`

Liveness check.

Running Tests

pytest tests/ -v

All model calls in the tests are mocked, so no API keys are required to run them.
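To illustrate the mocking approach, here is a sketch of a test that replaces a model client with `unittest.mock.Mock`. The `answer` function and the `complete` method are hypothetical stand-ins; the project's real wrappers and test names may differ.

```python
from unittest.mock import Mock


def answer(client, prompt):
    """Hypothetical model wrapper: delegate to client.complete() and tidy the text."""
    return client.complete(prompt).strip()


def test_answer_uses_mocked_client():
    # The Mock stands in for a real SDK client, so no network call or API key is needed.
    client = Mock()
    client.complete.return_value = "  mocked reply \n"

    assert answer(client, "hello") == "mocked reply"
    client.complete.assert_called_once_with("hello")
```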

Project Structure

super-agent/
├── app/
│   ├── main.py              # FastAPI app + endpoints
│   ├── config.py            # Pydantic settings (reads .env)
│   ├── prompts.py           # System prompts + routing prompt
│   ├── models/
│   │   ├── claude.py        # Anthropic SDK wrapper
│   │   ├── gemini.py        # Google GenAI SDK wrapper
│   │   └── deepseek.py      # OpenAI-compat wrapper → DeepSeek
│   ├── routing/
│   │   ├── classifier.py    # Semantic router (Gemini Flash)
│   │   └── dispatcher.py    # Routes message to correct model
│   ├── tools/
│   │   └── base_tools.py    # LangChain @tool wrappers
│   └── memory/
│       └── session.py       # SQLite-backed session memory
├── tests/
│   ├── test_models.py
│   ├── test_routing.py
│   └── test_api.py
├── Dockerfile
├── docker-compose.yml
├── requirements.txt
└── .env.example
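As an illustration of what `memory/session.py` is described as doing, the sketch below stores per-session messages in SQLite with the standard `sqlite3` module. The class name, table schema, and method names are assumptions and not the project's actual code.

```python
import sqlite3


class SessionMemory:
    """Minimal sketch of SQLite-backed chat history (assumed schema)."""

    def __init__(self, path=":memory:"):
        self.db = sqlite3.connect(path)
        self.db.execute(
            "CREATE TABLE IF NOT EXISTS messages ("
            "id INTEGER PRIMARY KEY AUTOINCREMENT, "
            "session_id TEXT NOT NULL, role TEXT NOT NULL, content TEXT NOT NULL)"
        )

    def append(self, session_id, role, content):
        """Store one message for a session."""
        with self.db:  # commits on success, rolls back on error
            self.db.execute(
                "INSERT INTO messages (session_id, role, content) VALUES (?, ?, ?)",
                (session_id, role, content),
            )

    def history(self, session_id):
        """Return the session's messages in insertion order."""
        rows = self.db.execute(
            "SELECT role, content FROM messages WHERE session_id = ? ORDER BY id",
            (session_id,),
        )
        return [{"role": role, "content": content} for role, content in rows]

    def clear(self, session_id):
        """Delete the session's messages and return how many were removed."""
        with self.db:
            cursor = self.db.execute(
                "DELETE FROM messages WHERE session_id = ?", (session_id,)
            )
        return cursor.rowcount
```

In this sketch, `history` and `clear` correspond to the `GET` and `DELETE /history/{session_id}` endpoints.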

Cost-Routing Policy

cheap → Gemini Flash    (classification, extraction, short tasks)
mid   → DeepSeek Chat   (reasoning, code)
top   → Claude Sonnet   (writing, polish, nuanced output)

The classifier itself runs on Gemini Flash to minimise cost.
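The policy can be pictured as a lookup from a classifier label to a model tier. The sketch below is illustrative only: the label strings, the `Model` enum, and the choice to fall back to the top tier for unknown labels are all assumptions, not the project's dispatcher.

```python
from enum import Enum


class Model(str, Enum):
    GEMINI = "GEMINI"      # cheap tier
    DEEPSEEK = "DEEPSEEK"  # mid tier
    CLAUDE = "CLAUDE"      # top tier


# Labels are hypothetical; the real classifier may emit different ones.
TASK_TO_MODEL = {
    "classification": Model.GEMINI,
    "extraction": Model.GEMINI,
    "translation": Model.GEMINI,
    "short_qa": Model.GEMINI,
    "coding": Model.DEEPSEEK,
    "debugging": Model.DEEPSEEK,
    "math": Model.DEEPSEEK,
    "reasoning": Model.DEEPSEEK,
    "writing": Model.CLAUDE,
    "summarization": Model.CLAUDE,
    "email": Model.CLAUDE,
}


def pick_model(task_label, default=Model.CLAUDE):
    """Map a classifier label to a model; unknown labels use the assumed default."""
    return TASK_TO_MODEL.get(task_label.strip().lower(), default)
```

Falling back to the top tier trades cost for output quality when the classifier returns something unexpected; a cost-first deployment might prefer the cheap tier as the default instead.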

Deploying to Railway

1. Push this repo to GitHub.
2. Go to railway.app → New Project → Deploy from GitHub.
3. Add the environment variables from .env.example in the Railway dashboard.
4. Railway auto-detects the Dockerfile and deploys.

Connecting to n8n / WhatsApp

Point your n8n HTTP Request node at POST /chat:

{
  "message": "{{ $json.body.data.message.conversation }}",
  "session_id": "{{ $json.body.data.key.remoteJid }}"
}

This integrates directly with the Evolution API → n8n WhatsApp workflow.
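For readers who want to test the mapping outside n8n, the two expressions above amount to the following lookup. This is a sketch: `to_chat_body` is a hypothetical helper, the nested field names are taken from the n8n expressions, and returning `None` for items without a text message is an assumption about how non-text events should be handled.

```python
def to_chat_body(item):
    """Map a webhook item to the POST /chat body, or None if it has no text."""
    data = item.get("body", {}).get("data", {})
    text = (data.get("message") or {}).get("conversation")
    sender = (data.get("key") or {}).get("remoteJid")
    if not text or not sender:
        return None  # e.g. media or status events with no conversation text
    return {"message": text, "session_id": sender}
```

Using the sender's `remoteJid` as `session_id` gives each WhatsApp contact its own conversation history.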

Phase Roadmap

| Phase | What |
| --- | --- |
| ✅ 1 | FastAPI backend + 3-model routing |
| 🔜 2 | LangChain tool use (Gmail draft, Sheets) |
| 🔜 3 | LangGraph (approval nodes, retries, state) |
| 🔜 4 | Alexa Custom Skill voice interface |
| 🔜 5 | Android App Actions voice interface |
