Skip to content

AlperNab/model-router

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

36 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

🧠 Model Router

Infrastructure UX for provider routing, tools, cost, billing, artifacts, and platform-level operations.

Python FastAPI SQLite Local LLM Cloud LLM No Fake Data

Domain: AI Platform / Routing Β· Suite: AI Platform Core Β· Accent: #818cf8

πŸš€ Quick Start Β· ✨ Features Β· πŸŽ›οΈ Customization Β· 🧠 LLM Providers Β· πŸ§ͺ Testing Β· 🧯 Troubleshooting


🧭 What This Project Does

Model Router is a standalone, browser-based AI workflow app for AI Platform / Routing. It turns structured inputs, uploaded files, and project-specific settings into reviewable outputs using a deterministic local engine plus optional local/cloud LLM enhancement.

Core job: Request β†’ best model/provider.

Designed for: Domain operator, business owner, analyst, or team member who needs this workflow executed reliably.

Why use it:

  • 🧩 Standalone project folder: run this project by itself without depending on a central dashboard.
  • πŸ–₯️ Elegant GUI: includes project-specific panels, structured forms, upload handling, output preview, and exports.
  • 🧠 Model-flexible: choose local models for privacy or cloud models for stronger reasoning.
  • 🧾 Auditable: every run is stored in SQLite with inputs, settings, result, and export history.
  • 🚫 No fake live data: external systems are only used when real API keys/connectors are configured.
  • πŸ›‘οΈ Human review gates: sensitive legal, medical, hiring, finance, or security outputs are flagged for review.

✨ Features

  • fallback routing
  • cost/latency/quality scoring
  • local/cloud policy
  • retry logic
  • prompt classification
  • provider health checks
  • audit logs

🧱 Built-In Platform Capabilities

  • ⚑ FastAPI backend with documented JSON endpoints.
  • 🎨 Responsive web UI with dark, polished SaaS-style layout.
  • πŸ“ File upload and text extraction for common document/code formats.
  • πŸ—‚οΈ Job history saved locally in data/*.sqlite3.
  • πŸ” Encrypted provider settings for API keys and local endpoints.
  • πŸ“€ Exports to Markdown, JSON, DOCX, and PDF when dependencies are available.
  • πŸ”Œ Provider routing for local and cloud LLMs.
  • πŸ§ͺ Local test file to verify the project runs.

🎨 UX/UI Design

UX profile: AI Platform Control Plane

Workflow layout: Provider/tool intake β†’ routing/security β†’ execution β†’ audit/export

Empty state: Configure local/cloud providers or paste a platform workflow. No provider keys are required for Rule Engine mode.

Main UI Components

  • Provider routing matrix
  • Tool registry
  • Cost/billing controls
  • Security settings
  • Audit trail

Review / Workflow Lanes

  • Configure
  • Route
  • Execute
  • Audit
  • Optimize

Metrics Shown in the Interface

  • Provider readiness
  • Cost control
  • Security posture
  • Execution reliability

Quick Actions

  • Configure providers
  • Review routing rules
  • Check security controls
  • Export audit package

🧩 Project Inputs

These are the main fields exposed by the GUI and /api/run. Required fields are enforced before execution.

Field Type Required Default Purpose
request
Request
text Yes β€” Affects input: Request.
work_brief
Work brief / source text / URL / instructions
textarea Yes β€” Paste the material, URL, description, or instruction needed for this project.

πŸŽ›οΈ Customization

This project is not a generic prompt box. The customization controls are connected to workflow behavior, validation, output shape, and export format.

Field Type Required Default Purpose
execution_mode
Execution mode
select No Production Controls strictness, depth, and output format for this project workflow.
provider_keys
provider keys
text No β€” Affects customization: provider keys.
default_models
default models
select No β€” Affects customization: default models.
privacy_mode
privacy mode
select No cloud allowed Affects customization: privacy mode.
cost_cap
cost cap
text No β€” Affects customization: cost cap.
latency_target
latency target
text No β€” Affects customization: latency target.
fallback_chain
fallback chain
text No β€” Affects customization: fallback chain.
task_routing_rules
task routing rules
textarea No β€” Affects customization: task routing rules.
output_format
output format
select No Markdown Affects customization: output format.
language
language
select No English Affects customization: language.
confidence_threshold
Confidence threshold
slider No 75 Items below this confidence are escalated to the human review queue.

Select / Option Controls

  • Execution mode: Draft, Production, Audit / strict review, JSON/API output
  • privacy mode: cloud allowed, local only, redact sensitive data
  • output format: Markdown, JSON, CSV, PDF, DOCX, XLSX
  • language: English, Arabic, Egyptian Arabic, French, German, Spanish

🧠 LLM Providers

You can run the project with the local deterministic engine, or enhance the output with a configured LLM provider.

Supported Provider Types

Provider Type Examples Best For
Local OpenAI-compatible Ollama, LM Studio, vLLM Private files, offline/local workflows, cost control
Cloud OpenAI-compatible OpenAI, OpenRouter, custom gateway General high-quality generation and structured output
Anthropic Claude models Long-context reasoning and document-heavy workflows
Google Gemini Gemini models Multimodal or Google ecosystem workflows
Mistral Mistral API Fast European cloud models
Azure OpenAI Azure deployments Enterprise-controlled cloud deployment
AWS Bedrock Bedrock-hosted models AWS enterprise environments

Recommended Model Usage

Use Case Recommendation
Drafting fast cloud or local instruct model
Reasoning strong reasoning model
Private documents local model via Ollama/LM Studio/vLLM
Vision/PDF pages vision-capable model when image pages are used

πŸš€ Quick Start

1) Clone or open this folder

cd model-router

2) Run on macOS / Linux / WSL

chmod +x run_gui.sh
./run_gui.sh

3) Run on Windows PowerShell

Set-ExecutionPolicy -Scope Process -ExecutionPolicy Bypass
.\run_gui_windows.ps1

4) Open the GUI

http://127.0.0.1:9137

πŸ› οΈ Manual Installation

Use this when you want full control instead of the run scripts.

cd model-router
python -m venv .venv
source .venv/bin/activate      # Windows: .venv\Scripts\activate
python -m pip install --upgrade pip
pip install -r requirements.txt
cp .env.example .env           # Windows: copy .env.example .env
uvicorn app.main:app --host 127.0.0.1 --port 9137

πŸ” Environment Variables

The project can be configured through the GUI settings screen or .env/environment variables.

Variable Purpose
AI_SUITE_HOST Host to bind the local app, usually 127.0.0.1.
AI_SUITE_PORT Port for this project GUI, default 9137.
AI_SUITE_DB SQLite database path for job history.
AI_SUITE_SECRET_KEY Secret used for local encryption/signing. Set this in production.
OPENAI_API_KEY Enables OpenAI-compatible cloud calls.
ANTHROPIC_API_KEY Enables Anthropic/Claude calls.
GEMINI_API_KEY Enables Google Gemini calls.
OPENROUTER_API_KEY Enables OpenRouter model routing.
MISTRAL_API_KEY Enables Mistral cloud models.
AZURE_OPENAI_ENDPOINT Azure OpenAI endpoint URL.
AZURE_OPENAI_API_KEY Azure OpenAI key.
AZURE_OPENAI_DEPLOYMENT Azure deployment name.
OLLAMA_BASE_URL Local Ollama OpenAI-compatible base URL.
LMSTUDIO_BASE_URL Local LM Studio OpenAI-compatible base URL.
VLLM_BASE_URL Local vLLM OpenAI-compatible base URL.

πŸ–₯️ How to Use the GUI

  1. Open the local URL.
  2. Review the project purpose and workflow lanes.
  3. Fill the required input fields.
  4. Adjust only the project-related customization controls.
  5. Upload source files when needed.
  6. Choose Rule Engine for local deterministic output or select a configured LLM provider.
  7. Run the workflow.
  8. Review warnings, scorecards, and output sections.
  9. Export the result as Markdown, JSON, DOCX, or PDF.

πŸ”„ Workflow

  • Request
  • best model/provider

Analysis Modules

  • routing_decision
  • budget_guard
  • privacy_gate

Output Sections

  • Model choice
  • Fallback chain
  • Cost/risk estimate
  • Routing rules

Scorecards

  • Input completeness
  • Domain fit
  • Risk level
  • Automation readiness
  • Export readiness

πŸ“€ Outputs & Exports

  • routing API
  • usage logs
  • provider config
  • failover reports

The export system is designed for reviewable deliverables. For regulated or business-critical work, export drafts should be reviewed before sending to clients, customers, patients, employees, authorities, or production systems.


πŸ”Œ Real Integrations & Connector Policy

Configured integrations in this standalone folder:

  • File upload
  • REST API
  • Export download
  • Job history
  • Provider health endpoints optional

Real Connector Requirements

  • Real source files or text input
  • Provider API key or local LLM endpoint for LLM-enhanced output
  • Human review for business-critical decisions

Important: this project does not simulate live data. If a workflow needs live Shopify, ATS, ERP, tax, customs, medical, security, market, map, analytics, or repository data, it must be connected with valid credentials and real API access. Missing connectors should produce clear setup errors rather than invented results.


🧯 Guardrails

  • Show uncertainty and confidence
  • Cite evidence from input when possible
  • Human review required for legal, medical, financial, hiring, or security decisions
  • Do not invent facts absent from input

Recommended operating rules:

  • βœ… Use local models for private or sensitive files.
  • βœ… Keep API keys out of Git.
  • βœ… Review low-confidence or high-impact outputs manually.
  • βœ… Keep source files and exported deliverables organized under data/.
  • ❌ Do not treat AI output as legal, medical, tax, hiring, trading, or security authority without expert review.

πŸ§ͺ Testing

Run the local smoke test:

python tests/test_single_project.py

Run a health check after starting the server:

curl http://127.0.0.1:9137/api/health

Expected result: the API returns ok: true and identifies this project.


🧬 API Usage

Method Endpoint Use
GET / Opens the browser GUI.
GET /api/health Health check for deployment and uptime monitoring.
GET /api/projects Returns the local project configuration.
GET /api/projects/{slug} Returns the project plugin metadata.
GET /api/providers Lists configured providers and local/cloud options.
POST /api/providers Saves provider settings/API keys.
POST /api/upload Uploads source files for extraction or context.
POST /api/run Runs the project workflow.
GET /api/jobs Lists previous runs and job history.
GET /api/jobs/{job_id} Reads one completed job.
GET /api/jobs/{job_id}/export/{fmt} Exports a job as md, json, docx, or pdf.
GET /api/project-local-status Verifies local project registration and implementation status.

Minimal Run Request

curl -X POST http://127.0.0.1:9137/api/run \
  -H "Content-Type: application/json" \
  -d '{
    "inputs": {
      "work_brief": "Paste the source material or task details here"
    },
    "customization": {
      "execution_mode": "Production"
    },
    "provider": "rule_engine"
  }'

πŸ“ Folder Structure

model-router/
β”œβ”€ app/                         # FastAPI backend, schemas, DB, providers, exports
β”œβ”€ static/                      # Browser GUI assets
β”œβ”€ plugins/                     # Project plugin JSON metadata
β”œβ”€ data/                        # SQLite DB, uploads, exports
β”œβ”€ tests/                       # Smoke tests
β”œβ”€ project_config.json          # Project-specific inputs, controls, UX, workflow
β”œβ”€ PROJECT_IMPLEMENTATION.md    # Implementation details and domain notes
β”œβ”€ requirements.txt             # Python dependencies
β”œβ”€ run_gui.sh                   # macOS/Linux/WSL launcher
β”œβ”€ run_gui_windows.ps1          # Windows PowerShell launcher
└─ README.md                    # This file

🚒 Deployment Notes

For local/private deployment, run with uvicorn behind a reverse proxy if needed. For production:

  • Set AI_SUITE_SECRET_KEY.
  • Use HTTPS.
  • Store provider keys in environment variables or a proper secret manager.
  • Restrict upload sizes and allowed file types.
  • Back up the SQLite database or move job storage to a managed database.
  • Add authentication before exposing beyond localhost.
  • Enable logging and monitoring.

Example production-style command:

AI_SUITE_HOST=0.0.0.0 AI_SUITE_PORT=9137 uvicorn app.main:app --host 0.0.0.0 --port 9137

🧯 Troubleshooting

Problem Fix
python not found Install Python 3.10+ and ensure it is on PATH.
PowerShell blocks the script Run Set-ExecutionPolicy -Scope Process -ExecutionPolicy Bypass.
Port already in use Set another port: AI_SUITE_PORT=9200 ./run_gui.sh.
Provider fails Verify API key, base URL, selected model, and account quota.
Local model fails Start Ollama/LM Studio/vLLM before running the workflow.
PDF/DOCX export fails Reinstall requirements and confirm optional export dependencies installed.
Upload extraction is incomplete Use cleaner source files or paste the important text into work_brief.

🧭 Extension Points

You can extend this project by editing:

  • project_config.json for inputs, settings, output sections, UX metadata, and workflow labels.
  • plugins/model-router.json for plugin metadata.
  • app/domain_engine.py for deterministic business logic.
  • app/llm_gateway.py for provider integrations.
  • static/app.js and static/styles.css for GUI behavior and component design.
  • tests/test_single_project.py for stronger project-specific tests.

βœ… Final Implementation Status

Area Status
Standalone folder GUI βœ… Implemented
FastAPI backend βœ… Implemented
Project-specific config βœ… Implemented
Local deterministic workflow βœ… Implemented
Local/cloud LLM routing βœ… Implemented
Uploads and exports βœ… Implemented
Job history βœ… Implemented
Real external connectors ⚠️ Requires valid credentials/API setup
Fake/simulated live data ❌ Not allowed

πŸ“œ License

Use the license included in this folder. If no explicit license is present, treat the code as private until you choose one.

About

Request to best model/provider

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors