Stop AI data leaks and control spend with a drop-in proxy that sits between your app and LLM providers.
PII redaction (28+ entities + OCR) · Prompt injection blocking · Budget enforcement · Full governance dashboard.
Website · Docs · API Reference · Product Tour · Model Catalog

    Your App ──▶ AI ModelGate ──▶ LLM Provider
                 (AI Firewall)
                  ├─ PII Scan (28+ entities + OCR)
                  ├─ Prompt Injection Detection
                  ├─ Budget & Token Enforcement
                  └─ Smart Cost Routing

ModelGate is an OpenAI-compatible proxy. If your app uses the OpenAI SDK, integration is a two-line change — swap the baseURL and apiKey:

TypeScript:

    import OpenAI from "openai";

    const client = new OpenAI({
      apiKey: "os_hub_your_key_here", // ← Hub API key
      baseURL: "https://api.aimodelgate.ai/v1", // ← Hub endpoint
    });

    const res = await client.chat.completions.create({
      model: "oah/llama-3.3-70b", // smart-routed virtual model
      messages: [{ role: "user", content: "Explain zero-trust architecture." }],
    });
    console.log(res.choices[0].message.content);

Python:

    from openai import OpenAI

    client = OpenAI(
        api_key="os_hub_your_key_here",  # ← Hub API key
        base_url="https://api.aimodelgate.ai/v1",  # ← Hub endpoint
    )

    res = client.chat.completions.create(
        model="oah/llama-3.3-70b",  # smart-routed virtual model
        messages=[{"role": "user", "content": "Explain zero-trust architecture."}],
    )
    print(res.choices[0].message.content)

Go:

    package main

    import (
        "context"
        "fmt"

        "github.com/openai/openai-go"
        "github.com/openai/openai-go/option"
    )

    func main() {
        client := openai.NewClient(
            option.WithAPIKey("os_hub_your_key_here"), // ← Hub API key
            option.WithBaseURL("https://api.aimodelgate.ai/v1"), // ← Hub endpoint
        )

        completion, err := client.Chat.Completions.New(context.TODO(), openai.ChatCompletionNewParams{
            Model: "oah/llama-3.3-70b", // smart-routed virtual model
            Messages: []openai.ChatCompletionMessageParamUnion{
                openai.UserMessage("Explain zero-trust architecture."),
            },
        })
        // Stop on a failed request instead of dereferencing a nil response.
        if err != nil {
            panic(err)
        }
        fmt.Println(completion.Choices[0].Message.Content)
    }

cURL:

    curl -X POST https://api.aimodelgate.ai/v1/chat/completions \
      -H "Authorization: Bearer os_hub_your_key_here" \
      -H "Content-Type: application/json" \
      -d '{
        "model": "oah/llama-3.3-70b",
        "messages": [{"role": "user", "content": "Explain zero-trust architecture."}],
        "max_tokens": 512
      }'

Model names: use `oah/`-prefixed virtual models for smart cost-routing, or explicit provider models like `groq/llama-3.3-70b-versatile`. See the full model catalog.

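
The naming convention above can be checked on the client before a request is sent. The following is a minimal sketch, not part of any ModelGate SDK: the helper names `split_model` and `is_virtual` are hypothetical, and the only thing taken from this README is that model names look like `<prefix>/<model>`, with `oah` marking a smart-routed virtual model.

```python
from __future__ import annotations


def split_model(name: str) -> tuple[str, str]:
    """Split '<prefix>/<model>' into (prefix, model).

    Hypothetical helper: raises ValueError when the prefix or model part is missing.
    """
    prefix, sep, model = name.partition("/")
    if not sep or not prefix or not model:
        raise ValueError(f"expected '<prefix>/<model>', got {name!r}")
    return prefix, model


def is_virtual(name: str) -> bool:
    """Return True when the name uses the 'oah' virtual-model prefix."""
    return split_model(name)[0] == "oah"
```

For example, `is_virtual("oah/llama-3.3-70b")` is true, while `is_virtual("groq/llama-3.3-70b-versatile")` is false because the prefix names an explicit provider.
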
| Layer | What It Does |
|---|---|
| PII Redaction | Detects and redacts 28+ entity types (SSN, credit cards, emails, phone numbers, API keys, etc.) + OCR for images — before the LLM ever sees your data |
| Prompt Injection | Blocks adversarial prompt injection patterns in real-time |
| Budget Enforcement | Per-project credit quotas with hard limits — no more surprise AI bills |
| Token Limits | Configurable max tokens per request (up to 32K) to prevent runaway costs |
| Stateless Architecture | Prompt content and AI responses are never stored; only metadata is logged. Credentials are encrypted with AES-256 |
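
The token limit in the table can also be respected on the client, so an oversized `max_tokens` never leaves your app. This is a rough sketch under stated assumptions: `build_chat_payload` and `MAX_TOKENS_CEILING` are hypothetical names, and reading "32K" as 32,000 is a guess, so check the docs for the exact ceiling. The payload shape mirrors the cURL request in this README.

```python
# Assumption: "up to 32K" is read as 32,000 tokens; the exact value may differ.
MAX_TOKENS_CEILING = 32_000


def build_chat_payload(model, prompt, max_tokens=512, ceiling=MAX_TOKENS_CEILING):
    """Build a chat-completions request body with max_tokens capped at `ceiling`."""
    if max_tokens < 1:
        raise ValueError("max_tokens must be at least 1")
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        # Cap the request client-side instead of relying on the proxy to reject it.
        "max_tokens": min(max_tokens, ceiling),
    }
```

The returned dictionary can be serialized to JSON and posted to `/v1/chat/completions`, or its fields passed to the OpenAI SDK call shown earlier.
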

Route to 100+ models across 9 providers with one API key:

| Provider | Example Models |
|---|---|
| OpenAI | GPT-4.1, GPT-4.1 Mini, o4-mini |
| Anthropic | Claude Sonnet 4.6, Opus 4.6, Haiku 4.5 |
| Google Gemini | Gemini 2.5 Pro, Gemini 2.5 Flash |
| Groq | Llama 4 Scout, Llama 3.3, DeepSeek |
| xAI | Grok 3, Grok 3 Mini |
| Together.ai | Llama 4, DeepSeek R1, Qwen 3 |
| Mistral AI | Mistral Large, Small, Codestral |
| AWS Bedrock | Claude, Llama, Mistral via Bedrock |
| DeepInfra | DeepSeek V3, Llama 4, Qwen 3 |

| Repository | Description |
|---|---|
| quickstart-ts | TypeScript / Node.js quickstart — clone, npm install, run |
| quickstart-python | Python quickstart — clone, pip install, run |
| quickstart-go | Go quickstart — clone, go run, done |
| postman | Postman collection + environment for the full API |

| Plan | What You Get |
|---|---|
| Free | 1,000,000 ModelGate Credits on signup. Full DLP protection. All features. |
| Managed Credits | Pay-as-you-go. Add credits to your wallet. We handle provider accounts. |
| Pro (BYOK) | $29/month. Bring your own provider API keys. Zero markup on LLM costs. |

No credit card required to start. Sign up and get 1M credits instantly.
Security · Privacy Policy · Terms of Service · Contact
Built by Datum Fuse LLC · Stateless Architecture · No Data Retention · AES-256 Encryption