guardrail

Runtime safety proxy for AI apps.

guardrail is a drop-in proxy that protects your AI features from prompt injection, jailbreaks, and unsafe outputs — with sub-50ms overhead and no vendor lock-in.

from guardrail import GuardrailProxy
import openai

client = GuardrailProxy(
    openai.OpenAI(api_key="..."),
    policy="guardrail.yaml"
)

# Input is checked before reaching the model
# Output is filtered before reaching your app
response = client.chat.completions.create(
    model="gpt-4o",
    messages=[{"role": "user", "content": user_message}]
)

Status

🚧 Early development. Star to follow progress.

What it does

Input protection — prompt injection, jailbreak, instruction override detection
Output filtering — PII leakage, regulated content (medical/legal/financial), harmful content, brand safety
Multi-provider — OpenAI, Anthropic, Google, Mistral, local models
Policy-as-code — define rules in YAML, update at runtime without redeploy
Audit log — every flagged call logged with reason code and severity
Sub-50ms — small classifier models, not LLM-as-judge on the critical path

Roadmap

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.github/workflows		.github/workflows
packages		packages
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
strategy.md		strategy.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

guardrail

Status

What it does

Roadmap

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

guardrail

Status

What it does

Roadmap

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages