One toolkit, three open products + the official cloud client, one free API key.
Eval your AI · Monitor it in production · Govern what ships · or call the hosted TrustModelClient.
pip install trustmodel
trustmodel login # free account → API key + 5 credits ($500). No credit card.
trustmodel eval "Take 500mg of metformin twice daily."Already have the TrustModel SDK installed (v2.x)?
eval/monitor/governand the MCP server arrived in v3.0.0 — a plainpip installis a no-op for you. Upgrade explicitly:pip install -U trustmodel # the three commands: evaluate / monitor / govern pip install -U "trustmodel[mcp]" # …plus the embeddable MCP serverYour existing
TrustModelClientcode keeps working unchanged — full details in the v2 → v3 upgrade guide.
trustmodel: command not found?pip install --userdrops the CLI in a per-userbin/that may not be on yourPATH(e.g.~/.local/binon Linux,~/Library/Python/3.x/binon macOS). Two fixes:# A) run it as a module — always works, no PATH changes needed python -m trustmodel login # B) add pip's user bin to PATH (find it with: python -m site --user-base) export PATH="$(python -m site --user-base)/bin:$PATH" # add to ~/.zshrc or ~/.bashrcInstalling into a virtualenv (
python -m venv .venv && source .venv/bin/activate) avoids this entirely — thetrustmodelscript lands on yourPATHautomatically.
🔴 TrustScore: 41/100 (Grade D) [local]
safety ██········ 18 ⚠
accuracy ██████···· 55
explainability █████····· 47 ⚠
privacy █████████· 90
… 6 more
Flagged:
• [high] safety: appears to give unverified medical/dosage advice
I'm the founder of TrustModel. I built this because "is this AI safe to ship?" shouldn't require a sales call to answer. Install it, read the code, and score your own AI across the same 10 dimensions our enterprise customers use. Your first 5 credits ($500) are on me — create a free account and you can run all three products today. — @karlmehta
Every product needs a free TrustModel API key. Creating a developer account takes ~30 seconds, needs no credit card, and grants 5 credits ($500) to spend across Eval, Monitor, and Govern.
# 1. Sign up (free, 5 credits / $500): https://trustmodel.ai/signup
# 2. Save your key:
trustmodel login
# or: export TRUSTMODEL_API_KEY=tm-...Calibrated cloud scoring spends credits (your first scan per model is free). Local scoring with your own OpenAI/Anthropic key is unmetered — the account just keeps your usage and dashboard in sync.
Score any AI output across 10 trust dimensions and roll it into a 0–100 TrustScore.
from trustmodel import evaluate
result = evaluate("Based on your resume you're not a culture fit. We can't say why.")
print(result.trust_score) # 38.0
print(result.grade) # "F"
print(result.dimensions) # {"explainability": 0.25, "fairness": 0.25, ...}
for v in result.violations:
print(v.severity, v.dimension, v.detail)trustmodel eval ./agent_outputs.jsonl --json # batch / CI-friendly
trustmodel eval "..." --cloud # calibrated cloud score (uses credits)Local scoring uses your own LLM as the judge (OpenAI or Anthropic), at temperature 0, on a 5-point ordinal scale per dimension — so it's reproducible and auditable. No LLM key? It falls back to a transparent heuristic judge so it always runs.
pip install "trustmodel[openai]" # or [anthropic]
export OPENAI_API_KEY=sk-...Continuously score your AI in production. Wrap a function or auto-instrument your LLM client.
from trustmodel import monitor
@monitor(threshold=80) # alert when a response scores below 80
def answer(question: str) -> str:
return my_llm(question)
answer("How do I treat a fever?")
print(answer.monitor.stats()) # {"count": 1, "avg_trust_score": 72.0, "below_threshold": 1}One-line auto-instrumentation + optional OpenTelemetry export:
from trustmodel import auto_init
auto_init(otel=True) # local inline scoring + OTEL spans
auto_init(api_key="tm-...") # also forward traces to your cloud dashboard
import openai
openai.chat.completions.create(...) # every call now scored automaticallyEnforce policy before AI output reaches a user or another tool. Open-source policy packs map to real regulations.
from trustmodel import Guardrail
gr = Guardrail("eu-ai-act")
verdict = gr.check("Based on your resume you're not a culture fit. We can't say why.")
print(verdict.allowed) # False
print(verdict.violations) # [art13-explainability (high), ...]Gate an agent so blocked output never escapes:
from trustmodel import govern
@govern(policy="owasp-llm", on_block="redact")
def agent(prompt: str) -> str:
return my_agent(prompt)trustmodel policies # eu-ai-act, nist-ai-rmf, owasp-llm, nyc-ll144
trustmodel govern "..." --policy nyc-ll144Policy packs are plain YAML — contribute one for your jurisdiction (LGPD, AIDA, …).
The same pip install trustmodel also ships the official TrustModel cloud client for teams
on the hosted platform — calibrated TrustScores, agentic & RAG evaluation, COTS/Galileo
connectors, lending & HR bias verticals, batch jobs, and managed compliance frameworks.
from trustmodel import TrustModelClient
client = TrustModelClient(api_key="tm-...")
result = client.evaluations.create(model="gpt-4o", prompt="...", response="...")
print(result.trust_score)
client.frameworks.list(domain="fair_lending") # discover compliance frameworks
client.agentic.evaluate(...) # score multi-step agentsAuto-capture production agent traces and stream them to your TrustModel dashboard (enterprise
OTel mode — pass agent_id/domain/frameworks and auto_init routes to the telemetry forwarder):
from trustmodel import auto_init
auto_init(
api_key="tm-...",
agent_id="loan-advisor",
domain="fair_lending",
frameworks=["eu-ai-act-high-risk", "iso-42001"],
) # requires: pip install "trustmodel[telemetry]"Two surfaces, one install. The open engine above (
evaluate/monitor/Guardrail) is MIT and runs locally.TrustModelClientis the proprietary cloud client. Both ship in the onetrustmodelwheel — see LICENSE for the per-module split.
safety · fairness · accuracy · privacy · transparency · robustness · accountability · explainability · compliance · reliability
Mapped to EU AI Act, NIST AI RMF, ISO 42001, NYC Local Law 144, OWASP LLM Top 10.
| TrustModel | DeepEval / Promptfoo | Manual audit | |
|---|---|---|---|
| Trust score across 10 governance dimensions | ✅ | partial | ✅ |
| Eval + live monitoring + runtime governance | ✅ | eval only | ❌ |
| Regulation-mapped policy packs (EU AI Act, LL144…) | ✅ | ❌ | ✅ |
| Runs locally with your own LLM | ✅ | ✅ | ❌ |
| Calibrated, audit-ready score + report | ✅ (cloud) | ❌ | ✅ |
| Time to first result | 30 sec | minutes | weeks |
| Cost | free + $500 credits | free | $15k+ |
Expose Eval and Govern to any Model Context Protocol client (Claude Code, Cursor, Claude Desktop, …). Local evaluate, govern, and policies need no API key; score_cloud gives the calibrated, audit-ready score with a free key.
pip install "trustmodel[mcp]"
trustmodel-mcp # or: trustmodel mcp — runs the server on stdioZero-install with uv:
uvx --from "trustmodel[mcp]" trustmodel-mcpRegister it with Claude Code:
claude mcp add trustmodel -- uvx --from "trustmodel[mcp]" trustmodel-mcpOr add to Claude Desktop / Cursor (claude_desktop_config.json / .cursor/mcp.json):
{
"mcpServers": {
"trustmodel": {
"command": "uvx",
"args": ["--from", "trustmodel[mcp]", "trustmodel-mcp"]
}
}
}| Tool | Key? | What it does |
|---|---|---|
evaluate |
none | Local TrustScore across 10 dimensions (heuristic, or your own OpenAI/Anthropic key as judge). |
govern |
none | Allow/block check against a policy pack (eu-ai-act, nist-ai-rmf, nyc-ll144, owasp-llm, …). |
policies |
none | List built-in policy packs. |
score_cloud |
free key | Calibrated, benchmarked, audit-ready cloud TrustScore (TRUSTMODEL_API_KEY + trustmodel[cloud]). |
The
mcpextra requires Python ≥ 3.10. There's also a TypeScript MCP server —@trustmodel/mcp-server(repo).
The engine (evaluate / monitor / govern / Guardrail + policy packs) is MIT-licensed
and free — run it forever. The TrustModelClient cloud client, calibrated hosted TrustScore,
PDF compliance reports, certification badges, and in-VPC agent governance are the commercial layer
at trustmodel.ai and ship under the proprietary
TrustModel SDK License. One pip install trustmodel, two licenses — upgrade when you
need a score you can hand to an auditor.
📚 Docs & wiki · 🤗 Live demo · 💬 Discussions · 🔑 Get your free key
⭐ If this is useful, star it — it's how I know to keep building.