PermBench

The open-source benchmark for sub-call permission control in regulated vertical AI agents.

PermBench tests whether your AI agent enforces the 5 permission boundaries that matter in legal, healthcare, and financial verticals:

cross_tenant — does it leak data across matter / patient / client boundaries?
scope_upgrade_silent — does it silently elevate from read to write/delete?
token_forward_inherit — do sub-agents inherit credentials they shouldn't?
composite_violation — do N legal calls combine into one illegal action?
high_risk_action_silent — do export / delete / send execute without escalation?

Status

This is a placeholder repo for the PermBench v0.1 launch on 2026-06-15.

The benchmark suite, scorer, leaderboard, and 120+ failure cases are landing in the next 3 weeks. Watch this repo or follow @permforge for the launch.

What lands at launch

Artifact	Status
120+ scenario test cases (legal · healthcare · financial)	in progress
Scoring rubric mapping to EU AI Act Annex III · ABA 5.3 · HIPAA Minimum Necessary	in progress
Reference adapters · OpenAI Agents SDK · LangGraph · CrewAI · AutoGen	in progress
Public leaderboard at https://permbench.permforge.com	in progress
RFC v2 (taxonomy + 10-standard comparison)	done · published in private workspace

License

Apache License 2.0 — see LICENSE. Free to fork, run on your own agent, and cite in your AI risk review.

Contact

Email · contact@permforge.com
Site · https://permforge.com

PermForge ≠ Perforce — we are AI agent permission runtime, not version control.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PermBench

Status

What lands at launch

Related

License

Contact

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

PermBench

Status

What lands at launch

Related

License

Contact

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Packages