Security Policy

Reporting a Vulnerability

If you discover a security vulnerability in AgentGuard, please report it responsibly.

Do NOT open a public GitHub issue for security vulnerabilities.

Email the maintainers directly with:

We will acknowledge receipt within 48 hours and provide a timeline for the fix.

The following are in scope for security reports:

AgentGuard includes a dedicated security test suite (packages/core/tests/security/) with 92 tests covering:

Run the security test suite:

make test-security

Server-side trust computation: Clients cannot escalate trust levels. The server always has the final say.
Defense in depth: Three independent detection layers (rules, anomaly, semantic) must all agree before a suspicious operation is allowed.
Physical separation: A two-phase architecture ensures that raw external data never coexists with tool execution capability.
Fail-closed: When in doubt, block and require human confirmation.
Immutable audit: Merkle tree traces make tampering detectable.
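
To illustrate how the defense-in-depth and fail-closed principles combine, here is a minimal Python sketch. It is not AgentGuard's implementation: the `Verdict` enum, the `decide` function, the `Layer` signature, and the three toy layers are all hypothetical names chosen for this example. The only behavior taken from the principles above is that every layer must approve, and that anything short of unanimous approval results in a block pending human confirmation.

```python
from enum import Enum
from typing import Callable, Sequence


class Verdict(Enum):
    ALLOW = "allow"
    BLOCK = "block"  # blocked pending human confirmation


# Hypothetical layer signature: returns True only if the layer
# considers the operation safe.
Layer = Callable[[str], bool]


def decide(operation: str, layers: Sequence[Layer]) -> Verdict:
    """Allow only if every layer explicitly approves; anything else blocks."""
    if not layers:
        # No detection layers configured: nothing vouches for the operation.
        return Verdict.BLOCK
    for layer in layers:
        try:
            approved = layer(operation)
        except Exception:
            # A crashing layer counts as a rejection, not as an abstention.
            return Verdict.BLOCK
        if approved is not True:
            return Verdict.BLOCK
    return Verdict.ALLOW


# Toy stand-ins for the rules, anomaly, and semantic layers (assumptions).
def rules_layer(op: str) -> bool:
    return "delete" not in op


def anomaly_layer(op: str) -> bool:
    return len(op) < 200


def semantic_layer(op: str) -> bool:
    return not op.lower().startswith("ignore previous")
```

Calling `decide("read report.txt", [rules_layer, anomaly_layer, semantic_layer])` allows the operation, while a single dissenting or failing layer is enough to block it. Treating exceptions and an empty layer list as rejections is what makes the sketch fail closed rather than fail open.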
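
The tamper-detection property of a Merkle tree can be sketched in a few lines of Python. AgentGuard's actual trace format is not described in this document, so the details here are assumptions made for illustration: SHA-256 as the hash, the `0x00`/`0x01` leaf and node prefixes, promoting an unpaired node to the next level, and the function name `merkle_root`.

```python
import hashlib
from typing import Sequence


def _sha256(data: bytes) -> bytes:
    return hashlib.sha256(data).digest()


def merkle_root(entries: Sequence[bytes]) -> bytes:
    """Compute a Merkle root over an ordered sequence of audit entries."""
    if not entries:
        return _sha256(b"")
    # Prefix leaves and interior nodes differently so that a leaf can never
    # be confused with the hash of two children (assumed convention).
    level = [_sha256(b"\x00" + entry) for entry in entries]
    while len(level) > 1:
        next_level = []
        for i in range(0, len(level), 2):
            if i + 1 < len(level):
                next_level.append(_sha256(b"\x01" + level[i] + level[i + 1]))
            else:
                # Unpaired node: carry it up unchanged.
                next_level.append(level[i])
        level = next_level
    return level[0]
```

If the root is recorded somewhere the attacker cannot rewrite, recomputing it over the stored entries reveals any modification: changing, reordering, or dropping a single entry yields a different root. The tree makes tampering detectable; it does not by itself prevent it.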