Agent Guard: a runtime security proxy for MCP tool calls #781

mmbinumer · 2026-06-11T23:58:58Z

mmbinumer
Jun 11, 2026

Pre-submission Checklist

This post follows the MCP community guidelines

What would you like to share?

What I built:
A runtime security proxy for MCP: it sits between your MCP client (Claude Desktop, Cursor, or any MCP compatible client) and your real MCP servers, checking every tool call for secrets in transit, dangerous commands (rm -rf, curl | sh, etc.), and accidental data exfiltration via taint tracking (e.g. if an agent reads .env and that value later shows up in a call to an external tool, it gets blocked). Also has a prompt-injection tripwire. Every call is logged with a risk score and verdict (allowed / warned / blocked / error), plus a kill switch and audit-only mode for tuning.

There's a growing space of MCP runtime guards (e.g. policy/rate-limit proxies, static scanners), Agent Guard's main differentiator is taint tracking for accidental exfiltration: if an agent reads a sensitive file and that value later flows to an external-facing tool, the call is blocked.

How I built it:
Python, implemented as an MCP server itself: it proxies tools/list and tools/call to whatever downstream servers you configure in agent-guard.yaml. Tool names get aggregated as <server>__<tool>. Detection runs pre and post-call (pattern matching + a small taint store).

Challenges:
Getting taint tracking to not be too noisy — landed on exact-value matching (plus basic base64/hex decoding) for v1. Also hit an MCP tool-naming validation issue along the way (. separators aren't allowed in tool names, switched to __).

Still early, would love feedback, especially from anyone running agents with real tool access. Limitations are documented up front in the README.

Relevant Links

- GitHub repository: https://github.com/mmbinumer/agent-guard
** Demo**: examples/verdict_demos.py

minhtruongg · 2026-06-13T08:36:40Z

minhtruongg
Jun 13, 2026

Taint tracking is the right call. Most proxies catch the outbound pattern but miss that the value was read two steps earlier.

Hard case: agent reads a secret, a summarization step slightly transforms it, then it goes out. Does exact-value matching still catch it?

How does it hold up when multiple agent threads are running at the same time?

0 replies

mmbinumer · 2026-06-13T13:09:06Z

mmbinumer
Jun 13, 2026
Author

Honest answer: both are known gaps, not solved yet.

Transformed values: No, matching is exact substring (plus one level of base64/hex decode), so a paraphrase/truncation/re-encode breaks it. Called out in the README's limitations. Considering tagging subtokens to catch partial leaks, but that raises false positive, unsolved tradeoff.

Concurrency: Taint store is per session, in memory, asyncio safe for concurrent calls within a session. Sessions are isolated by design. Real limit is FIFO eviction (max_entries) could drop an earlier tagged value before a later leak attempt, missing the match on long sessions.

These are exactly the gaps that need solving before its prod ready

1 reply

minhtruongg Jun 13, 2026

Appreciate the honest breakdown. The FIFO eviction gap is the one that worries me most in production, a long agent session is exactly when secrets are highest risk.
We’re working on something in this space. Would be good to compare notes, are you open to a quick call?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Model Context Protocol

Agent Guard: a runtime security proxy for MCP tool calls #781

Uh oh!

{{title}}

Uh oh!

Replies: 2 comments 1 reply

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Model Context Protocol

Agent Guard: a runtime security proxy for MCP tool calls #781

Uh oh!

mmbinumer Jun 11, 2026

Pre-submission Checklist

What would you like to share?

Relevant Links

Replies: 2 comments · 1 reply

Uh oh!

minhtruongg Jun 13, 2026

Uh oh!

Uh oh!

mmbinumer Jun 13, 2026 Author

Uh oh!

minhtruongg Jun 13, 2026

mmbinumer
Jun 11, 2026

Replies: 2 comments 1 reply

minhtruongg
Jun 13, 2026

mmbinumer
Jun 13, 2026
Author