MCP client should validate tool descriptions for prompt injections and freeze them. #247968

pelikhan · 2025-05-01T18:34:30Z

The tool description returned by a MCP server may contain prompt injection strings in the tool description. The tool descriptions are inserted in the final prompt that gets processed by the client LLM.

Note that the rug-pull attack is a variant where the MCP server injects the prompt injection strings are a few uses or some other environment trigger. Thus, it is not detected on first load when the user reviews the tools.

Recommendation

Freeze the tool list and repeat any kind of validation when the tool list changes.
Run prompt injection detection services on the tool list to detect attempts at injecting prompts through the description.

This technique is implemented in genaiscript. https://microsoft.github.io/genaiscript/blog/mcp-tool-validation/

connor4312 · 2025-05-01T19:42:30Z

We don't validate tool descripts aside from ensuring they're safe from the model's schema (e.g. not too long for 4o).

I'm also not sure how useful validating tool descriptions are when tool responses are non-deterministic and a much better place to mount any kind of prompt-injection attack.

Additionally, the tools exposed by an MCP server may be non-deterministic and change over time. In fact that is ideal behavior for say a browser tool, where it might only expose an "open browser" tool initially and then not expose tools to interact with that browser until it's open. (No sense eating up the context window unnecessarily)

pelikhan · 2025-05-01T21:04:21Z

Once the tool description have been validated, you would want to store a hash and redo the validation whenever a change is done. Otherwise, it is possible for a malicious mcp server to dynamically change their tool description to shadow or mutate their intents.

connor4312 · 2025-05-01T22:54:26Z

That could only reasonably done in an automated way, we would not want to bug the users with prompts whenever tools change.

pelikhan · 2025-05-02T05:34:24Z

You could think of having a "lock" icon that allows the user to convey the intent to freeze the tool. At which point, it makes sense to notify that things changed.

vs-code-engineering bot added the triage-needed label May 1, 2025

vs-code-engineering bot assigned connor4312 May 1, 2025

connor4312 assigned digitarald May 1, 2025

connor4312 added the under-discussion label May 1, 2025

vs-code-engineering bot removed the triage-needed label May 1, 2025

digitarald added the chat-mcp label May 5, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

MCP client should validate tool descriptions for prompt injections and freeze them. #247968

MCP client should validate tool descriptions for prompt injections and freeze them. #247968

pelikhan commented May 1, 2025

connor4312 commented May 1, 2025 •

edited

Loading

pelikhan commented May 1, 2025

connor4312 commented May 1, 2025

pelikhan commented May 2, 2025

MCP client should validate tool descriptions for prompt injections and freeze them. #247968

MCP client should validate tool descriptions for prompt injections and freeze them. #247968

Comments

pelikhan commented May 1, 2025

Recommendation

connor4312 commented May 1, 2025 • edited Loading

pelikhan commented May 1, 2025

connor4312 commented May 1, 2025

pelikhan commented May 2, 2025

connor4312 commented May 1, 2025 •

edited

Loading