LLM07 - Insecure Plugin Design - Mitigation/How to Prevent Enhancements #242

Open
GangGreenTemperTatum opened this issue Nov 6, 2023 · 0 comments
Comments

GangGreenTemperTatum (Collaborator) commented Nov 6, 2023

I believe LLM07 could benefit from including some or all of the following mitigation methods in the vulnerability entry:

  • Human in the loop: a plugin should not be able to invoke another plugin by default, especially for plugins that perform high-stakes operations, and a human should confirm that generated content meets quality and ethical standards (see the sketch after this list).
  • It should be transparent to the user which plugin will be invoked and what data is sent to it, possibly even allowing the user to modify the data before it is sent.
  • A security contract and threat model for plugins should be created, so that we have a secure and open infrastructure in which all parties know what their security responsibilities are.
  • An LLM application must assume plugins cannot be trusted (e.g. via direct or indirect prompt injection), and similarly plugins cannot blindly trust LLM application invocations (example: a confused deputy attack).
  • Regularly perform red teaming and model serialization attacks, with thorough benchmarking and reporting of inputs and outputs.
  • Plugins that handle PII and/or impersonate the user are high stakes.
  • Isolation: an architecture that separates a Kernel LLM from Sandbox LLMs, as discussed previously, could help.
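
As a minimal sketch of the first two points, the snippet below gates high-stakes plugin calls behind explicit user confirmation and shows the user exactly which plugin would be invoked and with what data, allowing the payload to be edited or the call to be rejected before anything is sent. All names here (PluginCall, require_user_approval, dispatch, HIGH_STAKES_PLUGINS) are hypothetical and not part of any existing plugin framework.

```python
# Hypothetical human-in-the-loop gate for plugin invocations.
from dataclasses import dataclass
from typing import Optional

# Assumed set of plugins considered high stakes (e.g. PII or user impersonation).
HIGH_STAKES_PLUGINS = {"email_sender", "payments", "calendar_write"}

@dataclass
class PluginCall:
    plugin_name: str
    payload: dict

def require_user_approval(call: PluginCall) -> Optional[PluginCall]:
    """Show the user the plugin name and outgoing data; allow edit or veto."""
    print(f"The model wants to invoke plugin '{call.plugin_name}' with:")
    print(call.payload)
    answer = input("Approve (y), edit (e), or reject (anything else)? ").strip().lower()
    if answer == "y":
        return call
    if answer == "e":
        edited = input("Replacement payload as comma-separated key=value pairs: ")
        call.payload = dict(pair.split("=", 1) for pair in edited.split(",") if "=" in pair)
        return call
    return None  # Rejected: the plugin is never invoked.

def dispatch(call: PluginCall, invoke_plugin):
    # High-stakes plugins always require explicit human confirmation;
    # plugins are never allowed to invoke other plugins directly.
    if call.plugin_name in HIGH_STAKES_PLUGINS:
        approved = require_user_approval(call)
        if approved is None:
            return {"status": "rejected_by_user"}
        call = approved
    return invoke_plugin(call.plugin_name, call.payload)
```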

These mitigation techniques are primarily focused on combating indirect prompt injection, but they should be treated as a de facto standard. I also think there should be some sort of statement or wording such as "Plugins should never be inherently trusted".
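
To illustrate that statement, here is a minimal plugin-side sketch that derives authorization from the authenticated user session rather than from anything the model put in the request, which limits confused-deputy style escalation. The session store, Permission enum, and delete_document helper are all hypothetical assumptions for the example.

```python
# Hypothetical plugin-side check that does not blindly trust the LLM
# application's invocation (confused deputy mitigation).
from enum import Enum, auto

class Permission(Enum):
    READ = auto()
    DELETE = auto()

# Assumed session store mapping authenticated user IDs to granted permissions.
SESSION_PERMISSIONS = {
    "alice": {Permission.READ},
    "bob": {Permission.READ, Permission.DELETE},
}

def delete_document(doc_id: str) -> None:
    # Placeholder for the plugin's actual side effect.
    print(f"document {doc_id} deleted")

def handle_delete_request(authenticated_user: str, doc_id: str) -> dict:
    """Entry point the LLM application calls on behalf of a user.

    Authorization comes from the authenticated session, not from anything
    the model supplied, so a prompt-injected request cannot escalate
    beyond what the real user is allowed to do.
    """
    granted = SESSION_PERMISSIONS.get(authenticated_user, set())
    if Permission.DELETE not in granted:
        return {"status": "forbidden", "reason": "user lacks delete permission"}
    delete_document(doc_id)
    return {"status": "ok"}
```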

Resources and inspiration: kudos to embracethered.

GangGreenTemperTatum added the enhancement label Nov 6, 2023
GangGreenTemperTatum removed their assignment Feb 11, 2024