AgentDyn: A Dynamic Open-Ended Benchmark for Evaluating Prompt Injection Attacks of Real-World Agent Security System

Hao Li, Ruoyao Wen, Shanghao Shi, Ning Zhang, Chaowei Xiao.

The official implementation of the paper "AgentDyn: A Dynamic Open-Ended Benchmark for Evaluating Prompt Injection Attacks of Real-World Agent Security System".

AgentDyn is a dynamic, open-ended agent security benchmark featuring 60 challenging open-ended user tasks and 560 injection test cases across the Shopping, GitHub, and Daily Life scenarios. It is built on top of the AgentDojo framework. A huge thanks to the AgentDojo team for their admirable contribution to the community!

Quickstart

pip install -e .

Running the benchmark

For adaptability, we support an evaluation script same as AgentDojo's. Documentation on how to use the script can be obtained with the --help flag.

For example, to run the shopping suite , with gpt-4o-2024-08-06 as the LLM, the tool filter as a defense, and the attack with important_instructions, run the following command:

python -m agentdojo.scripts.benchmark -s shopping \
    --model GPT_4O_2024_08_06 \
    --defense tool_filter --attack important_instructions

Before running, please export your API key, through:

OpenAI Model: export OPENAI_API_KEY=XXX
Google Model: export GOOGLE_API_KEY=XXX
Open-sourced Models (Qwen, LlaMA): export OPENROUTER_API_KEY=XXX

Supported settings

Avaliable Suites:

AgentDyn supports shopping,github, and dailylife suites, as well as the original four suites from AgentDojo (banking,slack, travel and workspace).

Avaliable Models:

We evaluate the following models in our paper: GPT_4O_MINI_2024_07_18, GPT_4O_2024_08_06, GEMINI_2_5_FLASH, GEMINI_2_5_PRO, LLAMA_3_3_70B, QWEN3_235B, GPT_5_1_2025_11_13, GPT_5_MINI_2025_08_07. Other models supported by AgentDojo are also compatible.

Avaliable Defenses:

In addition to the original defenses in AgentDojo, we provide support for PIGuard and PromptGuard2. The complete list of defenses supported in our paper includes: repeat_user_prompt, spotlighting_with_delimiting, tool_filter, transformers_pi_detector, piguard_detector, prompt_guard_2_detector.

Inspect Results

To review the results reported in our paper, please refer to the log files in the (runs/).

References

If you find this work useful in your research or applications, we appreciate that if you can kindly cite:

@articles{AgentDyn,
  title={AgentDyn: A Dynamic Open-Ended Benchmark for Evaluating Prompt Injection Attacks of Real-World Agent Security System},
  author={Hao Li and Ruoyao Wen and Shanghao Shi and Ning Zhang and Chaowei Xiao},
  journal = {arXiv},
  eprint = {2602.03117},
  year={2026}
}

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
docs		docs
examples		examples
notebooks		notebooks
runs		runs
src/agentdojo		src/agentdojo
tests		tests
util_scripts		util_scripts
CITATION.bib		CITATION.bib
CLAUDE.md		CLAUDE.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
agentdojo_README.md		agentdojo_README.md
mkdocs.yml		mkdocs.yml
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AgentDyn: A Dynamic Open-Ended Benchmark for Evaluating Prompt Injection Attacks of Real-World Agent Security System

Quickstart

Running the benchmark

Supported settings

Avaliable Suites:

Avaliable Models:

Avaliable Defenses:

Inspect Results

References

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 3

Languages

Folders and files

Latest commit

History

Repository files navigation

AgentDyn: A Dynamic Open-Ended Benchmark for Evaluating Prompt Injection Attacks of Real-World Agent Security System

Quickstart

Running the benchmark

Supported settings

Avaliable Suites:

Avaliable Models:

Avaliable Defenses:

Inspect Results

References

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 3

Languages

Packages