[Phase 0.2.3] Create adversarial test framework

## Phase
`Phase 0 — Foundations` | Track 0.2 — Test Infrastructure | Priority: **P1**

## Summary
Create `tests/adversarial/` for prompt injection, fuzzing, and abuse scenario tests.

## What
- Create `tests/adversarial/__init__.py`
- Create `tests/adversarial/conftest.py` with:
  - `injection_payloads` — fixture loading prompt injection patterns from YAML/JSON
  - `mock_llm_with_injection` — simulates an LLM that returns injected content
  - `attack_scenario` — parameterized fixture for multi-step attack chains
- Create `tests/adversarial/payloads/` directory with:
  - `prompt_injection.yaml` — 50+ injection patterns
  - `indirect_injection.yaml` — web content injection patterns
  - `resource_exhaustion.yaml` — DoS attack patterns
- Integrate with `hypothesis` for property-based/fuzz testing

## Why
Adversarial testing is how you prove guardrails actually work. A curated payload library and testing framework means we can run red-team tests on every change.

## Acceptance Criteria
- [ ] `tests/adversarial/` directory exists with framework
- [ ] Payload files contain at least 50 prompt injection patterns
- [ ] `hypothesis` added to dev dependencies
- [ ] At least one adversarial test runs using the framework
- [ ] Framework supports parameterized attack scenarios

## References
- [NVIDIA Garak — LLM vulnerability scanner](https://github.com/NVIDIA/garak)
- [Hypothesis — property-based testing](https://hypothesis.readthedocs.io/)
- [OWASP LLM Top 10](https://genai.owasp.org/)
- Design Doc: `docs/plans/2026-03-29-security-ai-guardrails-performance-design.md`

## Blocks
All Phase 4.3 and 4.4 issues (adversarial and fuzzing tests)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Phase 0.2.3] Create adversarial test framework #9

Phase

Summary

What

Why

Acceptance Criteria

References

Blocks

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[Phase 0.2.3] Create adversarial test framework #9

Description

Phase

Summary

What

Why

Acceptance Criteria

References

Blocks

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions