docs: Add advisory requirement APTS-HO-A02 Disclosure and Mitigation of AI Influence on Operator Decisions by jorgeraad · Pull Request #45 · OWASP/APTS

jorgeraad · 2026-04-30T22:32:19Z

Context

Hi, I'm Jorge, a software engineer at Pensar. I work on core agent functionality in our open source offensive security agent harness, github.com/pensarai/apex.

AI disclosure: This contribution was drafted with AI assistance. I have reviewed all changes for accuracy, consistency with the standard, and compliance with the style guide, and take full ownership of the submission.

What changed and why

Adds Disclosure and Mitigation of AI Influence on Operator Decisions as a new advisory practice (APTS-HO-A02) in the Advisory Requirements appendix. This is the second advisory in the Human Oversight domain.

Autonomous pentest platforms increasingly use language models not only to act on the target but also to shape what the operator sees at decision time — the narrative that frames a finding, the option set in an approval prompt, the wording, the preselected default. This is a manipulation surface distinct from the prompt-injection threats covered by the Manipulation Resistance domain: there, an external entity manipulates the agent; here, the agent influences its own supervisor's choice through affordances the agent controls. Established findings from human-computer interaction (default bias, primacy bias, choice architecture effects) show these shaping decisions meaningfully influence which option an operator selects, even without adversarial intent.

Existing requirements partially address this risk but leave a gap. APTS-HO-001 mandates pre-approval gates; APTS-HO-005 requires a chain-of-custody audit trail of approval decisions; APTS-HO-010 mandates a human decision point before irreversible actions. None speak to the form of the question the operator is asked to confirm. APTS-AR-006 covers the agent's reasoning chain but not the model-shaped inputs handed to the human. The practical effect is that an audit trail can show "operator approved" while concealing that the operator was offered a single highlighted choice with the safer option visually de-emphasized.

The advisory text notes this practice is a candidate for tier-gated inclusion in v0.2.0.

Practice description

The advisory pairs provenance for AI-shaped operator affordances (items 1-3) with bias mitigation at high-impact gates (item 4). The four points are ordered by implementation cost:

Distinguish a default click-through from an actively-selected response (single audit field — lowest-cost wedge)
Log the model and prompt that shaped the operator's view (provenance, parallels existing agent-action logging)
Record the full option set, including filtered alternatives (capture pre-presentation candidates so reviewers can detect a suppressed deny/abort)
Reduce default and ordering bias for high-impact gates (no preselected default; equal visual weight for abort/deny; consider randomized order; consider typed confirmation for the most severe categories)

Affected requirements

New advisory: APTS-HO-A02
No new normative requirements, no tier-count changes
Advisory practice count: 13 → 14

Files changed

standard/appendix/Advisory_Requirements.md — New APTS-HO-A02 advisory entry (self-contained, follows the format of APTS-SC-A02 and APTS-RP-A01)
standard/3_Human_Oversight/README.md — Updated the appendix-advisory line to register HO-A02 alongside HO-A01; added an inline see-also reference under APTS-HO-005 (the requirement most directly extended by this advisory)
README.md, index.md, standard/README.md, standard/Introduction.md, standard/Frontispiece.md, standard/Getting_Started.md, standard/appendix/Glossary.md, standard/appendix/Vendor_Evaluation_Guide.md — Advisory practice count synced from 13 to 14 (matches the cross-file sync pattern from PR Update advisory practice count to 13 across all files #27)

No normative requirement counts changed (173 total, 72/157/173 tier counts unchanged). The machine-readable export (standard/apts_requirements.json) is intentionally untouched, consistent with the existing convention that advisory practices are excluded from the JSON export.

jinsonvarghese · 2026-05-01T08:43:18Z

Hi @jorgeraad, good advisory! The gap between logging that a decision happened (HO-005) and how the decision was presented to the operator is real. The four practice components are practical and well-ordered.

Two things before merging:

Rebase needed - PR docs: Add advisory requirement APTS-MR-A01 Goal Misgeneralization and Emergent Misalignment Evaluation Suite #43 (APTS-MR-A01) has been merged, so the advisory count on main is already 14. This PR needs to rebase and update all count references to 15 instead of 14. The files to update: standard/Frontispiece.md, standard/Getting_Started.md, standard/README.md, standard/Introduction.md, standard/appendix/Glossary.md, standard/appendix/Vendor_Evaluation_Guide.md, README.md, and index.md.
v0.2.0 candidate - Tagging this as a v0.2.0 candidate for potential promotion to normative. For now, we will keep it advisory.

…of AI Influence on Operator Decisions Adds APTS-HO-A02 as a new advisory practice in the Human Oversight domain, the second advisory in HO. Addresses a gap in existing coverage: APTS-HO-001, HO-005, HO-010, and AR-006 mandate approval gates, audit trails, and reasoning-chain capture, but none address the form of the question the operator is asked to confirm. The practical effect is that an audit trail can show "operator approved" while concealing that the operator was offered a single highlighted choice with the safer option visually de-emphasized. The advisory pairs provenance for AI-shaped operator affordances with bias mitigation at high-impact gates. The Practice Description is a four-point list ordered by implementation cost, from a single response-classification audit field through to no-preselected-default and typed-confirmation rules at HO-010 gates. Cross-file count sync from 14 to 15 advisory practices (rebased on top of OWASP#43, which brought the count to 14). No new normative requirements, no tier counts changed (173 total, 72/157/173 unchanged). The machine-readable JSON export is intentionally untouched, consistent with the existing convention that advisory practices are excluded.

jorgeraad · 2026-05-01T11:08:29Z

@jinsonvarghese thanks for taking a look! Just updated.

jinsonvarghese · 2026-05-01T13:45:59Z

Thank you. Looks good, merging.

jinsonvarghese added the v0.2.0-candidate PRs that are accepted in principle but deferred to the v0.2.0 release label May 1, 2026

jinsonvarghese assigned jorgeraad May 1, 2026

jorgeraad force-pushed the docs/ho-a02-ai-influence-disclosure branch from e6be063 to a574c8b Compare May 1, 2026 11:07

jinsonvarghese merged commit 6699143 into OWASP:main May 1, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs: Add advisory requirement APTS-HO-A02 Disclosure and Mitigation of AI Influence on Operator Decisions#45

docs: Add advisory requirement APTS-HO-A02 Disclosure and Mitigation of AI Influence on Operator Decisions#45
jinsonvarghese merged 1 commit intoOWASP:mainfrom
jorgeraad:docs/ho-a02-ai-influence-disclosure

jorgeraad commented Apr 30, 2026

Uh oh!

jinsonvarghese commented May 1, 2026

Uh oh!

jorgeraad commented May 1, 2026

Uh oh!

jinsonvarghese commented May 1, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

jorgeraad commented Apr 30, 2026

Context

What changed and why

Practice description

Affected requirements

Files changed

Uh oh!

jinsonvarghese commented May 1, 2026

Uh oh!

jorgeraad commented May 1, 2026

Uh oh!

jinsonvarghese commented May 1, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants