FEAT: Use registry-based default objective scorer in scenarios by rlundeen2 · Pull Request #1528 · microsoft/PyRIT

rlundeen2 · 2026-03-21T18:05:29Z

Centralizes the default objective scorer into Scenario._get_default_objective_scorer(), replacing duplicated hardcoded scorer construction across ContentHarms, Jailbreak, and RedTeamAgent. The base class method checks the ScorerRegistry for a scorer tagged DEFAULT_OBJECTIVE_SCORER (the best F1 scorer from initialization), falling back to TrueFalseInverterScorer(SelfAskRefusalScorer(OpenAIChatTarget())).

Note, we also run scorer metrics from the registry. Because of this, there was a mismatch between the Scenario default scorers and the metrics. Now, these match so metrics are reported.

pyrit/scenario/core/scenario.py

pyrit/scenario/scenarios/airt/content_harms.py

pyrit/scenario/scenarios/foundry/red_team_agent.py

pyrit/setup/initializers/components/scorers.py

pyrit/scenario/scenarios/airt/jailbreak.py

pyrit/setup/initializers/components/scorers.py

nina-msft · 2026-03-23T19:56:52Z

Assuming we're only updating content_harms, jailbreaks, and foundry scenarios to use the default objective scorer from core because they most closely resemble that type of scorer - but all other scenarios have custom handling for the objective scorer that may rely on the user having a "gpt-4o-unsafe" model.

I see value in removing duplicate code from the changes in this PR and also making it so that we can use metrics-informed scorers from the registry when possible for these scenarios...but for other scenarios we're not getting that same benefit. Are there follow-up stories we need to make so all scenarios are covered?

doc/code/scenarios/1_configuring_scenarios.ipynb

nina-msft

I think this is a net-positive change, we may want to consider follow up stories for other scenarios (based off of my comment below) so that we can offer this advantage across the board for scenarios as it makes sense.

Most of the comments look like nits but all are small improvements - please take if no objections :)

pyrit/scenario/core/scenario.py

pyrit/setup/initializers/components/targets.py

…corer

…soft#1528)

rlundeen2 added 4 commits March 20, 2026 14:12

Using scorer registry scorers by default in scenarios

a29e017

adding tests and fixing up

418089d

re-running notebook and pre-commit

25707fa

updating default conf

8da7132

nina-msft reviewed Mar 23, 2026

View reviewed changes

pyrit/scenario/core/scenario.py Outdated Show resolved Hide resolved

nina-msft reviewed Mar 23, 2026

View reviewed changes

pyrit/scenario/scenarios/airt/content_harms.py Show resolved Hide resolved

hannahwestra25 reviewed Mar 23, 2026

View reviewed changes

pyrit/scenario/scenarios/foundry/red_team_agent.py Show resolved Hide resolved

hannahwestra25 reviewed Mar 23, 2026

View reviewed changes

pyrit/setup/initializers/components/scorers.py Outdated Show resolved Hide resolved

nina-msft reviewed Mar 23, 2026

View reviewed changes

pyrit/scenario/scenarios/airt/jailbreak.py Outdated Show resolved Hide resolved

jsong468 reviewed Mar 23, 2026

View reviewed changes

pyrit/setup/initializers/components/scorers.py Show resolved Hide resolved

nina-msft reviewed Mar 23, 2026

View reviewed changes

doc/code/scenarios/1_configuring_scenarios.ipynb Show resolved Hide resolved

nina-msft approved these changes Mar 23, 2026

View reviewed changes

jsong468 reviewed Mar 23, 2026

View reviewed changes

pyrit/scenario/core/scenario.py Show resolved Hide resolved

jsong468 reviewed Mar 23, 2026

View reviewed changes

pyrit/setup/initializers/components/targets.py Show resolved Hide resolved

jsong468 reviewed Mar 23, 2026

View reviewed changes

pyrit/setup/initializers/components/targets.py Show resolved Hide resolved

rlundeen2 added 2 commits March 24, 2026 10:19

pr feedback

3ae1d16

Merge branch 'main' into users/rlundeen/2026_03_20_scenario_default_s…

b17f031

…corer

rlundeen2 merged commit d0d90fa into microsoft:main Mar 24, 2026
35 checks passed

jbolor21 pushed a commit to jbolor21/jbolor-PyRIT that referenced this pull request Mar 25, 2026

FEAT: Use registry-based default objective scorer in scenarios (micro…

91aba05

…soft#1528)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

FEAT: Use registry-based default objective scorer in scenarios#1528

FEAT: Use registry-based default objective scorer in scenarios#1528
rlundeen2 merged 6 commits intomicrosoft:mainfrom
rlundeen2:users/rlundeen/2026_03_20_scenario_default_scorer

rlundeen2 commented Mar 21, 2026 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

nina-msft commented Mar 23, 2026 •

edited

Loading

Uh oh!

Uh oh!

nina-msft left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

rlundeen2 commented Mar 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

nina-msft commented Mar 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

nina-msft left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

rlundeen2 commented Mar 21, 2026 •

edited

Loading

nina-msft commented Mar 23, 2026 •

edited

Loading