MAINT: Add pre-release scorer evaluation metrics by adrian-gavrila · Pull Request #1626 · microsoft/PyRIT

adrian-gavrila · 2026-04-16T22:24:22Z

Description

Updates the scorer evaluation metrics JSONL files with results from the latest pre-release evaluation run. New metric entries are
appended for the following scorers across the existing eval datasets:

These are data-only additions intended to capture baseline scorer performance ahead of the upcoming release. No code changes are included.

Tests and Documentation

No code changes — only appended JSONL metrics records produced by the existing scorer evaluation pipeline. Existing tests remain
valid; no documentation updates required. JupyText was not run as no notebooks or code samples were modified.

Update scorer eval metrics JSONL files with results from the pre-release scorer evaluation run across harm categories, objective achievement, and refusal scorers.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

varunj-msft

looks great! 😄

MAINT: Add pre-release scorer evaluation metrics

ec9e750


varunj-msft approved these changes Apr 16, 2026

adrian-gavrila merged commit e268d6d into microsoft:main Apr 17, 2026
39 checks passed

adrian-gavrila merged 1 commit into microsoft:main from adrian-gavrila:adrian-gavrila/pre-release-scorer-evaluation
