Keep harness metrics merge inside experimental composable env by snimu · Pull Request #1201 · PrimeIntellect-ai/verifiers

snimu · 2026-04-20T10:13:06Z

Description

A previous PR touched core verifiers logic (in non-harmful ways, but still). This PR undoes those changes and solves the problem they were meant to address fully inside experimental.

Type of Change

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
Documentation update
Test improvement

Testing

All existing tests pass when running uv run pytest locally.
New tests have been added to cover the changes

Checklist

My code follows the style guidelines of this project as outlined in AGENTS.md
I have performed a self-review of my own code
I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation
My changes generate no new warnings
Any dependent changes have been merged and published

Additional Notes

Note

Medium Risk
Changes how rewards are aggregated in RubricGroup by no longer coercing None rewards to 0.0, which could surface type errors if any rubric returns None. The new harness-metrics merge runs during rubric cleanup in the experimental env and should be low impact outside that path.

Overview
Moves harness metrics merging fully into the experimental ComposableEnv by replacing the standalone HarnessMetricsRubric (and its @cleanup decorator) with a HarnessMetricsRubricGroup that runs child rubric cleanup then folds _harness_metrics into state["metrics"].

When harness.metrics_path is set, ComposableEnv now wraps the existing rubric (or existing RubricGroup) inside this new group so metrics are merged without touching core scoring flow.
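The wrap-then-merge flow described above can be sketched roughly as follows. This is a minimal, self-contained illustration, not the verifiers implementation: `ToyRubric`, `ToyHarnessMetricsGroup`, and `wrap_rubric` are hypothetical stand-ins, and letting harness values win on a key collision is an assumption the PR text does not confirm.

```python
import asyncio
from typing import Any, Optional

State = dict[str, Any]


class ToyRubric:
    """Hypothetical stand-in for a rubric; not the verifiers Rubric class."""

    async def cleanup(self, state: State) -> None:
        # Record that this rubric's cleanup ran.
        state.setdefault("cleaned_by", []).append(type(self).__name__)


class ToyHarnessMetricsGroup:
    """Hypothetical group: run child cleanup, then fold harness metrics in."""

    def __init__(self, rubrics: list[ToyRubric]) -> None:
        self.rubrics = rubrics

    async def cleanup(self, state: State) -> None:
        # 1. Child rubrics clean up first.
        for rubric in self.rubrics:
            await rubric.cleanup(state)
        # 2. Merge _harness_metrics into state["metrics"].
        #    Assumption: harness values overwrite existing keys.
        metrics = dict(state.get("metrics") or {})
        metrics.update(state.get("_harness_metrics") or {})
        state["metrics"] = metrics


def wrap_rubric(rubric: ToyRubric, metrics_path: Optional[str]):
    """Wrap only when a metrics path is configured (hypothetical helper)."""
    if metrics_path is None:
        return rubric
    return ToyHarnessMetricsGroup([rubric])


state: State = {
    "metrics": {"reward_fn": 1.0},
    "_harness_metrics": {"tool_calls": 3.0},
}
group = wrap_rubric(ToyRubric(), metrics_path="metrics.json")
asyncio.run(group.cleanup(state))
```

Because the merge lives in the wrapper's cleanup step, the wrapped rubric's scoring path is left unchanged in this sketch, which is the property the PR is after.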

Separately, RubricGroup no longer normalizes None rewards to 0.0 during score_rollout/score_group, relying on rubrics to always set numeric rewards.
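To make the behavioral difference concrete, here is a small sketch contrasting the two aggregation styles. It does not reproduce the actual RubricGroup code; `sum_coercing` and `sum_strict` are hypothetical helpers that only illustrate what happens when a rubric leaves its reward as None.

```python
from typing import Optional


def sum_coercing(rewards: list[Optional[float]]) -> float:
    """Earlier style: a missing reward is treated as 0.0."""
    return sum(r if r is not None else 0.0 for r in rewards)


def sum_strict(rewards: list[Optional[float]]) -> float:
    """Current style: rewards are added as-is, so None raises TypeError."""
    total = 0.0
    for r in rewards:
        total += r  # fails loudly if a rubric left its reward unset
    return total


rewards = [0.5, None, 0.25]
coerced = sum_coercing(rewards)

try:
    sum_strict(rewards)
    strict_error = None
except TypeError as exc:
    strict_error = exc
```

With numeric-only rewards both helpers agree; they diverge only when a None slips through, which is the case the risk note flags.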

Reviewed by Cursor Bugbot for commit 1ae31b4. Bugbot is set up for automated code reviews on this repo.

Keep harness metrics merge inside experimental composable env

1ae31b4

snimu merged commit 0c1b133 into main Apr 20, 2026
8 of 9 checks passed

snimu mentioned this pull request Apr 22, 2026

chore: v0.1.13.dev4 dev release #1227

Merged

snimu merged 1 commit into main from sebastian/undo-rubrics-changes-2026-04-20
