remove redundant msg normalization + align `env_response` api by mikasenghaas · Pull Request #1027 · PrimeIntellect-ai/verifiers

mikasenghaas · 2026-03-17T10:29:03Z

Description

Reduces calls to normalize_messages to minimal set:

Once and unconditionally in init_state (this is safe, because not on hot path)
Then for all other call sites (e.g. after get_prompt_messages or env_response) we use maybe_normalize_messages which emits a warning before normalizing the message to nudge users towards using the types directly to avoid potential performance bottlenecks
Changed the env_response return type from Messages | str to just Messages to be consistent with the rest of the API (e.g. get_model_response, rollout, etc just operate on Messages and str -> vf.Messages conversion is an example of (undesired) conversion to custom types) and thus be handled the same way as list of dicts
Also fixes all our envs to use those patterns

Type of Change

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
Documentation update
Test improvement

Testing

All existing tests pass when running uv run pytest locally.
New tests have been added to cover the changes

Checklist

My code follows the style guidelines of this project as outlined in AGENTS.md
I have performed a self-review of my own code
I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation
My changes generate no new warnings
Any dependent changes have been merged and published

Additional Notes

Note

Medium Risk
Tightens the core MultiTurnEnv/Environment.get_model_response APIs to require Messages and introduces conditional normalization, which could break downstream custom envs/providers that still return raw dicts/strings or pass string prompts.

Overview
Standardizes environment and model prompting to operate on typed vf.Messages only: env_response() and get_model_response() now take/return Messages (no more str), and built-in envs (alphabet_sort, doublecheck, sentence_repeater, GymEnv) were updated to emit vf.UserMessage objects instead of raw dicts.

To cut hot-path Pydantic overhead, MultiTurnEnv now uses maybe_normalize_messages() (new helper) to only normalize when needed and logs a warning once when callers still return raw dicts/strings; supporting log_once/warning_once utilities were added, and parse_response_message() now constructs AssistantMessage directly instead of round-tripping through model_validate.

^{Written by Cursor Bugbot for commit cd87b2c. This will update automatically on new commits. Configure here.}

verifiers/envs/environment.py

willccbb

Approving in advance, but let's auto-convert or hard-fail if get_prompt_messages is overridden to return dicts. would prefer auto-convert i think, writing dicts is often the more ergonomic way to do it, but fine either way (could surface a helper fn + mention in docstring?).

verifiers/envs/environment.py

verifiers/envs/multiturn_env.py

This reverts commit 8b26a4250a581954a3edbb12268deb4b0ebeeed9.

…t types

verifiers/utils/message_utils.py

cursor

Cursor Bugbot has reviewed your changes and found 1 potential issue.

^{Bugbot Autofix is OFF. To automatically fix reported issues with cloud agents, enable autofix in the Cursor dashboard.}

verifiers/utils/logging_utils.py

mikasenghaas requested review from eligotts and willccbb March 17, 2026 10:29

mikasenghaas marked this pull request as ready for review March 17, 2026 10:34

eligotts reviewed Mar 17, 2026

View reviewed changes

verifiers/envs/environment.py Show resolved Hide resolved

willccbb approved these changes Mar 17, 2026

View reviewed changes

cursor bot reviewed Mar 18, 2026

View reviewed changes

verifiers/envs/environment.py Outdated Show resolved Hide resolved

verifiers/envs/multiturn_env.py Outdated Show resolved Hide resolved

mikasenghaas changed the title ~~remove redundant msg normalization~~ remove redundant msg normalization + align env_response api Mar 18, 2026

mikasenghaas added 10 commits March 18, 2026 12:45

remove redundant pydantic calls (mostly normalize_messages)

4e55594

keep normalizer in get_model_response

8ee5859

Revert "keep normalizer in get_model_response"

eb5a5ca

This reverts commit 8b26a4250a581954a3edbb12268deb4b0ebeeed9.

fix ty

4bb8a63

add warning and auto-normalization

1877e7b

fix ty

e219eeb

move normalization into env loop to ensure trajectory list has correc…

f7b6131

…t types

maybe_normalize_messages and updated env_response return type

e86b558

fix envs + api

f1dbfcb

style

559278a

cursor bot reviewed Mar 18, 2026

View reviewed changes

verifiers/utils/message_utils.py Show resolved Hide resolved

log once utils

162b0ac

mikasenghaas force-pushed the pydantic-normalization branch from 3b62a82 to 162b0ac Compare March 18, 2026 13:06

cursor bot reviewed Mar 18, 2026

View reviewed changes

verifiers/utils/logging_utils.py Show resolved Hide resolved

mikasenghaas added 2 commits March 18, 2026 13:39

address bugbot

74b0b78

fix ty

cd87b2c

eligotts approved these changes Mar 18, 2026

View reviewed changes

mikasenghaas merged commit b625dfb into main Mar 19, 2026
6 checks passed

mikasenghaas mentioned this pull request Mar 19, 2026

chore: bump verifiers (1960e77) PrimeIntellect-ai/prime-rl#2051

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

remove redundant msg normalization + align `env_response` api#1027

remove redundant msg normalization + align `env_response` api#1027
mikasenghaas merged 13 commits intomainfrom
pydantic-normalization

mikasenghaas commented Mar 17, 2026 •

edited by cursor bot

Loading

Uh oh!

Uh oh!

willccbb left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

cursor bot left a comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

mikasenghaas commented Mar 17, 2026 • edited by cursor bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Type of Change

Testing

Checklist

Additional Notes

Uh oh!

Uh oh!

willccbb left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

cursor bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

mikasenghaas commented Mar 17, 2026 •

edited by cursor bot

Loading