Abort rollout when RLM worker recovery fails by snimu · Pull Request #891 · PrimeIntellect-ai/verifiers

snimu · 2026-02-10T21:55:58Z

Description

Instead of returning a soft error message to the model when the worker can't be restarted, raise RLMWorkerRecoveryError (a SandboxError subclass) to cleanly abort the rollout. If recovery succeeded, the RLM is told that the REPL state is reset.

Type of Change

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
Documentation update
Test improvement

Testing

All existing tests pass when running uv run pytest locally.
New tests have been added to cover the changes

Checklist

My code follows the style guidelines of this project as outlined in AGENTS.md
I have performed a self-review of my own code
I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation
My changes generate no new warnings
Any dependent changes have been merged and published

Note

Medium Risk
Changes error-handling control flow for timeouts, which can alter rollout termination behavior and upstream exception handling, but is scoped to the experimental RLM environment.

Overview
RLM rollouts now hard-fail when worker recovery fails after a code-execution timeout.

RLMEnv._execute_code now attempts _recover_from_code_timeout() and, if restart fails, raises the new RLMWorkerRecoveryError (a vf.SandboxError subclass) instead of returning a soft error string; if recovery succeeds, the returned timeout message always states the worker was restarted and REPL state reset.

^{Written by Cursor Bugbot for commit e707c33. This will update automatically on new commits. Configure here.}

cursor

Cursor Bugbot has reviewed your changes and found 1 potential issue.

^{Bugbot Autofix is OFF. To automatically fix reported issues with Cloud Agents, enable Autofix in the Cursor dashboard.}

Instead of returning a soft error message to the model when the worker can't be restarted, raise RLMWorkerRecoveryError (a SandboxError subclass) to cleanly abort the rollout. Continuing with a dead REPL just wastes turns. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

cursor Bot reviewed Feb 10, 2026

View reviewed changes

Comment thread verifiers/envs/experimental/rlm_env.py Outdated

snimu force-pushed the sebastian/rlm-improvements-2026-02-10j branch from 573ab88 to e707c33 Compare February 10, 2026 22:01

snimu merged commit dfd5650 into main Feb 10, 2026
6 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Abort rollout when RLM worker recovery fails#891

Abort rollout when RLM worker recovery fails#891
snimu merged 1 commit into
mainfrom
sebastian/rlm-improvements-2026-02-10j

snimu commented Feb 10, 2026 •

edited by cursor Bot

Loading

Uh oh!

cursor Bot left a comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

snimu commented Feb 10, 2026 • edited by cursor Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Type of Change

Testing

Checklist

Uh oh!

cursor Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

snimu commented Feb 10, 2026 •

edited by cursor Bot

Loading