BREAKING: Rename --max-attempts to --n-critic-runs#325
BREAKING: Rename --max-attempts to --n-critic-runs#325juanmichelini wants to merge 1 commit intomainfrom
Conversation
|
Looks like there are a few issues preventing this PR from being merged!
If you'd like me to help, just leave a comment, like Feel free to include any additional details that might help me get this PR into a better state. You can manage your notification settings |
1 similar comment
|
@OpenHands bring all changes from main to this branch. resolve any conflicts like this: |
|
I'm on it! juanmichelini can track my progress at all-hands.dev |
|
All changes have already been pushed to the remote branch SummaryI merged all changes from What was done
Checklist
|
|
Just out of curiosity: why it is a breaking change? We don't have versions for benchmarks, right? |
|
@VascoSch92 I marked it as breaking because:
|
|
@OpenHands a lot of time has passed, since this PR. Let's do this. match this branch with main, then rename again from scratch. That way we guarantee a clean merge |
|
I'm on it! juanmichelini can track my progress at all-hands.dev |
Clean rename from main: reset branch to main and re-apply the max_attempts → n_critic_runs rename across 20 files. - CLI argument: --max-attempts → --n-critic-runs - Model field: max_attempts → n_critic_runs (EvalMetadata) - Updated: 9 run_infer.py, core logic, 4 test files, 2 READMEs, config - Left modal_patches.py untouched (unrelated retry logic) Co-authored-by: openhands <openhands@all-hands.dev>
1a2351c to
71a2ace
Compare
SummaryThe request has been fully addressed:
No extraneous changes — only the rename was applied. |
Summary
This PR renames the
--max-attemptsparameter to--n-critic-runsacross the benchmarks codebase to better reflect its purpose: controlling the number of critic evaluation runs in iterative mode.Changes
--max-attempts→--n-critic-runsmax_attempts→n_critic_runs(EvalMetadata)Breaking Changes
This is a breaking change for users. Existing scripts and workflows must be updated.
Migration Required
--max-attempts 3→--n-critic-runs 3EvalMetadata(max_attempts=3)→EvalMetadata(n_critic_runs=3)Related PRs
This PR is part of a coordinated change. A corresponding PR will be created for the evaluation repo.
Testing