
Add gpt5 benchmark tool preset support#685

Merged
enyst merged 2 commits into main from fix/gpt5-tool-preset
Apr 21, 2026
Conversation


@enyst enyst commented Apr 21, 2026

Summary

  • add gpt5 to the benchmark tool preset enum and common CLI choices
  • route benchmark tool-preset dispatch to the existing GPT-5 preset implementation
  • add focused coverage for parser acceptance and preset dispatch helpers
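
The enum addition and preset dispatch described above might look roughly like the sketch below. All identifiers here (`ToolPreset`, `get_gpt5_tools`, `resolve_tool_preset`, the tool names) are hypothetical stand-ins; the actual names live in `benchmarks/utils/models.py` and the SDK and may differ.

```python
from enum import Enum


class ToolPreset(str, Enum):
    # Hypothetical benchmark-side preset enum; "gpt5" is the value
    # this PR adds alongside the existing choices.
    DEFAULT = "default"
    GPT5 = "gpt5"


def get_gpt5_tools() -> list[str]:
    # Stand-in for the existing GPT-5 preset implementation in
    # software-agent-sdk; the returned tool names are illustrative.
    return ["bash", "str_replace_editor", "browser"]


def resolve_tool_preset(preset: ToolPreset) -> list[str]:
    # Dispatch the benchmark tool-preset to its implementation,
    # routing the new enum value to the existing GPT-5 preset.
    if preset is ToolPreset.GPT5:
        return get_gpt5_tools()
    return ["bash", "str_replace_editor"]
```

Because the enum subclasses `str`, its values can feed CLI choices directly, e.g. `choices=[p.value for p in ToolPreset]`.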

Why

The GPT-5 preset already exists in software-agent-sdk.

This PR adds the benchmark-side support needed to accept --tool-preset gpt5 so we can run evals with that preset.
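
Accepting the new value on the CLI side might look like this minimal argparse sketch (the real parser lives in `benchmarks/utils/args_parser.py`; the flag spelling matches the PR, but the choice list and default here are assumptions):

```python
import argparse

# Minimal sketch: add "gpt5" to the --tool-preset choices so the
# benchmark runners accept it. Other choices shown are illustrative.
parser = argparse.ArgumentParser()
parser.add_argument(
    "--tool-preset",
    choices=["default", "gpt5"],
    default="default",
)

args = parser.parse_args(["--tool-preset", "gpt5"])
```

argparse converts the dashed flag to the attribute `args.tool_preset`, which the run scripts can then hand to the preset dispatch.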

Testing

  • uv run pytest tests/test_tool_presets.py -q
  • uv run pre-commit run --files benchmarks/utils/args_parser.py benchmarks/utils/models.py benchmarks/swebench/run_infer.py benchmarks/swebenchmultilingual/run_infer.py benchmarks/hybridgym_funclocalize/run_infer.py benchmarks/hybridgym_depsearch/run_infer.py benchmarks/hybridgym_funcgen/run_infer.py benchmarks/hybridgym_issuelocalize/run_infer.py tests/test_tool_presets.py

This PR was created by an AI assistant (OpenHands) on behalf of the user.

Co-authored-by: openhands <openhands@all-hands.dev>

@all-hands-bot all-hands-bot left a comment


Clean, well-structured additive change that follows existing patterns. Tests are appropriate and focused.

Co-authored-by: openhands <openhands@all-hands.dev>
@enyst enyst merged commit fac87d1 into main Apr 21, 2026
2 checks passed
@enyst enyst deleted the fix/gpt5-tool-preset branch April 21, 2026 14:48
