vf-eval: replace -d/--debug with --disable-tui, rename --tui to --fullscreen by mikasenghaas · Pull Request #1183 · PrimeIntellect-ai/verifiers

mikasenghaas · 2026-04-19T00:29:53Z

Summary

Clean up the vf-eval display flags. Two separate concerns were conflated under names that made them hard to reason about:

Whether to use the Rich display at all — previously -d/--debug, now -d/--disable-tui.
Whether the Rich display uses the alternate screen buffer — previously -u/--tui, now -f/--fullscreen.

Changes

Add --disable-tui (-d) and --fullscreen (-f) as the canonical flag names.
Update the log-panel hint (full logs: --disable-tui) and docs table.
Raise SystemExit if both --disable-tui and --fullscreen are passed.
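The flag layout and the conflict guard listed above can be sketched with plain `argparse`. This is a minimal illustration, not the code in `verifiers/scripts/eval.py`: the function names `build_parser` and `parse_display_flags`, the help strings, and the error message are assumptions made for the example.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    # Hypothetical stand-in for the vf-eval parser; only the two display flags are modeled.
    parser = argparse.ArgumentParser(prog="vf-eval")
    parser.add_argument("env_id")
    parser.add_argument(
        "-d", "--disable-tui", action="store_true",
        help="turn off the Rich display",
    )
    parser.add_argument(
        "-f", "--fullscreen", action="store_true",
        help="render the Rich display in the alternate screen buffer",
    )
    return parser


def parse_display_flags(argv: list[str]) -> argparse.Namespace:
    args = build_parser().parse_args(argv)
    # Fullscreen only makes sense when the Rich display is enabled,
    # so passing both flags is rejected up front.
    if args.disable_tui and args.fullscreen:
        raise SystemExit("--disable-tui and --fullscreen cannot be combined")
    return args
```

In this sketch, `parse_display_flags(["env-id", "-d", "-f"])` raises `SystemExit`, while either flag on its own parses normally.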

⚠️ Breaking changes

Old	New
`-d` / `--debug`	`-d` / `--disable-tui`
`-u` / `--tui`	`-f` / `--fullscreen`

The -d short flag is preserved but now maps to --disable-tui. -u is no longer accepted — use -f/--fullscreen instead. Running vf-eval env-id --debug or vf-eval env-id --tui after this change will fail with an argparse unrecognized arguments error. Any downstream tooling or TOML configs that set debug=true on EvalConfig will also need updating.
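For downstream scripts that build a `vf-eval` command line programmatically, the mapping in the table above can be applied mechanically. The helper below is a hypothetical migration aid, not part of verifiers; `OLD_TO_NEW` and `migrate_argv` are names invented for this sketch, and it only rewrites exact, standalone tokens.

```python
# Old flag spellings and their replacements, taken from the breaking-changes table.
# -d is absent because the short flag itself is unchanged.
OLD_TO_NEW = {
    "--debug": "--disable-tui",
    "--tui": "--fullscreen",
    "-u": "-f",
}


def migrate_argv(argv: list[str]) -> list[str]:
    """Return a copy of argv with removed flag spellings replaced.

    Assumption: flags appear as separate tokens. Combined short flags
    (for example "-du") and option values that happen to equal an old
    flag are not handled.
    """
    return [OLD_TO_NEW.get(arg, arg) for arg in argv]
```

For example, `migrate_argv(["env-id", "--tui"])` returns `["env-id", "--fullscreen"]`, and arguments that were already valid pass through untouched.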

🤖 Generated with Claude Code

Note

Medium Risk
Medium risk due to breaking CLI/TOML config surface changes (--debug/--tui removed) and updated config schema (debug→disable_tui), which can break downstream scripts and saved configs if not migrated.

Overview
Clarifies vf-eval display controls by renaming --tui to --fullscreen (alternate screen buffer) and --debug to --disable-tui (turn off Rich display and use normal logging/tqdm), and updates docs/tests accordingly.

Adds a guard in verifiers/scripts/eval.py that exits when --disable-tui and --fullscreen are both provided, and propagates the renamed setting through EvalConfig, TOML validation (eval_utils.valid_fields), env server console_logging, and the on-screen log hint text.

Reviewed by Cursor Bugbot for commit 4faaf5d. Bugbot is set up for automated code reviews on this repo.

Rename the eval CLI's "disable Rich display" flag to the clearer `--disable-tui`. Keep `-d`/`--debug` as a backward-compatible alias that emits a DeprecationWarning. Mirror the rename in `EvalConfig` (`debug` -> `disable_tui`), the log-hint subtitle, docs table, and test fixtures. TOML configs accept either `debug` or `disable_tui` during the deprecation window. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

…lscreen `--tui`/`-u` was confusingly named (the Rich display is always on unless `--disable-tui` is passed — `--tui` only controlled alternate-screen buffering). Rename it to `--fullscreen`; keep `--tui`/`-u` as a DeprecationWarning alias. Raise a `SystemExit` if both `--disable-tui` and `--fullscreen` are set, since they address orthogonal concerns and combining them is meaningless. Also rename `run_evaluations_tui(..., tui_mode=...)` to `fullscreen=...` to match. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
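The deprecation-alias approach described in these two commit messages can be sketched with a custom `argparse.Action`. This is an illustration under assumptions, not the code that was committed: the `DeprecatedAlias` class, the `replacement` keyword, the `build_parser` function, and the warning text are all hypothetical, and the `-d` short flag is left out for brevity.

```python
import argparse
import warnings


class DeprecatedAlias(argparse.Action):
    """Flag that sets dest to True and warns that a newer spelling exists."""

    def __init__(self, option_strings, dest, replacement, **kwargs):
        # nargs=0 makes the option consume no value, like store_true.
        super().__init__(option_strings, dest, nargs=0, default=False, **kwargs)
        self.replacement = replacement

    def __call__(self, parser, namespace, values, option_string=None):
        warnings.warn(
            f"{option_string} is deprecated; use {self.replacement}",
            DeprecationWarning,
            stacklevel=2,
        )
        setattr(namespace, self.dest, True)


def build_parser() -> argparse.ArgumentParser:
    parser = argparse.ArgumentParser(prog="vf-eval")
    parser.add_argument("--disable-tui", dest="disable_tui", action="store_true")
    parser.add_argument("--fullscreen", dest="fullscreen", action="store_true")
    # Old spellings write to the same dest as their replacements.
    parser.add_argument("--debug", dest="disable_tui",
                        action=DeprecatedAlias, replacement="--disable-tui")
    parser.add_argument("--tui", "-u", dest="fullscreen",
                        action=DeprecatedAlias, replacement="--fullscreen")
    return parser
```

With this layout, old invocations keep working during a deprecation window and only emit a `DeprecationWarning`.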

Rather than shipping deprecation warnings, just break the old flags. Users must now use `--disable-tui` (was `-d`/`--debug`) and `--fullscreen` (was `-u`/`--tui`). Also remove the TOML `debug` fallback in `EvalConfig` construction — TOML configs must use `disable_tui`. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

cursor

Cursor Bugbot has reviewed your changes and found 1 potential issue.

There are 2 total unresolved issues (including 1 from previous review).

Reviewed by Cursor Bugbot for commit 36dc236.

The TOML config validator's `valid_fields` set was still listing the removed `debug` key and omitting `disable_tui`. That meant TOML configs using the new key would fail validation, while configs still using the old key would pass validation but be silently ignored (no longer read by `build_eval_config`). Fix by swapping the key in `valid_fields`. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
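The failure mode described in this commit message can be reproduced with a small key-set check. This sketch does not reproduce the real `eval_utils` validator: the field sets are trimmed to two entries, and `env_id` as a config key plus the `unknown_keys` helper are assumptions made for illustration.

```python
# Allow-list as it stood before the fix: still names the removed key.
STALE_VALID_FIELDS = {"env_id", "debug"}
# Allow-list after the fix: the key is swapped for its replacement.
FIXED_VALID_FIELDS = {"env_id", "disable_tui"}


def unknown_keys(config: dict, valid_fields: set[str]) -> list[str]:
    """Return the config keys that the allow-list does not recognize."""
    return sorted(set(config) - valid_fields)


new_style = {"env_id": "my-env", "disable_tui": True}
old_style = {"env_id": "my-env", "debug": True}

# Before the fix: the new key is rejected and the old key slips through.
stale_rejects_new = unknown_keys(new_style, STALE_VALID_FIELDS)
stale_accepts_old = unknown_keys(old_style, STALE_VALID_FIELDS)

# After the fix: the new key is accepted and the old key is reported.
fixed_accepts_new = unknown_keys(new_style, FIXED_VALID_FIELDS)
fixed_rejects_old = unknown_keys(old_style, FIXED_VALID_FIELDS)
```

Reporting the stale `debug` key as unknown is what turns a silently ignored setting into a visible validation error.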

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

willccbb · 2026-04-20T02:05:10Z

Actually sorry can we revert this @mikasenghaas @xeophon ? We should be careful with breaking changes for the primary UX.

cursor Bot reviewed Apr 19, 2026

Comment thread verifiers/utils/eval_utils.py

mikasenghaas and others added 2 commits April 19, 2026 00:47

mikasenghaas changed the title ~~vf-eval: deprecate -d/--debug in favor of --disable-tui~~ vf-eval: replace -d/--debug with --disable-tui, rename --tui to --fullscreen Apr 19, 2026

cursor Bot reviewed Apr 19, 2026

Comment thread verifiers/scripts/eval.py

mikasenghaas requested review from willccbb and xeophon April 19, 2026 00:54

vf-eval: add -f/-d short flags for --fullscreen/--disable-tui

4faaf5d

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

xeophon approved these changes Apr 19, 2026

mikasenghaas merged commit ca07020 into main Apr 19, 2026
6 checks passed

willccbb mentioned this pull request Apr 20, 2026

Revert "vf-eval: replace -d/--debug with --disable-tui, rename --tui to --fullscreen" #1200

Closed

snimu mentioned this pull request Apr 22, 2026

chore: v0.1.13.dev4 dev release #1227

Merged

mikasenghaas merged 5 commits into main from worktree-deprecate-d-flag

3 participants
