Skip to content

vf-eval: replace -d/--debug with --disable-tui, rename --tui to --fullscreen#1183

Merged
mikasenghaas merged 5 commits intomainfrom
worktree-deprecate-d-flag
Apr 19, 2026
Merged

vf-eval: replace -d/--debug with --disable-tui, rename --tui to --fullscreen#1183
mikasenghaas merged 5 commits intomainfrom
worktree-deprecate-d-flag

Conversation

@mikasenghaas
Copy link
Copy Markdown
Member

@mikasenghaas mikasenghaas commented Apr 19, 2026

Summary

Clean up the vf-eval display flags. Two separate concerns were conflated under names that made them hard to reason about:

  1. Whether to use the Rich display at all — previously -d/--debug, now -d/--disable-tui.
  2. Whether the Rich display uses the alternate screen buffer — previously -u/--tui, now -f/--fullscreen.

Changes

  • Add --disable-tui (-d) and --fullscreen (-f) as the canonical flag names.
  • Update the log-panel hint (full logs: --disable-tui) and docs table.
  • Raise SystemExit if both --disable-tui and --fullscreen are passed.

⚠️ Breaking changes

Old New
-d / --debug -d / --disable-tui
-u / --tui -f / --fullscreen

The -d short flag is preserved but now maps to --disable-tui. -u is no longer accepted — use -f/--fullscreen instead. Running vf-eval env-id --debug or vf-eval env-id --tui after this change will fail with an argparse unrecognized arguments error. Any downstream tooling or TOML configs that set debug=true on EvalConfig will also need updating.

🤖 Generated with Claude Code


Note

Medium Risk
Medium risk due to breaking CLI/TOML config surface changes (--debug/--tui removed) and updated config schema (debugdisable_tui), which can break downstream scripts and saved configs if not migrated.

Overview
Clarifies vf-eval display controls by renaming --tui to --fullscreen (alternate screen buffer) and --debug to --disable-tui (turn off Rich display and use normal logging/tqdm), and updates docs/tests accordingly.

Adds a guard in verifiers/scripts/eval.py that exits when --disable-tui and --fullscreen are both provided, and propagates the renamed setting through EvalConfig, TOML validation (eval_utils.valid_fields), env server console_logging, and the on-screen log hint text.

Reviewed by Cursor Bugbot for commit 4faaf5d. Bugbot is set up for automated code reviews on this repo. Configure here.

Rename the eval CLI's "disable Rich display" flag to the clearer
`--disable-tui`. Keep `-d`/`--debug` as a backward-compatible alias that
emits a DeprecationWarning. Mirror the rename in `EvalConfig`
(`debug` -> `disable_tui`), the log-hint subtitle, docs table, and
test fixtures. TOML configs accept either `debug` or `disable_tui`
during the deprecation window.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
Comment thread verifiers/utils/eval_utils.py
mikasenghaas and others added 2 commits April 19, 2026 00:47
…lscreen

`--tui`/`-u` was confusingly named (the Rich display is always on unless
`--disable-tui` is passed — `--tui` only controlled alternate-screen
buffering). Rename it to `--fullscreen`; keep `--tui`/`-u` as a
DeprecationWarning alias. Raise a `SystemExit` if both `--disable-tui`
and `--fullscreen` are set, since they address orthogonal concerns and
combining them is meaningless. Also rename
`run_evaluations_tui(..., tui_mode=...)` to `fullscreen=...` to match.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
Rather than shipping deprecation warnings, just break the old flags.
Users must now use `--disable-tui` (was `-d`/`--debug`) and
`--fullscreen` (was `-u`/`--tui`). Also remove the TOML `debug` fallback
in `EvalConfig` construction — TOML configs must use `disable_tui`.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
@mikasenghaas mikasenghaas changed the title vf-eval: deprecate -d/--debug in favor of --disable-tui vf-eval: replace -d/--debug with --disable-tui, rename --tui to --fullscreen Apr 19, 2026
Copy link
Copy Markdown

@cursor cursor Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Cursor Bugbot has reviewed your changes and found 1 potential issue.

There are 2 total unresolved issues (including 1 from previous review).

Fix All in Cursor

❌ Bugbot Autofix is OFF. To automatically fix reported issues with cloud agents, enable autofix in the Cursor dashboard.

Reviewed by Cursor Bugbot for commit 36dc236. Configure here.

Comment thread verifiers/scripts/eval.py
The TOML config validator's `valid_fields` set was still listing the
removed `debug` key and omitting `disable_tui`. That meant TOML configs
using the new key would fail validation, while configs still using the
old key would pass validation but be silently ignored (no longer read
by `build_eval_config`). Fix by swapping the key in `valid_fields`.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
@mikasenghaas mikasenghaas merged commit ca07020 into main Apr 19, 2026
6 checks passed
@willccbb
Copy link
Copy Markdown
Member

Actually sorry can we revert this @mikasenghaas @xeophon ? We should be careful with breaking changes for the primary UX.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants