Detect chevron-style interactive prompts and fix indefinite foreground hang on unrecognised prompts by meganrogge · Pull Request #313244 · microsoft/vscode

meganrogge · 2026-04-29T14:41:34Z

Problem

When the agent runs a sync (foreground) command via run_in_terminal and the command launches an interactive prompt from a library like prompts (used by vitest), enquirer, or inquirer, the agent does not detect that input is required and the call hangs indefinitely.

Why it hangs forever

The foreground run_in_terminal path races four candidates: process exit, continue-in-background, user-specified timeout (default: none), and onDidDetectInputNeeded. When the process is alive waiting for input:

- The OutputMonitor polling loop sees isActive === true and only transitions to Idle if detectsInputRequiredPattern matches the cursor line.
- Without a matching pattern, the monitor keeps polling until its internal 2-minute extended timeout, then sets state to Cancelled.
- But Cancelled does not resolve any of the foreground race candidates. The process is still alive, no user timeout was specified, and onDidDetectInputNeeded was never fired, so the race hangs forever.
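
The hang can be illustrated with a small model of the foreground race. This is a hypothetical sketch, not the actual run_in_terminal code: the names ForegroundSnapshot and firstResolvedCandidate are invented here, and the fields simply mirror the four candidates listed above.

```typescript
// Hypothetical model of the foreground race; all names are illustrative only.
type RaceCandidate = 'processExit' | 'continueInBackground' | 'userTimeout' | 'inputNeeded';

interface ForegroundSnapshot {
	processExited: boolean;
	continuedInBackground: boolean;
	/** undefined models the default of "no user-specified timeout". */
	userTimeoutMs: number | undefined;
	elapsedMs: number;
	inputNeededFired: boolean;
}

// Returns a candidate that would settle the race, or undefined if none can.
// The check order is arbitrary; a real race settles on whichever fires first.
function firstResolvedCandidate(s: ForegroundSnapshot): RaceCandidate | undefined {
	if (s.processExited) { return 'processExit'; }
	if (s.continuedInBackground) { return 'continueInBackground'; }
	if (s.userTimeoutMs !== undefined && s.elapsedMs >= s.userTimeoutMs) { return 'userTimeout'; }
	if (s.inputNeededFired) { return 'inputNeeded'; }
	return undefined;
}

// An unrecognised prompt, three minutes in: the monitor has already moved to
// Cancelled, but Cancelled is not a race candidate, so nothing resolves.
const stuck: ForegroundSnapshot = {
	processExited: false,
	continuedInBackground: false,
	userTimeoutMs: undefined,
	elapsedMs: 180000,
	inputNeededFired: false,
};
console.log(firstResolvedCandidate(stuck)); // undefined

// The same snapshot once the input-needed event has fired.
console.log(firstResolvedCandidate({ ...stuck, inputNeededFired: true })); // inputNeeded
```

In this model the only way out of the stuck snapshot, short of the process exiting or the user intervening, is for the input-needed event to fire.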

Fix

1. Chevron prompt pattern

Add a new pattern in detectsHighConfidenceInputPattern (the fast-path checked on every poll tick) that matches lines starting with ? (after optional whitespace/ANSI escapes) and ending with a prompt-library chevron glyph (› U+203A, ❯ U+276F, ▸ U+25B8, ▶ U+25B6). Anchoring ? to the start of the line matches the canonical prompts/enquirer/inquirer rendering and avoids false positives from incidental ? + chevron combinations in normal output (e.g. What happened? ›).
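
A pattern along these lines might look like the sketch below. The exact regex merged in the PR is not shown in this description, so CHEVRON_PROMPT and looksLikeChevronPrompt are hypothetical names, and the ANSI sub-pattern here only covers simple CSI sequences such as colour codes.

```typescript
// Hypothetical sketch; not the regex from outputMonitor.ts.
//   ^(?:\s|ESC[ params final)*  optional leading whitespace / simple ANSI CSI escapes
//   \?                          the canonical prompt prefix
//   .*[›❯▸▶]\s*$                anything, then a chevron glyph and optional trailing whitespace
const CHEVRON_PROMPT = /^(?:\s|\x1b\[[0-9;]*[A-Za-z])*\?.*[›❯▸▶]\s*$/;

function looksLikeChevronPrompt(cursorLine: string): boolean {
	return CHEVRON_PROMPT.test(cursorLine);
}

// Canonical prompt, with and without an ANSI-coloured '?' prefix.
console.log(looksLikeChevronPrompt('? Do you want to install jsdom? ›')); // true
console.log(looksLikeChevronPrompt('\x1b[36m?\x1b[39m Pick a color ❯ ')); // true

// Mid-line '?', an already-typed response, and a chevron with no leading '?'.
console.log(looksLikeChevronPrompt('What happened? ›'));     // false
console.log(looksLikeChevronPrompt('? Pick a color › red')); // false
console.log(looksLikeChevronPrompt('❯ src/index.ts'));       // false
```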

2. Extended timeout safety net

When the 2-minute extended polling timeout fires, fire onDidDetectInputNeeded before cancelling. This is the critical fix for the indefinite hang — it resolves the foreground race so the agent receives the terminal output and can assess/respond.
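
The ordering of "fire, then cancel" can be sketched as follows. SketchMonitor is a hypothetical stand-in for the real OutputMonitor: its poll method is driven manually so that no real timers are needed, and the handling of an already exited process is an assumption based on the decision matrix in this description.

```typescript
// Hypothetical sketch of the safety net; this is not the real OutputMonitor.
type MonitorState = 'Polling' | 'Idle' | 'Cancelled';

class SketchMonitor {
	state: MonitorState = 'Polling';
	private listeners: Array<() => void> = [];
	private extendedTimeoutMs: number;

	constructor(extendedTimeoutMs: number) {
		this.extendedTimeoutMs = extendedTimeoutMs;
	}

	onDidDetectInputNeeded(listener: () => void): void {
		this.listeners.push(listener);
	}

	// One poll tick, called by hand here instead of from a timer.
	poll(elapsedMs: number, isActive: boolean, promptDetected: boolean): void {
		if (this.state !== 'Polling') { return; }
		if (!isActive) {
			// Assumption: an exited process is caught before the timeout, no event fired.
			this.state = 'Idle';
			return;
		}
		if (promptDetected) {
			this.fireInputNeeded();
			this.state = 'Idle';
			return;
		}
		if (elapsedMs >= this.extendedTimeoutMs) {
			// The fix: fire the event BEFORE cancelling so the foreground race can resolve.
			this.fireInputNeeded();
			this.state = 'Cancelled';
		}
	}

	private fireInputNeeded(): void {
		for (const listener of this.listeners) { listener(); }
	}
}

const monitor = new SketchMonitor(120000); // 2-minute extended timeout
let fired = 0;
monitor.onDidDetectInputNeeded(() => { fired++; });

monitor.poll(60000, true, false);  // still polling, nothing fired yet
monitor.poll(120000, true, false); // extended timeout reached with an active process
console.log(monitor.state, fired); // Cancelled 1
```

Because the event also fires for a process that is merely slow, a listener in this sketch cannot assume that input is actually required; it can only treat the event as a cue to inspect the output.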

3. Soften notification language

Changed the background steering message from "command is waiting for input" to "command may be waiting for input", and the UI label from "needs input" to "may need input". This is important because the event now also fires on extended timeout where the process might just be slow rather than actually waiting for input. The steering text already instructs the agent to assess the output before acting.

Decision matrix: extended timeout behavior

Scenario	`isActive`	Without firing event	With firing event
Unrecognised prompt (fg)	`true`	❌ Hangs forever — no race candidate resolves	✅ Agent gets output, can assess and respond
Unrecognised prompt (bg)	`true`	❌ Silently cancelled, agent never sees it	✅ Steering message sent, agent assesses output
Long-running command (fg) — e.g. 3min build	`true`	⚠️ Also hangs forever (same bug, different cause)	✅ Agent told "may need input" — steering text says "call GetTerminalOutput to continue polling", so agent polls and build finishes normally
Long-running command (bg)	`true`	Silent cancel at 2min	⚠️ Benign steering message — agent sees output, assesses "still running", continues polling
Already exited process	`false`	N/A — caught before timeout	Same

Tests

- Chevron pattern tests: each chevron variant (›, ❯, ▸, ▶) preceded by ? with and without trailing whitespace; ANSI escape-prefixed prompt lines; negatives for already-typed responses, chevrons without leading ?, and mid-line ? with chevron
- Extended timeout test: verifies onDidDetectInputNeeded fires when extended timeout expires with an active process

Fixes #312576

Copilot

Pull request overview

This PR improves prompt detection in the terminal run_in_terminal tool so agent executions don't hang indefinitely when common Node prompt libraries (e.g. prompts/enquirer/inquirer) render interactive questions that end with chevron-style glyphs.

Changes:

- Added a chevron-glyph input detection regex intended to recognize ? ... <chevron> prompt lines.
- Added unit tests covering multiple chevron variants and basic positive/negative cases.

Show a summary per file

File	Description
src/vs/workbench/contrib/terminalContrib/chatAgentTools/browser/tools/monitoring/outputMonitor.ts	Adds a chevron-based regex to the input-needed detection patterns.
src/vs/workbench/contrib/terminalContrib/chatAgentTools/test/browser/outputMonitor.test.ts	Adds tests for chevron-style interactive prompts and a few negative cases.

Copilot's findings

Comments suppressed due to low confidence (1)

src/vs/workbench/contrib/terminalContrib/chatAgentTools/browser/tools/monitoring/outputMonitor.ts:616

The new chevron prompt regex (/\?.*[›❯▸▶]\s*$/) only requires a ? anywhere on the line, but the comment/PR description says these libraries prefix prompts with ? (at the start). As written, a non-prompt line like "What happened? ›" would be treated as input-required if it happens to end with one of these glyphs. Consider tightening the pattern to require the canonical prompt prefix (e.g. start-of-line ? + whitespace) so it matches the intended rendering and reduces false positives.

		// Interactive prompt libraries (prompts, enquirer, inquirer) prefix the prompt with
		// '? ' and end the line with a distinctive chevron character followed by optional
		// trailing whitespace where the cursor is awaiting input. Requiring a '?' earlier
		// on the line avoids false positives from random output that happens to contain a
		// chevron (e.g. git log decorations).
		// Examples:
		//   "? Do you want to install jsdom? <chevron>"  (prompts)
		//   "? Pick a color <chevron> "                  (inquirer / enquirer)
		// allow-any-unicode-next-line
		/\?.*[›❯▸▶]\s*$/,
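
The false positive described in this finding can be reproduced with the quoted pattern. The snippet below is a small check; the variable name loosePattern and the sample lines other than "What happened? ›" are chosen here for illustration.

```typescript
// The pattern exactly as quoted in the review comment above.
const loosePattern = /\?.*[›❯▸▶]\s*$/;

// A genuine prompt line matches, as intended.
console.log(loosePattern.test('? Do you want to install jsdom? ›')); // true

// Ordinary output with a mid-line '?' and a trailing chevron also matches,
// which is the false positive the comment warns about.
console.log(loosePattern.test('What happened? ›')); // true

// A trailing chevron with no '?' earlier on the line does not match.
console.log(loosePattern.test('main ❯')); // false
```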

Files reviewed: 2/2 changed files
Comments generated: 2

- Anchor the ? to start-of-line (with optional whitespace/ANSI escapes) to avoid false positives like "What happened? ›"
- Add negative test for mid-line ? with chevron
- Add positive test for ANSI-prefixed prompt lines
- Update comment to reflect the tightened pattern

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

When the 2-minute extended polling timeout fires and the process is still active, fire onDidDetectInputNeeded instead of silently cancelling. This ensures the agent sees the terminal output and can assess/respond to unrecognised interactive prompts rather than hanging indefinitely.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

meganrogge · 2026-04-29T15:18:00Z

/requires-eval-assessment terminalbench2 gpt-5.4,claude-opus-4.6,claude-opus-4.7

Copilot

Copilot's findings

Files reviewed: 2/2 changed files
Comments generated: 2

…r/tools/monitoring/outputMonitor.ts Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

…s/browser/tools/monitoring/outputMonitor.ts" This reverts commit 93761d6.

The background steering message and UI label now say "may be waiting for input" / "may need input" instead of asserting input is definitely needed. This is important because onDidDetectInputNeeded also fires on extended timeout where the process might just be slow rather than waiting for input.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

vs-code-engineering · 2026-04-29T15:43:56Z

⏳ Queued vscode build for 62f020120ff3b0d304296ef6f0f517af7c1a00f1 (step 1/2).

Build: https://dev.azure.com/monacotools/Monaco/_build/results?buildId=434896
When this succeeds, the eval-assessment publish build will be queued automatically.

vs-code-engineering · 2026-04-29T16:41:46Z

🚀 Queued eval-assessment publish build for a6951e11ba48bf63fc669ea802ca431ed9565753 (step 2/2).

Pipeline run: https://dev.azure.com/monacotools/Monaco/_build/results?buildId=434918
On success, publishes @vscode/vscode-copilot-evaluation-agent@0.0.0-dev.a6951e11ba on the dev tag.

vs-code-engineering · 2026-04-29T16:56:15Z

🔬 Queued eval-assessment benchmark for 8fec77ac24.

Package: @vscode/vscode-copilot-evaluation-agent@0.0.0-dev.a6951e11ba (dev tag)
Benchmark: terminalbench2
Tracking issues:
- gpt-5.4: https://github.com/github/evald/issues/18800
- claude-opus-4.6: https://github.com/github/evald/issues/18801
- claude-opus-4.7: https://github.com/github/evald/issues/18802

Results will be posted back here when the run completes.

vs-code-engineering · 2026-04-29T16:56:40Z

✅ Eval-assessment build published.

Package: @vscode/vscode-copilot-evaluation-agent@0.0.0-dev.a6951e11ba (tag: dev)
Install: npm install @vscode/vscode-copilot-evaluation-agent@0.0.0-dev.a6951e11ba
Pipeline run: https://dev.azure.com/monacotools/Monaco/_build/results?buildId=434918

vs-code-engineering · 2026-04-29T23:05:44Z

📊 Eval-assessment benchmark complete.

Tracking issue (claude-opus-4.7): https://github.com/github/evald/issues/18802
Publish build: https://dev.azure.com/monacotools/Monaco/_build/results?buildId=434918

Analysis Results

Resolution Rate

Benchmark	Total Cases	Passed	Failed	Resolved Rate
terminalbench2	89	70	19	78.65%

Token Usage

Metric	Value
Total Tokens	74,858,135
Input Tokens	73,733,364
Output Tokens	1,124,771
Cached Tokens	70,833,033

Step Counts

Metric	Value
Total Steps	1,877
Mean Steps/Instance	21.09

vs-code-engineering · 2026-04-30T00:51:10Z

📊 Eval-assessment benchmark complete.

Tracking issue (claude-opus-4.6): https://github.com/github/evald/issues/18801
Publish build: https://dev.azure.com/monacotools/Monaco/_build/results?buildId=434918

Analysis Results

Resolution Rate

Benchmark	Total Cases	Passed	Failed	Resolved Rate
terminalbench2	89	49	40	55.06%

Token Usage

Metric	Value
Total Tokens	51,208,362
Input Tokens	50,198,448
Output Tokens	1,009,914
Cached Tokens	47,123,467

Step Counts

Metric	Value
Total Steps	1,822
Mean Steps/Instance	20.47

vs-code-engineering · 2026-04-30T00:52:34Z

📊 Eval-assessment benchmark complete.

Tracking issue (gpt-5.4): https://github.com/github/evald/issues/18800
Publish build: https://dev.azure.com/monacotools/Monaco/_build/results?buildId=434918

Analysis Results

Resolution Rate

Benchmark	Total Cases	Passed	Failed	Resolved Rate
terminalbench2	89	45	44	50.56%

Token Usage

Metric	Value
Total Tokens	72,042,918
Input Tokens	71,131,026
Output Tokens	911,892
Cached Tokens	61,496,960

Step Counts

Metric	Value
Total Steps	1,566
Mean Steps/Instance	17.60

Detect chevron-style interactive prompts in agent terminal

341b693

Fixes #312576

Copilot AI review requested due to automatic review settings April 29, 2026 14:41

meganrogge self-assigned this Apr 29, 2026

meganrogge added this to the 1.119.0 milestone Apr 29, 2026

meganrogge enabled auto-merge (squash) April 29, 2026 14:43

Copilot started reviewing on behalf of meganrogge April 29, 2026 14:44 View session

Copilot AI reviewed Apr 29, 2026

Comment thread ...s/workbench/contrib/terminalContrib/chatAgentTools/browser/tools/monitoring/outputMonitor.ts Outdated

Comment thread src/vs/workbench/contrib/terminalContrib/chatAgentTools/test/browser/outputMonitor.test.ts

meganrogge and others added 2 commits April 29, 2026 10:58

meganrogge changed the title ~~Detect chevron-style interactive prompts in agent terminal~~ Detect chevron-style interactive prompts and fix indefinite foreground hang on unrecognised prompts Apr 29, 2026

meganrogge requested a review from Copilot April 29, 2026 15:19

vs-code-engineering Bot added the ~requires-eval-assessment Evals will be run and will generate a report upon completion label Apr 29, 2026

Copilot started reviewing on behalf of meganrogge April 29, 2026 15:21 View session

Copilot AI reviewed Apr 29, 2026

Comment thread src/vs/workbench/contrib/terminalContrib/chatAgentTools/test/browser/outputMonitor.test.ts

Comment thread ...s/workbench/contrib/terminalContrib/chatAgentTools/browser/tools/monitoring/outputMonitor.ts

meganrogge and others added 3 commits April 29, 2026 11:36

Update src/vs/workbench/contrib/terminalContrib/chatAgentTools/browse…

93761d6

…r/tools/monitoring/outputMonitor.ts Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Revert "Update src/vs/workbench/contrib/terminalContrib/chatAgentTool…

a99c62f

…s/browser/tools/monitoring/outputMonitor.ts" This reverts commit 93761d6.

meganrogge added ~requires-eval-assessment Evals will be run and will generate a report upon completion and removed ~requires-eval-assessment Evals will be run and will generate a report upon completion labels Apr 29, 2026

pwang347 approved these changes Apr 29, 2026

meganrogge merged commit 6707517 into main Apr 29, 2026
26 checks passed

meganrogge deleted the megan/fix-312576-chevron-prompt-detection branch April 29, 2026 16:04

vs-code-engineering Bot removed the ~requires-eval-assessment Evals will be run and will generate a report upon completion label Apr 29, 2026

meganrogge mentioned this pull request Apr 29, 2026

Eval findings - set -e hangs benchmarks #313296

Closed

meganrogge mentioned this pull request May 1, 2026

Test plan: terminal agent tool end-to-end stability (week of 4/23) #313618

Closed

50 tasks

vs-code-engineering Bot added the on-testplan label May 1, 2026

meganrogge merged 6 commits into main from megan/fix-312576-chevron-prompt-detection


3 participants
