Cap get_terminal_output first-poll and non-prefix responses to a tail by meganrogge · Pull Request #320140 · microsoft/vscode

meganrogge · 2026-06-05T16:46:58Z

Fixes the spillover side of microsoft/vscode-internalbacklog#7869.

PR #315543 already shrinks repeated get_terminal_output polls by returning only the delta. But the first poll and the non-prefix fallback still returned the full buffer (up to the ~60 KB upstream cap in outputHelpers.ts), which is well above the Copilot SDK's ~10 KB spillover threshold and causes the agent to thrash with read_file calls on the spillover temp file.

This change adds a second, tighter cap inside the tool itself: when the full output exceeds 8 KB, return only the last 8 KB (line-aligned). The truncation marker includes both the omitted character count and a hint telling the agent how to recover the head if it needs to: re-run the command and redirect output to a file, then read that file.

Snapshot tracking (length + hash) still uses the full buffer, so subsequent delta polls work unchanged.

Behavior

The tail cap applies on every code path (always on — not gated by an experiment), because the SDK spillover issue affects all users regardless of whether chat.tools.terminal.outputDeltas is enabled. The delta-tracking semantics introduced by #315543 remain gated by that experiment.

Experiment off: full output if ≤ 8 KB; otherwise tail + recovery hint.
Experiment on, first poll: same.
Experiment on, unchanged poll: unchanged since previous poll marker (no output).
Experiment on, pure-prefix delta poll: only the new characters (already small).
Experiment on, non-prefix fallback: same tail treatment as first poll.

Risks / follow-ups

8 KB is a tuned guess. Picked to stay under the ~10 KB SDK spillover threshold with headroom for the prefix text. If the SDK threshold changes we may need to retune.
No telemetry on how often truncation kicks in or whether the agent successfully recovers via the redirect hint. Worth adding to validate.
Line-alignment edge case: single-line outputs with no \n in the last 8 KB fall back to a raw character cut, which could land mid-ANSI-escape. Rare.

Tests

getTerminalOutputTool.test.ts — all 11 tests pass, including two added for the new behavior:

returns only the tail on first poll when output exceeds the tail budget
returns only the tail on non-prefix fallback when output exceeds the tail budget

vs-code-engineering · 2026-06-05T16:49:09Z

📬 CODENOTIFY

The following users are being notified based on files changed in this PR:

@anthonykim1

Matched files:

src/vs/workbench/contrib/terminalContrib/chatAgentTools/browser/tools/getTerminalOutputTool.ts
src/vs/workbench/contrib/terminalContrib/chatAgentTools/test/browser/getTerminalOutputTool.test.ts

Copilot

Pull request overview

This PR further reduces get_terminal_output payload sizes (when chat.tools.terminal.outputDeltas is enabled) by tail-truncating the first poll and non-prefix fallback responses to ~8 KB (line-aligned), preventing Copilot SDK spillover thrash while keeping full-buffer snapshotting for delta detection.

Changes:

Add an internal 8 KB “tail budget” and tail-formatting helper to cap first-poll and non-prefix-fallback outputs.
Preserve full-buffer snapshot (length + hash) so unchanged/delta detection continues to work.
Add unit tests for tail behavior on first poll and on non-prefix fallback.

Show a summary per file

File	Description
src/vs/workbench/contrib/terminalContrib/chatAgentTools/browser/tools/getTerminalOutputTool.ts	Adds tail-capping helpers and applies them to first-poll and non-prefix fallback outputs under the output-deltas experiment.
src/vs/workbench/contrib/terminalContrib/chatAgentTools/test/browser/getTerminalOutputTool.test.ts	Adds tests validating tail truncation behavior for large outputs on first poll and non-prefix fallback.

Copilot's findings

Comments suppressed due to low confidence (1)

src/vs/workbench/contrib/terminalContrib/chatAgentTools/browser/tools/getTerminalOutputTool.ts:117

When output exceeds the tail budget, the first poll returns only a tail, but the unchanged-marker still says "(… characters already shown)". That text becomes inaccurate/misleading because the tool did not actually return the full buffer previously, only the tail. Consider adjusting the unchanged-marker when output.length exceeds the tail budget so it doesn’t imply the full output was already shown.

		if (currentOutputSnapshot.length === previousOutputSnapshot.length && currentOutputSnapshot.hash === previousOutputSnapshot.hash) {
			return `Output of terminal ${id} unchanged since previous poll (${output.length} characters already shown). No new output.`;
		}

Files reviewed: 2/2 changed files
Comments generated: 1

…irect

meganrogge · 2026-06-05T16:53:01Z

/requires-eval-assessment terminalbench2 gpt-5.4,claude-opus-4.6,claude-opus-4.7

vs-code-engineering · 2026-06-05T16:53:59Z

⏳ Queued vscode build for cfa3e205c3b8cc5b0781f6cc131705f127422c8f (step 1/2).

Build: https://dev.azure.com/monacotools/Monaco/_build/results?buildId=445301
When this succeeds, the eval-assessment publish build will be queued automatically.

…l output was previously returned

vs-code-engineering · 2026-06-05T17:04:40Z

⏳ Queued vscode build for 1c5fbc4ace19cf07b0991876cedc33cc0a88ce4a (step 1/2).

Build: https://dev.azure.com/monacotools/Monaco/_build/results?buildId=445307
When this succeeds, the eval-assessment publish build will be queued automatically.

vs-code-engineering · 2026-06-05T18:12:40Z

🚀 Queued eval-assessment publish build for 6c9af180cecf3741c176b354ca3e842a5e1a35fc (step 2/2).

Pipeline run: https://dev.azure.com/monacotools/Monaco/_build/results?buildId=445322
On success, publishes @vscode/vscode-copilot-evaluation-agent@0.0.0-dev.6c9af180ce on the dev tag.

vs-code-engineering · 2026-06-05T18:31:10Z

🔬 Queued eval-assessment benchmark for 627279b46c.

Package: @vscode/vscode-copilot-evaluation-agent@0.0.0-dev.6c9af180ce (dev tag)
Agent: vscode
Benchmark: terminalbench2
Tracking issues:
- terminalbench2 / gpt-5.4: https://github.com/github/evald/issues/28972
- terminalbench2 / claude-opus-4.6: https://github.com/github/evald/issues/28973
- terminalbench2 / claude-opus-4.7: https://github.com/github/evald/issues/28974

Results will be posted back here when the run completes.

vs-code-engineering · 2026-06-05T18:31:41Z

✅ Eval-assessment build published.

Package: @vscode/vscode-copilot-evaluation-agent@0.0.0-dev.6c9af180ce (tag: dev)
Install: npm install @vscode/vscode-copilot-evaluation-agent@0.0.0-dev.6c9af180ce
Pipeline run: https://dev.azure.com/monacotools/Monaco/_build/results?buildId=445322

vs-code-engineering · 2026-06-05T22:31:57Z

📊 Eval-assessment benchmark complete.

Tracking issue: https://github.com/github/evald/issues/28972
Publish build: https://dev.azure.com/monacotools/Monaco/_build/results?buildId=445322

🧪 Results

vs-code-engineering · 2026-06-06T06:48:29Z

📊 Eval-assessment benchmark complete.

Tracking issue: https://github.com/github/evald/issues/28974
Publish build: https://dev.azure.com/monacotools/Monaco/_build/results?buildId=445322

🧪 Results

vs-code-engineering · 2026-06-06T06:50:27Z

📊 Eval-assessment benchmark complete.

Tracking issue: https://github.com/github/evald/issues/28973
Publish build: https://dev.azure.com/monacotools/Monaco/_build/results?buildId=445322

🧪 Results

Cap get_terminal_output first-poll and non-prefix responses to tail

d2e6be5

Copilot AI review requested due to automatic review settings June 5, 2026 16:46

Copilot started reviewing on behalf of meganrogge June 5, 2026 16:47 View session

meganrogge self-assigned this Jun 5, 2026

meganrogge added this to the 1.125.0 milestone Jun 5, 2026

Apply tail cap regardless of outputDeltas experiment

94bacba

Copilot AI reviewed Jun 5, 2026

View reviewed changes

Comment thread .../workbench/contrib/terminalContrib/chatAgentTools/test/browser/getTerminalOutputTool.test.ts

Always apply get_terminal_output tail cap and hint at re-run-with-red…

cfa3e20

…irect

meganrogge added the ~requires-eval-assessment Evals will be run and will generate a report upon completion label Jun 5, 2026

Clarify get_terminal_output unchanged-marker so it does not imply ful…

1c5fbc4

…l output was previously returned

meganrogge added ~requires-eval-assessment Evals will be run and will generate a report upon completion and removed ~requires-eval-assessment Evals will be run and will generate a report upon completion labels Jun 5, 2026

lramos15 approved these changes Jun 5, 2026

View reviewed changes

vs-code-engineering Bot removed the ~requires-eval-assessment Evals will be run and will generate a report upon completion label Jun 5, 2026

meganrogge modified the milestones: 1.125.0, 1.124.0 Jun 5, 2026

meganrogge merged commit a9efb8d into main Jun 5, 2026
25 checks passed

meganrogge deleted the megan/terminal-output-tail-default branch June 5, 2026 20:13

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cap get_terminal_output first-poll and non-prefix responses to a tail#320140

Cap get_terminal_output first-poll and non-prefix responses to a tail#320140
meganrogge merged 4 commits into
mainfrom
megan/terminal-output-tail-default

meganrogge commented Jun 5, 2026 •

edited

Loading

Uh oh!

vs-code-engineering Bot commented Jun 5, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

meganrogge commented Jun 5, 2026

Uh oh!

vs-code-engineering Bot commented Jun 5, 2026

Uh oh!

vs-code-engineering Bot commented Jun 5, 2026

Uh oh!

vs-code-engineering Bot commented Jun 5, 2026

Uh oh!

vs-code-engineering Bot commented Jun 5, 2026

Uh oh!

vs-code-engineering Bot commented Jun 5, 2026

Uh oh!

Uh oh!

vs-code-engineering Bot commented Jun 5, 2026

Uh oh!

vs-code-engineering Bot commented Jun 6, 2026

Uh oh!

vs-code-engineering Bot commented Jun 6, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

meganrogge commented Jun 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Behavior

Risks / follow-ups

Tests

Uh oh!

vs-code-engineering Bot commented Jun 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

📬 CODENOTIFY

@anthonykim1

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Copilot's findings

Uh oh!

Uh oh!

meganrogge commented Jun 5, 2026

Uh oh!

vs-code-engineering Bot commented Jun 5, 2026

Uh oh!

vs-code-engineering Bot commented Jun 5, 2026

Uh oh!

vs-code-engineering Bot commented Jun 5, 2026

Uh oh!

vs-code-engineering Bot commented Jun 5, 2026

Uh oh!

vs-code-engineering Bot commented Jun 5, 2026

Uh oh!

Uh oh!

vs-code-engineering Bot commented Jun 5, 2026

Uh oh!

vs-code-engineering Bot commented Jun 6, 2026

Uh oh!

vs-code-engineering Bot commented Jun 6, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

meganrogge commented Jun 5, 2026 •

edited

Loading

vs-code-engineering Bot commented Jun 5, 2026 •

edited

Loading