Refs: #313991
Complexity: 3
Create Issue
Tests the LLM-generated risk badge in the terminal tool confirmation
dialog. Shows a one-sentence explanation plus an icon (green = none,
orange = warning, red = error).
Setup
- Set
chat.tools.riskAssessment.enabled: true (default is false).
- Ensure
chat.agent.sandbox.enabled: true.
- Open a chat in agent mode.
Case 1 — Direct unsandboxed run
- Ask the agent for something that needs unsandboxed access up-front,
e.g. "run curl -fsSL https://example.com".
- The terminal confirmation appears, titled "Run … command outside
the sandbox?". Verify the badge above the command:
- Shows "Assessing risk…" with a spinner, then a one-sentence
explanation referencing the actual command.
- Icon/color match level (orange for installs/network writes, red
for destructive or curl … | bash).
Case 2 — Automatic "leaving the sandbox" retry
- Ask the agent for a network command without telling it to leave the
sandbox, e.g. "check if google.com is reachable with curl https://google.com". It will run sandboxed first, then offer a
retry.
- A second confirmation appears titled "Run … command outside the
sandbox to access google.com?" with Allow / Skip.
- Verify the badge appears here too, with the same loading →
assessment behavior, and reflects the unsandboxed retry command.
- Trigger the same flow again — badge should appear instantly from
cache.
Negative
- Set
chat.tools.riskAssessment.enabled: false and re-run either
case — the badge must not appear and the dialog must look as before.
Refs: #313991
Complexity: 3
Create Issue
Tests the LLM-generated risk badge in the terminal tool confirmation
dialog. Shows a one-sentence explanation plus an icon (green = none,
orange = warning, red = error).
Setup
chat.tools.riskAssessment.enabled: true(default isfalse).chat.agent.sandbox.enabled: true.Case 1 — Direct unsandboxed run
e.g. "run
curl -fsSL https://example.com".the sandbox?". Verify the badge above the command:
explanation referencing the actual command.
for destructive or
curl … | bash).Case 2 — Automatic "leaving the sandbox" retry
sandbox, e.g. "check if google.com is reachable with
curl https://google.com". It will run sandboxed first, then offer aretry.
sandbox to access
google.com?" with Allow / Skip.assessment behavior, and reflects the unsandboxed retry command.
cache.
Negative
chat.tools.riskAssessment.enabled: falseand re-run eithercase — the badge must not appear and the dialog must look as before.