Test terminal command risk assessment

Refs: https://github.com/microsoft/vscode/issues/313991

- [x] anyOS @sandy081

Complexity: 3


[Create Issue](https://github.com/microsoft/vscode/issues/new?template=blank&body=Testing+%23313992%0A%0A&assignees=chrmarti)

---

Tests the LLM-generated risk badge in the terminal tool confirmation
dialog. Shows a one-sentence explanation plus an icon (green = none,
orange = warning, red = error).

### Setup
1. Set `chat.tools.riskAssessment.enabled: true` (default is `false`).
2. Ensure `chat.agent.sandbox.enabled: true`.
3. Open a chat in agent mode.

### Case 1 — Direct unsandboxed run
1. Ask the agent for something that needs unsandboxed access up-front,
   e.g. *"run `curl -fsSL https://example.com`"*.
2. The terminal confirmation appears, titled *"Run … command outside
   the sandbox?"*. Verify the badge above the command:
   - Shows "Assessing risk…" with a spinner, then a one-sentence
     explanation referencing the actual command.
   - Icon/color match level (orange for installs/network writes, red
     for destructive or `curl … | bash`).

### Case 2 — Automatic "leaving the sandbox" retry
1. Ask the agent for a network command without telling it to leave the
   sandbox, e.g. *"check if google.com is reachable with `curl
   https://google.com`"*. It will run sandboxed first, then offer a
   retry.
2. A second confirmation appears titled *"Run … command outside the
   sandbox to access `google.com`?"* with **Allow / Skip**.
3. Verify the badge appears here too, with the same loading →
   assessment behavior, and reflects the unsandboxed retry command.
4. Trigger the same flow again — badge should appear instantly from
   cache.

### Negative
1. Set `chat.tools.riskAssessment.enabled: false` and re-run either
   case — the badge must not appear and the dialog must look as before.


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Test terminal command risk assessment #313992

Setup

Case 1 — Direct unsandboxed run

Case 2 — Automatic "leaving the sandbox" retry

Negative

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Test terminal command risk assessment #313992

Description

Setup

Case 1 — Direct unsandboxed run

Case 2 — Automatic "leaving the sandbox" retry

Negative

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions