feat(plan): enforce strict read-only policy and halt execution on violation #16849

jerop · 2026-01-16T17:00:56Z

Summary

This PR refines the experimental Plan Mode by implementing a strict "secure-by-default" policy that allows only read and search operations. It also enhances the tool scheduler to immediately halt agent execution if a prohibited tool is attempted, preventing retry loops and providing clear guidance to the user.

Closes #16625

Details

Policy Implementation

Introduced packages/core/src/policy/policies/plan.toml with a default-deny rule for all tools in Plan mode.
Explicitly allow-listed only read and search tools (read_file, web_fetch, google_web_search, etc.).
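To illustrate the default-deny idea, here is a minimal sketch of how a catch-all deny rule can combine with higher-priority allow rules. It is not the project's actual PolicyEngine or the contents of plan.toml: the `PolicyRule` shape, the `evaluate` helper, the priority values, and the `write_file` tool name used in the usage note are all assumptions made for illustration. Only the three allowed tool names come from the description above.

```typescript
// Hypothetical types for illustration; not the project's real policy API.
type Decision = "allow" | "deny";

interface PolicyRule {
  toolName: string; // "*" matches any tool
  decision: Decision;
  priority: number; // the highest-priority matching rule wins
}

// Assumed rule set: one catch-all deny plus higher-priority allow rules
// for the read and search tools named in the PR description.
const planModeRules: PolicyRule[] = [
  { toolName: "*", decision: "deny", priority: 0 },
  { toolName: "read_file", decision: "allow", priority: 10 },
  { toolName: "web_fetch", decision: "allow", priority: 10 },
  { toolName: "google_web_search", decision: "allow", priority: 10 },
];

function evaluate(toolName: string, rules: PolicyRule[]): Decision {
  const matching = rules
    .filter((r) => r.toolName === "*" || r.toolName === toolName)
    .sort((a, b) => b.priority - a.priority);
  // If nothing matches at all, still deny (secure by default).
  return matching[0]?.decision ?? "deny";
}
```

Under these assumptions, `evaluate("read_file", planModeRules)` yields `"allow"`, while a tool with no allow rule, such as a hypothetical `write_file`, falls through to the catch-all and yields `"deny"`.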

Execution Control

Updated CoreToolScheduler to return a STOP_EXECUTION error type when a tool is denied in Plan mode.
This triggers the existing "Stop" logic in the agent loop, preventing the model from retrying the same blocked action.
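The control flow described above can be sketched roughly as follows. This is a simplified model, not the real CoreToolScheduler: `scheduleTool`, `runAgentLoop`, the `ToolResult` shape, and the `EXECUTION_FAILED` member are hypothetical. Only the `STOP_EXECUTION` name and the denial message text are taken from this PR.

```typescript
// Hypothetical names throughout, except STOP_EXECUTION and the message text.
const ToolErrorType = {
  STOP_EXECUTION: "STOP_EXECUTION",
  EXECUTION_FAILED: "EXECUTION_FAILED", // assumed ordinary, retryable error
} as const;
type ToolErrorType = (typeof ToolErrorType)[keyof typeof ToolErrorType];

interface ToolResult {
  output?: string;
  error?: { type: ToolErrorType; message: string };
}

const PLAN_MODE_DENIAL =
  "Tool execution denied by policy. You are in Plan Mode - adjust your prompt to only use read and search tools.";

// Stand-in for the scheduler: a denied tool yields a STOP_EXECUTION error.
function scheduleTool(
  toolName: string,
  isAllowed: (name: string) => boolean,
): ToolResult {
  if (!isAllowed(toolName)) {
    return {
      error: { type: ToolErrorType.STOP_EXECUTION, message: PLAN_MODE_DENIAL },
    };
  }
  return { output: `ran ${toolName}` };
}

// Stand-in for the agent loop: STOP_EXECUTION ends the turn instead of
// handing the error back to the model, which could retry the same call.
function runAgentLoop(
  requestedTools: string[],
  isAllowed: (name: string) => boolean,
): string[] {
  const log: string[] = [];
  for (const tool of requestedTools) {
    const result = scheduleTool(tool, isAllowed);
    if (result.error?.type === ToolErrorType.STOP_EXECUTION) {
      log.push(`stopped: ${result.error.message}`);
      break;
    }
    log.push(result.output ?? "");
  }
  return log;
}
```

In this sketch, a request sequence of an allowed tool, a denied tool, and another allowed tool runs the first, records the denial, and never reaches the third. That is the "halt rather than retry" behavior the PR aims for.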

How to Validate

Build the project:
```
npm run build
```

Run in Plan Mode:

# Ensure experimental.plan is enabled in your settings
npm start -- --approval-mode=plan

Attempt a Write Operation:
- Ask: "Create a new file called test.txt"
- Expected: The agent stops immediately with the error: "Tool execution denied by policy. You are in Plan Mode - adjust your prompt to only use read and search tools."

Attempt a Read Operation:
- Ask: "Read package.json"
- Expected: The tool executes successfully.

Pre-Merge Checklist

Introduces a default-deny policy for Plan mode that explicitly allows only safe read and search tools. Includes integration tests verifying tool enforcement and priority logic.

Updates the tool scheduler to return a STOP_EXECUTION error type when a tool is denied in Plan mode. This breaks the agent's retry loop and provides a clear instructional error message. Includes unit tests for the new denial behavior.

Updates the Plan mode unit test to use the correct MockTool constructor signature and proper type casting for the mocked PolicyEngine.

github-actions · 2026-01-16T17:13:38Z

Size Change: +375 B (0%)

Total Size: 23.1 MB


| Filename | Size | Change |
| --- | --- | --- |
| `./bundle/gemini.js` | 23.1 MB | +375 B (0%) |
| `./bundle/sandbox-macos-permissive-closed.sb` | 1.03 kB | 0 B |
| `./bundle/sandbox-macos-permissive-open.sb` | 890 B | 0 B |
| `./bundle/sandbox-macos-permissive-proxied.sb` | 1.31 kB | 0 B |
| `./bundle/sandbox-macos-restrictive-closed.sb` | 3.29 kB | 0 B |
| `./bundle/sandbox-macos-restrictive-open.sb` | 3.36 kB | 0 B |
| `./bundle/sandbox-macos-restrictive-proxied.sb` | 3.56 kB | 0 B |



jerop requested a review from a team as a code owner January 16, 2026 17:00

jerop force-pushed the feat/plan-mode-refinement branch 2 times, most recently from d4fb3bf to a75fd86 on January 16, 2026 17:05

jerop added 2 commits January 16, 2026 12:06

feat(plan): implement restrictive policy for Plan mode

ff8c7f3

Introduces a default-deny policy for Plan mode that explicitly allows only safe read and search tools. Includes integration tests verifying tool enforcement and priority logic.

jerop force-pushed the feat/plan-mode-refinement branch from a75fd86 to 476394f on January 16, 2026 17:07

fix(tests): correct MockTool usage and type casting in Plan mode test

408b955

Updates the Plan mode unit test to use the correct MockTool constructor signature and proper type casting for the mocked PolicyEngine.

jerop enabled auto-merge January 16, 2026 17:17

jacob314 approved these changes Jan 16, 2026


jerop added this pull request to the merge queue Jan 16, 2026

github-merge-queue bot removed this pull request from the merge queue due to failed status checks Jan 16, 2026

jerop added this pull request to the merge queue Jan 16, 2026

Merged via the queue into main with commit 5241174 Jan 16, 2026
43 of 44 checks passed

jerop deleted the feat/plan-mode-refinement branch January 16, 2026 18:08
