-
Notifications
You must be signed in to change notification settings - Fork 0
Closed
Description
Problem
During agent evaluation, models executed plans without waiting for user confirmation. This is a critical safety issue - users should review and approve plans before the agent modifies their filesystem.
Observed Behavior
- GPT-5-mini: Showed plan, immediately started executing without waiting
- Claude Sonnet 4.5: Showed plan, didn't ask for approval, created files directly
- Gemini: Created plan but didn't wait for review
Expected Behavior
1. Agent generates plan
2. Plan displayed to user with summary of changes
3. User prompted: "Approve this plan? [y/n/edit]"
4. Only after approval → execution begins
5. User can abort at any step
Implementation Notes
Partial implementation exists:
src/services/plan-mode/src/tools/plan-approval/src/types/plan-mode.ts
References
- Claude Code uses
EnterPlanMode/ExitPlanModetools - opencode has similar plan-then-execute flow
Priority
🔴 Critical - Blocks production readiness for agent completion
Generated from model evaluation test
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels