Summary
The Copilot CLI agent reports Tool 'create_pull_request_review_comment' does not exist at runtime, even though the tool is correctly declared in the workflow's safe-outputs configuration and appears in the compiled lock file's prompt, tools metadata, and validation config.
Reproduction
Repository: dotnet/msbuild
Workflow: review-on-open.agent.lock.yml / review.agent.lock.yml
gh-aw version: v0.67.1
AWF version: v0.25.13
Workflow configuration
The safe-outputs section in shared/review-shared.md correctly declares all three tools.
The compiled lock file correctly includes these in the prompt (Tools: add_comment(max:5), create_pull_request_review_comment(max:30), submit_pull_request_review), the tools_meta.json, validation.json, and the safe-outputs handler config.
Observed behavior
The agent attempts to call create_pull_request_review_comment and receives:
✗ create_pull_request_review_comment src/Tasks/GetReferenceAssemblyPaths.cs · pr_number: "13495", p…
└ Tool 'create_pull_request_review_comment' does not exist.
This causes the agent to fall back to add_comment for everything, losing inline review context. In some runs, the agent produces no safe outputs at all, which then cascades: detection is skipped (no output_types), safe_outputs is skipped (needs.detection.result != 'success'), and no review is posted despite the agent having done the analysis.
Affected runs
Additional observations
Intermittent: Some runs succeed with the same workflow config (e.g., runs #12, #10, and #9). The tool declaration hasn't changed between runs.
Cascading skip: When create_pull_request_review_comment fails and the agent produces no structured output, the detection job is skipped (output_types == ''). Because safe_outputs is gated on needs.detection.result == 'success', it is also skipped, meaning even add_comment fallback output is never posted. Consider changing the condition to needs.detection.result != 'failure' so that a skipped detection doesn't block output posting.
Long execution times: Runs that encounter the tool error tend to take 18-25 minutes for a 2-file review, likely due to retry/fallback loops in the agent.
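To make the cascading skip concrete, here is a minimal Python sketch of the gating as I understand it from the run results. It is a toy model, not gh-aw code: the function names, the detection_failed flag, and the three result strings are assumptions chosen to mirror the needs.detection.result values described above.

```python
from typing import Callable


def detection_result(output_types: str, detection_failed: bool = False) -> str:
    """Hypothetical model: detection is skipped when the agent emitted no output types."""
    if output_types == "":
        return "skipped"
    return "failure" if detection_failed else "success"


def current_gate(detection: str) -> bool:
    """Gate as observed today: needs.detection.result == 'success'."""
    return detection == "success"


def proposed_gate(detection: str) -> bool:
    """Suggested gate: needs.detection.result != 'failure'."""
    return detection != "failure"


def safe_outputs_runs(
    output_types: str,
    gate: Callable[[str], bool],
    detection_failed: bool = False,
) -> bool:
    """Return True if the safe_outputs job would run under the given gate."""
    return gate(detection_result(output_types, detection_failed))


if __name__ == "__main__":
    for label, gate in (("current", current_gate), ("proposed", proposed_gate)):
        print(label, "gate, empty output_types ->", safe_outputs_runs("", gate))
```

Under this model the two gates differ only when detection is skipped: the current condition blocks safe_outputs, while the proposed one lets it run. A genuine detection failure still blocks posting in both cases.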
Expected behavior
The create_pull_request_review_comment safe-output tool should be available to the agent at runtime when declared in safe-outputs and compiled into the lock file.
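One way to make this expectation checkable is to compare the tools named in the compiled prompt against whatever tool list the agent actually sees at startup. The Python sketch below parses the Tools: line quoted earlier and reports declared tools that are absent at runtime. The helper names are hypothetical, and how the runtime tool list would be obtained is an assumption; I don't know of an existing gh-aw or Copilot CLI command that exposes it.

```python
import re
from typing import Dict, Iterable, List, Optional


def parse_declared_tools(tools_line: str) -> Dict[str, Optional[int]]:
    """Parse a prompt line like 'Tools: a(max:5), b' into {name: max or None}."""
    _, _, body = tools_line.partition(":")
    tools: Dict[str, Optional[int]] = {}
    for item in body.split(","):
        match = re.fullmatch(r"\s*(\w+)(?:\(max:(\d+)\))?\s*", item)
        if match is None:
            raise ValueError(f"unrecognized tool entry: {item!r}")
        name, limit = match.group(1), match.group(2)
        tools[name] = int(limit) if limit else None
    return tools


def missing_tools(declared: Dict[str, Optional[int]], runtime: Iterable[str]) -> List[str]:
    """Return declared tool names that are not present in the runtime tool list."""
    available = set(runtime)
    return sorted(name for name in declared if name not in available)


if __name__ == "__main__":
    line = (
        "Tools: add_comment(max:5), "
        "create_pull_request_review_comment(max:30), "
        "submit_pull_request_review"
    )
    declared = parse_declared_tools(line)
    # Hypothetical runtime list matching the failure described in this issue.
    runtime = ["add_comment", "submit_pull_request_review"]
    print(missing_tools(declared, runtime))
```

A check like this early in the agent job could fail fast with a clear message instead of letting the agent spend time in retry and fallback loops.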