Fix overly aggressive error patterns in Claude, Codex, and Copilot engines by Copilot · Pull Request #1185 · github/gh-aw

Copilot · 2025-10-03T00:47:48Z

Problem

The error patterns in Claude, Codex, and Copilot engines were too broad and created false positives by matching informational text in logs instead of just actual errors. This resulted in excessive noise in workflow outputs and made it difficult to identify genuine issues.

Overly Broad Patterns

Patterns like (?i)unauthorized, (?i)forbidden, and (?i)permission.*denied matched any occurrence of these keywords, even in informational contexts:

❌ FALSE POSITIVE: "The API returned 401 Unauthorized"
❌ FALSE POSITIVE: "This endpoint is forbidden without admin privileges"
❌ FALSE POSITIVE: "Permission was denied because the token expired"

These patterns triggered on:

Claude's reasoning/thinking text (e.g., "I'll check if the user is unauthorized...")
API response descriptions (e.g., "Received 403 Forbidden from server")
Informational context in logs
Documentation or explanatory text

Solution

All generic patterns now require explicit error context to match. Instead of matching any occurrence of keywords, patterns now look for these keywords only when preceded by error markers like "error", "ERROR", or "failed":

- Pattern: (?i)permission.*denied
+ Pattern: (?i)error.*permission.*denied

- Pattern: (?i)unauthorized  
+ Pattern: (?i)error.*unauthorized

- Pattern: (?i)forbidden
+ Pattern: (?i)error.*forbidden

Results

Before (False Positives):

"The API returned 401 Unauthorized"
  ❌ MATCHED - Generic pattern (too broad)

"Permission was denied because the token expired"
  ❌ MATCHED - Generic pattern (too broad)

After (Fixed):

"The API returned 401 Unauthorized"
  ✅ No match - Informational text correctly ignored

"Permission was denied because the token expired"
  ✅ No match - Informational text correctly ignored

"ERROR: unauthorized access attempt"
  ✅ MATCHED - Actual error correctly detected

Changes

Replaced Patterns

(?i)permission.*denied → (?i)error.*permission.*denied
(?i)unauthorized → (?i)error.*unauthorized
(?i)forbidden → (?i)error.*forbidden
(?i)access.*restricted → (?i)error.*access.*restricted
(?i)insufficient.*permission → (?i)error.*insufficient.*permission
(?i)token.*invalid → (?i)error.*token.*invalid (Copilot only)

Unchanged Patterns

Specific patterns that were already precise were kept as-is:

(?i)access denied.*only authorized.*can trigger.*workflow
(?i)access denied.*user.*not authorized
(?i)repository permission check failed
(?i)configuration error.*required permissions not specified
(?i)authentication failed
(?i)not authorized.*copilot

Testing

Added comprehensive test suite in pkg/workflow/error_pattern_tuning_test.go that verifies:

Patterns don't match informational text (prevents false positives)
Patterns still match actual errors (maintains error detection)
All three engines (Claude, Codex, Copilot) behave consistently

All existing unit and integration tests continue to pass.

Impact

This fix reduces noise in workflow logs by eliminating false positive error matches, making it easier to identify genuine issues when debugging agentic workflows.

Original prompt

The error pattern in claude is overly aggressive.

Analyze the output of this run https://github.com/githubnext/repo-mind/actions/runs/18209359159 and tune the claude agentic engine error patterns.

💡 You can make Copilot smarter by setting up custom instructions, customizing its development environment and configuring Model Context Protocol (MCP) servers. Learn more Copilot coding agent tips in the docs.

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

…gines Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Initial plan

b385a8e

Copilot AI assigned Copilot and pelikhan Oct 3, 2025

Copilot started work on behalf of pelikhan October 3, 2025 00:48 View session

Copilot AI and others added 4 commits October 3, 2025 00:57

Initial analysis of Claude error pattern issue

b3f1c30

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Fix overly aggressive error patterns in Claude, Codex, and Copilot en…

ab7b4d6

…gines Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Merge branch 'main' into copilot/tune-claude-error-patterns

7e1fc43

Add documentation to hasErrorContext function per code review feedback

443bf2e

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Copilot AI changed the title ~~[WIP] Refactor claude error handling for improved accuracy~~ Fix overly aggressive error patterns in Claude, Codex, and Copilot engines Oct 3, 2025

Copilot AI requested a review from pelikhan October 3, 2025 01:08

Copilot finished work on behalf of pelikhan October 3, 2025 01:08

pelikhan marked this pull request as ready for review October 3, 2025 01:09

pelikhan enabled auto-merge (squash) October 3, 2025 01:10

pelikhan disabled auto-merge October 3, 2025 01:11

pelikhan merged commit b2abf16 into main Oct 3, 2025
24 of 27 checks passed

pelikhan deleted the copilot/tune-claude-error-patterns branch October 3, 2025 01:12

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix overly aggressive error patterns in Claude, Codex, and Copilot engines#1185

Fix overly aggressive error patterns in Claude, Codex, and Copilot engines#1185
pelikhan merged 5 commits intomainfrom
copilot/tune-claude-error-patterns

Copilot AI commented Oct 3, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Copilot AI commented Oct 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Problem

Overly Broad Patterns

Solution

Results

Changes

Replaced Patterns

Unchanged Patterns

Testing

Impact

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Copilot AI commented Oct 3, 2025 •

edited

Loading