Add exception.type to OTel exception span events; sanitize event attributes by Copilot · Pull Request #25972 · github/gh-aw

Copilot · 2026-04-13T02:19:24Z

Exception span events in sendJobConclusionSpan only emit exception.message — exception.type is never set, violating OTel semantic conventions and preventing backends (Tempo, Honeycomb, Datadog) from classifying/grouping errors. Separately, sanitizeOTLPPayload redacts span.attributes but skips span.events[].attributes, so exception messages bypass sensitive-key redaction.

Changes

Add exception.type extraction: parse colon-prefixed error messages (push_to_pull_request_branch:Cannot push... → type gh-aw.push_to_pull_request_branch, message Cannot push...). Falls back to gh-aw.AgentError when no valid identifier prefix exists (the prefix must match ^[a-z_][a-z0-9_.]*$ case-insensitively and be shorter than 64 characters). Mixed-case prefixes are accepted but normalized to lowercase for consistent grouping (e.g., Push_To_PR:... → gh-aw.push_to_pr).
Sanitize event attributes: extend sanitizeOTLPPayload to apply sanitizeAttrs() to span.events[].attributes, closing the redaction gap.

// Before: anonymous exceptions, no redaction on events
attributes: [buildAttr("exception.message", msg.slice(0, MAX_ATTR_VALUE_LENGTH))]

// After: classified exceptions, events sanitized
attributes: [
  buildAttr("exception.type", exceptionType),
  buildAttr("exception.message", exceptionMessage),
]
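
The extraction rules described above can be sketched as a standalone function. This is a minimal illustration rather than the code in send_otlp_span.cjs: the function name extractException, the hasValidPrefix variable, and the 1024-character value for MAX_ATTR_VALUE_LENGTH are assumptions made here for the example.

```javascript
// Assumed limit for illustration; the real constant lives in send_otlp_span.cjs.
const MAX_ATTR_VALUE_LENGTH = 1024;
const FALLBACK_TYPE = "gh-aw.AgentError";

// Hypothetical helper: splits "prefix:message" into an exception type and message.
function extractException(msg) {
  const colonIdx = msg.indexOf(":");
  const prefix = colonIdx > 0 ? msg.slice(0, colonIdx) : "";
  // The prefix must look like an identifier and be shorter than 64 characters.
  const hasValidPrefix =
    colonIdx > 0 && colonIdx < 64 && /^[a-z_][a-z0-9_.]*$/i.test(prefix);
  return {
    // Mixed-case prefixes are accepted but lowercased for consistent grouping.
    type: hasValidPrefix ? `gh-aw.${prefix.toLowerCase()}` : FALLBACK_TYPE,
    // The full original message is kept when type extraction fails.
    message: (hasValidPrefix ? msg.slice(colonIdx + 1).trim() : msg).slice(
      0,
      MAX_ATTR_VALUE_LENGTH
    ),
  };
}

console.log(extractException("push_to_pull_request_branch:Cannot push"));
console.log(extractException("Push_To_PR: denied"));
console.log(extractException("bad prefix:oops"));
```

Under these assumptions, a message with a space in its prefix or a prefix of 64 or more characters keeps its full text and is typed as gh-aw.AgentError.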

8 new tests covering type extraction, lowercase normalization, fallback cases, and event attribute sanitization. All 159 tests pass.

…vent attributes

- Extract colon-prefixed type from error messages (e.g., "push_to_pull_request_branch:...") as "gh-aw.push_to_pull_request_branch"; fall back to "gh-aw.AgentError"
- Strip the type prefix from exception.message when extracted
- Extend sanitizeOTLPPayload to sanitize span.events[].attributes (redact sensitive keys, truncate long values); closes the gap where event attributes bypassed redaction
- Add 7 new tests covering type extraction, fallback, invalid prefixes, and event attribute sanitization

Agent-Logs-Url: https://github.com/github/gh-aw/sessions/71aeea08-33f4-412c-8716-0866e880672e
Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Copilot

Pull request overview

This PR updates OTLP exception span events emitted by sendJobConclusionSpan to include exception.type (per OTel semantic conventions) and closes a redaction gap by sanitizing span.events[].attributes in sanitizeOTLPPayload.

Changes:

Emit exception.type alongside exception.message for exception events, with type extraction from colon-prefixed error messages and a fallback type.
Extend sanitizeOTLPPayload to apply attribute redaction/truncation to span.events[].attributes.
Add/extend tests covering event-attribute sanitization and exception.type extraction/fallback behavior.
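
To make the redaction gap concrete, here is a rough sketch of applying the same attribute sanitizer to both span.attributes and span.events[].attributes. The body of sanitizeAttrs, the sensitive-key pattern, the "[REDACTED]" placeholder, the 1024-character limit, and the sanitizeSpan helper are all assumptions for illustration; the PR only states that sanitizeAttrs() redacts sensitive keys and truncates long values.

```javascript
const MAX_ATTR_VALUE_LENGTH = 1024; // assumed limit
const SENSITIVE_KEY = /token|secret|password|authorization/i; // hypothetical pattern

// Hypothetical sanitizer: redacts sensitive keys, truncates long string values.
function sanitizeAttrs(attrs) {
  return attrs.map(attr => {
    if (SENSITIVE_KEY.test(attr.key)) {
      return { key: attr.key, value: { stringValue: "[REDACTED]" } };
    }
    const str = attr.value && attr.value.stringValue;
    if (typeof str === "string" && str.length > MAX_ATTR_VALUE_LENGTH) {
      return { key: attr.key, value: { stringValue: str.slice(0, MAX_ATTR_VALUE_LENGTH) } };
    }
    return attr;
  });
}

// Hypothetical per-span step: covers span attributes and event attributes alike.
function sanitizeSpan(span) {
  const out = { ...span };
  if (Array.isArray(span.attributes)) out.attributes = sanitizeAttrs(span.attributes);
  if (Array.isArray(span.events)) {
    out.events = span.events.map(ev =>
      // Events without attributes pass through unchanged.
      Array.isArray(ev.attributes) ? { ...ev, attributes: sanitizeAttrs(ev.attributes) } : ev
    );
  }
  return out;
}

const span = {
  name: "job",
  attributes: [{ key: "api_token", value: { stringValue: "abc" } }],
  events: [
    {
      name: "exception",
      attributes: [
        { key: "auth_token", value: { stringValue: "s3cr3t" } },
        { key: "exception.message", value: { stringValue: "x".repeat(2000) } },
      ],
    },
    { name: "marker" },
  ],
};
const clean = sanitizeSpan(span);
console.log(JSON.stringify(clean.events[0].attributes[0]));
```

The sketch returns new objects instead of mutating the input, which keeps the original payload available for comparison in tests.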

Show a summary per file

File	Description
actions/setup/js/send_otlp_span.cjs	Adds event-attribute sanitization and emits `exception.type` for exception span events.
actions/setup/js/send_otlp_span.test.cjs	Adds tests for event attribute sanitization and exception type extraction/fallback cases.

Copilot's findings

Files reviewed: 2/2 changed files
Comments generated: 1

Copilot · 2026-04-13T02:37:14Z

actions/setup/js/send_otlp_span.cjs

+          // Extract colon-prefixed type when available ("push_to_pull_request_branch:...")
+          const colonIdx = msg.indexOf(":");
+          const exceptionType = colonIdx > 0 && colonIdx < 64 && /^[a-z_][a-z0-9_.]*$/i.test(msg.slice(0, colonIdx)) ? `gh-aw.${msg.slice(0, colonIdx)}` : "gh-aw.AgentError";
+          const exceptionMessage = (colonIdx > 0 && exceptionType !== "gh-aw.AgentError" ? msg.slice(colonIdx + 1).trim() : msg).slice(0, MAX_ATTR_VALUE_LENGTH);


The prefix validation regex is case-insensitive (/i), but the PR description/spec says the identifier must match ^[a-z_][a-z0-9_.]*$ (lowercase only). As-is, messages like Push_To_PR:... would be treated as a valid type and emitted as gh-aw.Push_To_PR, which can lead to inconsistent exception.type values and diverges from the documented behavior. Consider removing the i flag (or normalizing the extracted prefix to lowercase before validation/emission) and adding a test to cover uppercase prefixes (either expected rejection or normalization).
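
The two options in this comment behave differently for mixed-case prefixes. The sketch below contrasts them; typeStrict and typeNormalized are hypothetical names used only for this comparison and do not appear in the PR.

```javascript
const PREFIX = /^[a-z_][a-z0-9_.]*$/; // lowercase-only, no `i` flag
const FALLBACK_TYPE = "gh-aw.AgentError";

// Option A: drop the `i` flag, so mixed-case prefixes are rejected.
function typeStrict(msg) {
  const idx = msg.indexOf(":");
  const prefix = msg.slice(0, Math.max(idx, 0));
  return idx > 0 && idx < 64 && PREFIX.test(prefix) ? `gh-aw.${prefix}` : FALLBACK_TYPE;
}

// Option B: lowercase the prefix first, then validate and emit it.
function typeNormalized(msg) {
  const idx = msg.indexOf(":");
  const prefix = msg.slice(0, Math.max(idx, 0)).toLowerCase();
  return idx > 0 && idx < 64 && PREFIX.test(prefix) ? `gh-aw.${prefix}` : FALLBACK_TYPE;
}

console.log(typeStrict("Push_To_PR:denied"));     // gh-aw.AgentError
console.log(typeNormalized("Push_To_PR:denied")); // gh-aw.push_to_pr
```

Either option yields a single exception.type value per logical error; they differ only in whether a mixed-case prefix is classified or falls back.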

github-actions · 2026-04-13T02:41:00Z

🧪 Test Quality Sentinel Report

Test Quality Score: 90/100

✅ Excellent test quality

Metric	Value
New/modified tests analyzed	7 (new functions) + 3 modified
✅ Design tests (behavioral contracts)	7 (100%)
⚠️ Implementation tests (low value)	0 (0%)
Tests with error/edge cases	7 (100%)
Duplicate test clusters	0
Test inflation detected	✅ Yes (148 test lines / 24 production lines = 6.17:1)
🚨 Coding-guideline violations	0

Test Classification Details

📋 All 7 new test functions

Test	File	Classification	Issues Detected
`redacts sensitive keys in span event attributes`	`send_otlp_span.test.cjs`	✅ Design	None — asserts both redaction and pass-through. Has assertion messages.
`truncates long string values in span event attributes`	`send_otlp_span.test.cjs`	✅ Design	None — tests the 1024-char truncation boundary.
`preserves span events without attributes unchanged`	`send_otlp_span.test.cjs`	✅ Design	Minor: second `expect` lacks an assertion message.
`extracts exception.type from colon-prefixed error messages`	`send_otlp_span.test.cjs`	✅ Design	Minor: most `expect()` calls lack assertion messages.
`falls back to gh-aw.AgentError when message has no colon prefix`	`send_otlp_span.test.cjs`	✅ Design	Minor: `expect()` calls lack assertion messages.
`falls back to gh-aw.AgentError when colon prefix contains invalid characters`	`send_otlp_span.test.cjs`	✅ Design	Minor: `expect()` calls lack assertion messages.
`falls back to gh-aw.AgentError when colon prefix exceeds 64 characters`	`send_otlp_span.test.cjs`	✅ Design	Minor: only checks `exception.type`; does not assert the full original message is preserved (see below).

Modified tests (3 existing tests had exception.type assertions added):

"handles agent error events" — two events now also assert exception.type === "gh-aw.AgentError" ✅
"caps agent errors at 7 events" — loop now also asserts exception.type for all 7 ✅

Flagged Tests — Minor Issues Only

⚠️ `falls back to gh-aw.AgentError when colon prefix exceeds 64 characters`

Classification: Design test (passes)
Issue: The test asserts only on exception.type and does not assert that exception.message contains the full original message (${longPrefix}:some error). The production code comment says "Full original message kept when type extraction fails," and test 6 checks this for the invalid-characters case, but test 7 doesn't verify the same invariant for the length-exceeded case.
Suggested improvement: Add an assertion that msgAttr.value.stringValue equals the full original message when the prefix is too long.

const msgAttr = span.events[0].attributes.find(a => a.key === "exception.message");
expect(msgAttr.value.stringValue, "full message preserved when prefix exceeds 64 chars")
  .toBe(`${longPrefix}:some error`);

⚠️ Missing assertion messages (tests 4–7, partially 3)

Classification: Guideline note
Issue: Several expect() calls in the new sendJobConclusionSpan tests don't include a description string (e.g., expect(typeAttr.value.stringValue).toBe("gh-aw.AgentError") rather than expect(typeAttr.value.stringValue, "context").toBe(...)). Tests 1–2 model the correct pattern. Not a blocking issue, but adds diagnostic value when failures occur in CI.

⚠️ Test inflation (6.17:1)

Classification: Metric flag — not a quality concern in context
Explanation: 148 test lines were added vs 24 production lines. This triggers the mechanical inflation flag, but it is expected here: the production code adds a new branch (event-attribute sanitization + type extraction) with several distinct fallback conditions that genuinely require separate test cases. The ratio is high because the new tests cover previously-untested code paths rather than inflating coverage of unchanged code.

Language Support

Tests analyzed:

🐹 Go (*_test.go): 0 tests (no Go test files changed)
🟨 JavaScript (*.test.cjs): 7 new tests, 3 modified (vitest)

Score Breakdown

Component	Points	Notes
Behavioral Coverage (40 pts)	40	All 7 new tests are design tests
Error/Edge Case Coverage (30 pts)	30	All 7 include error paths or boundary conditions
Low Duplication (20 pts)	20	No duplicate clusters detected
Proportional Growth (10 pts)	0	Inflation flag triggered (6.17:1); context-appropriate but mechanically penalized
Total	90
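
The total and the inflation ratio can be rechecked from the figures in this report. The snippet below is only an arithmetic check; the variable names are made up for the example, and it does not reflect how the Sentinel computes its score internally.

```javascript
// Points per component, copied from the Score Breakdown table.
const components = [
  { name: "Behavioral Coverage", max: 40, points: 40 },
  { name: "Error/Edge Case Coverage", max: 30, points: 30 },
  { name: "Low Duplication", max: 20, points: 20 },
  { name: "Proportional Growth", max: 10, points: 0 },
];

const total = components.reduce((sum, c) => sum + c.points, 0);
const maxTotal = components.reduce((sum, c) => sum + c.max, 0);

// Test lines added divided by production lines added, as reported above.
const inflation = (148 / 24).toFixed(2);

console.log(`${total}/${maxTotal}, inflation ${inflation}:1`); // 90/100, inflation 6.17:1
```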

Verdict

✅ Check passed. 0% of new tests are implementation tests (threshold: 30%). All 7 new tests verify observable outputs and behavioral contracts of the new exception.type extraction logic and event-attribute sanitization.

📖 Understanding Test Classifications

Design Tests (High Value) verify what the system does:

Assert on observable outputs, return values, or state changes
Cover error paths and boundary conditions
Would catch a behavioral regression if deleted
Remain valid even after internal refactoring

Implementation Tests (Low Value) verify how the system does it:

Assert on internal function calls (mocking internals)
Only test the happy path with typical inputs
Break during legitimate refactoring even when behavior is correct
Give false assurance: they pass even when the system is wrong

Goal: Shift toward tests that describe the system's behavioral contract — the promises it makes to its users and collaborators.

🧪 Test quality analysis by Test Quality Sentinel

github-actions

✅ Test Quality Sentinel: 90/100. Test quality is excellent — 0% of new tests are implementation tests (threshold: 30%). All 7 new tests verify behavioral contracts of the exception.type extraction and event-attribute sanitization features. Minor suggestions posted in the comment above.

pelikhan · 2026-04-13T03:25:53Z

@copilot review all comments

…ping

Addresses review feedback: the regex accepts mixed-case prefixes but the extracted type is now lowercased before emission, ensuring consistent exception.type values (e.g., "Push_To_PR:..." → "gh-aw.push_to_pr"). Adds a test covering uppercase prefix normalization.

Agent-Logs-Url: https://github.com/github/gh-aw/sessions/d1e40e6f-5461-4f60-a911-7dda914b4b87
Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Copilot · 2026-04-13T03:31:02Z

@copilot review all comments

Addressed the review feedback about the case-insensitive regex in 1ed0c1d — the extracted prefix is now normalized to lowercase before emission (e.g., Push_To_PR:... → gh-aw.push_to_pr), ensuring consistent exception.type values for backend grouping. Added a test covering uppercase prefix normalization.

Initial plan

70448f7

Copilot AI assigned Copilot and pelikhan Apr 13, 2026

Copilot AI linked an issue Apr 13, 2026 that may be closed by this pull request

[otel-advisor] OTel improvement: add exception.type to exception span events to enable error classification #25937

Closed

1 task

Copilot started work on behalf of pelikhan April 13, 2026 02:19

Copilot AI changed the title ~~[WIP] Add exception.type to exception span events for error classification~~ Add exception.type to OTel exception span events; sanitize event attributes Apr 13, 2026

Copilot finished work on behalf of pelikhan April 13, 2026 02:28

Copilot AI requested a review from pelikhan April 13, 2026 02:29

pelikhan marked this pull request as ready for review April 13, 2026 02:34

Copilot AI review requested due to automatic review settings April 13, 2026 02:34

Copilot started reviewing on behalf of pelikhan April 13, 2026 02:35

Copilot AI reviewed Apr 13, 2026

github-actions bot mentioned this pull request Apr 13, 2026

[aw] No-Op Runs #25214

Open

github-actions bot approved these changes Apr 13, 2026

Copilot started work on behalf of pelikhan April 13, 2026 03:26

Copilot finished work on behalf of pelikhan April 13, 2026 03:31

pelikhan merged commit d1c210e into main Apr 13, 2026

pelikhan deleted the copilot/otel-advisor-add-exception-type branch April 13, 2026 04:04

github-actions bot mentioned this pull request Apr 13, 2026

[code-simplifier] refactor: extract hasValidPrefix boolean in exception type extraction (#25972 follow-up) #25989

Draft

pelikhan merged 3 commits into main from copilot/otel-advisor-add-exception-type
