perf(contribution-check): cut token/tool overhead per #5558 by lpcox · Pull Request #5576 · github/gh-aw-firewall

lpcox · 2026-06-26T15:52:33Z

Implements the actionable recommendations from #5558 to reduce the token/tool overhead of the Contribution Check workflow (ranked #1 by total AIC). All review data is pre-fetched in steps:, so the agent only needs to read three context files and emit a single add_comment (or noop).

Changes

#	Recommendation	Applied
2	Stop loading GitHub tools	✅ `tools.github: false` (see note)
3	`max-turns` 5 → 3	✅
4	`strict` false → true	✅
5	Tighten prompt's tool constraint	✅

⚠️ Important correction to recommendation #2

The issue suggested removing the tools: block entirely. That backfires on this gh-aw version: with no explicit tools.github, gh-aw auto-injects a read-only GitHub MCP server with a broader toolset (context,repos,issues,pull_requests) than the original gh-proxy/[pull_requests] config — adding tool schemas and a github-mcp-server container, the opposite of the goal.

The correct way to drop GitHub tools is the explicit github: false (same pattern as doc-maintainer.md). This removes the github-mcp-server and cli-proxy containers and eliminates the ~5.3 stray GitHub API calls/run. The compiled lock file shrinks by ~150 lines (86 KB → 79 KB). edit: is kept so the agent can read the pre-fetched files; the safeoutputs MCP that backs add_comment is unaffected.

strict mode

Enabling strict: true required removing the internal sandbox.mcp.version: "latest" key, which strict mode disallows (it's an internal implementation detail).

Verification

gh aw compile contribution-check → 0 errors, 0 warnings.
Lock no longer contains github-mcp-server, GITHUB_TOOLSETS, or a cli-proxy container; safeoutputs MCP + add_comment preserved.
scripts/ci/contribution-check-workflow.test.ts updated (max-turns: 5 → 3) and passing.

Risk note (max-turns)

max-turns: 3 maps to the api-proxy maxRuns: 3 hard cap. If the agent ever needs a 4th LLM invocation it will receive the terminal 403 (max_runs_exceeded) that surfaces as a misleading "authentication failed" engine error (cf. #5552). Removing GitHub tools frees up the turns previously wasted on stray gh calls, so the linear read→comment task should fit comfortably in 3 — but a live test PR (per the issue's checklist) is the right way to confirm before relying on it.

Out of scope

Recommendation #1 (all Copilot runs report null token_usage) is a separate, systemic api-proxy telemetry-export investigation, not a per-workflow change. Not addressed here.

Refs #5558

Token-optimization pass on the Contribution Check workflow (ranked #1 by total AIC). All review data is already pre-fetched in `steps:`, so the agent only needs to read three context files and emit a single `add_comment` (or noop) safe-output. Changes: - tools: disable GitHub tools (`github: false`) instead of the previous `gh-proxy`/`pull_requests` config. NOTE: simply *removing* the tools block makes gh-aw auto-inject a read-only GitHub MCP server with a *broader* toolset (context,repos,issues,pull_requests), which is the opposite of the optimization goal. Explicit `github: false` drops the github-mcp-server and cli-proxy containers entirely and prevents the ~5.3 stray GitHub API calls/run the agent was making despite the prompt forbidding them. (lock file shrinks ~150 lines.) Keep `edit:` so the agent can read the pre-fetched files. - strict: false -> true (also required removing the internal `sandbox.mcp.version` key, which strict mode disallows). - max-turns: 5 -> 3 (linear task: read files -> compare -> comment). - Prompt: add an explicit "only add_comment or noop" tool constraint. - Recompiled contribution-check.lock.yml; updated the workflow test's max-turns assertion 5 -> 3. Out of scope: recommendation #1 (all Copilot runs report null token_usage) is a separate, systemic api-proxy telemetry-export investigation, not a per-workflow change. Refs #5558 Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

github-actions · 2026-06-26T15:54:18Z

✅ Coverage Check Passed

Overall Coverage

Metric	Base	PR	Delta
Lines	98.24%	98.28%	📈 +0.04%
Statements	98.17%	98.21%	📈 +0.04%
Functions	99.53%	99.53%	➡️ +0.00%
Branches	94.00%	94.00%	➡️ +0.00%

📁 Per-file Coverage Changes (1 files)

File	Lines (Before → After)	Statements (Before → After)
`src/workdir-setup.ts`	92.7% → 94.5% (+1.82%)	92.7% → 94.5% (+1.82%)

Coverage comparison generated by scripts/ci/compare-coverage.ts

Copilot

Pull request overview

Reduces Copilot token/tool overhead in the Contribution Check agentic workflow by removing GitHub MCP tooling from the agent runtime, tightening execution constraints, and updating the compiled lock + CI test to match.

Changes:

Lowered the agent turn cap from 5 → 3 and enabled strict: true for tighter runtime behavior.
Disabled GitHub MCP tools for the agent via tools.github: false while keeping tools.edit so the agent can read the pre-fetched context files.
Tightened the prompt to explicitly limit post-read tool usage to add_comment (max 1) or noop, and updated the workflow test + regenerated the lock file accordingly.

Show a summary per file

File	Description
scripts/ci/contribution-check-workflow.test.ts	Updates the workflow guard test to expect `max-turns: 3`.
.github/workflows/contribution-check.md	Applies the workflow-level optimization knobs (turn cap, strict mode, tool disablement) and strengthens the prompt’s tool-use constraints.
.github/workflows/contribution-check.lock.yml	Regenerates the compiled lock to reflect the updated workflow configuration and removed GitHub MCP/cli-proxy components.

Review details

Tip

Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Files reviewed: 3/3 changed files
Comments generated: 0
Review effort level: Low

github-actions · 2026-06-26T16:38:31Z

✅ Smoke Copilot BYOK AOAI (Entra) completed. Copilot AOAI BYOK (Entra) mode operational. 🔓

github-actions · 2026-06-26T16:38:31Z

📡 Smoke OTel Tracing completed. All tracing scenarios validated. ✅

github-actions · 2026-06-26T16:38:35Z

✅ Build Test Suite completed successfully!

github-actions · 2026-06-26T16:38:35Z

✅ Smoke Claude passed

github-actions · 2026-06-26T16:38:36Z

📰 VERDICT: Smoke Copilot has concluded. All systems operational. This is a developing story. 🎤

github-actions · 2026-06-26T16:38:37Z

🔑 Smoke Copilot PAT PAT auth validated. All systems operational. ✅

github-actions · 2026-06-26T16:38:37Z

🚀 Security Guard has started processing this pull request

github-actions · 2026-06-26T16:38:38Z

✨ The prophecy is fulfilled... Smoke Codex has completed its mystical journey. The stars align. 🌟

github-actions · 2026-06-26T16:38:41Z

✅ Smoke Copilot BYOK AOAI (api-key) completed. Copilot AOAI BYOK (api-key) mode operational. 🔓

github-actions · 2026-06-26T16:38:46Z

✅ Smoke Copilot BYOK completed. Copilot BYOK mode operational. 🔓

github-actions · 2026-06-26T16:38:47Z

🔌 Smoke Services — All services reachable! ✅

github-actions · 2026-06-26T16:38:48Z

✅ Smoke Gemini completed. All facets verified. 💎

github-actions · 2026-06-26T16:38:48Z

Chroot tests passed! Smoke Chroot - All security and functionality tests succeeded.

github-actions · 2026-06-26T16:39:09Z

❌ Contribution Check failed. Please review the logs for details.

github-actions · 2026-06-26T16:41:36Z

Smoke Test: Claude Engine Validation

API check: ✅ PASS
gh check: ✅ PASS
File check: ✅ PASS

Overall result: PASS

Generated by Smoke Claude for issue #5576 · 37.2 AIC · ⊞ 3.3K · ◷

github-actions · 2026-06-26T16:41:58Z

🔥 Smoke Test: Copilot PAT — PASS

Test	Result
GitHub MCP connectivity	✅
GitHub.com HTTP	✅ 200
File write/read	✅

Overall: PASS · Auth mode: PAT (COPILOT_GITHUB_TOKEN)

cc @lpcox

🔑 PAT report filed by Smoke Copilot PAT

github-actions · 2026-06-26T16:42:19Z

🔬 Smoke Test Results

Test	Status
GitHub MCP connectivity	✅ PASS
GitHub.com HTTP	✅ PASS (200)
File write/read	⚠️ N/A (pre-step data unavailable)

PR: perf(contribution-check): cut token/tool overhead per #5558
Author: @lpcox

Overall: PASS ✅

📰 BREAKING: Report filed by Smoke Copilot

github-actions · 2026-06-26T16:42:47Z

Smoke Test: Copilot BYOK (Direct) Mode ✅

PASS — All smoke tests confirmed.

✅ GitHub MCP connectivity verified (2 recent closed PRs)
✅ BYOK inference path working (agent → api-proxy sidecar → api.githubcopilot.com)

Running in direct BYOK mode via COPILOT_PROVIDER_API_KEY.

🔑 BYOK report filed by Smoke Copilot BYOK

github-actions · 2026-06-26T16:42:52Z

Smoke test summary

fix: propagate apiProxy.auth OIDC config fields to all layers ✅
[Test Coverage] security: test coverage for compose-sanitizer, domain-validation, and domain-matchers ✅
GitHub read checks ✅
Playwright GitHub title ✅
File write/read ✅
Build (npm ci && npm run build) ✅
Overall: PASS

Warning

Firewall blocked 1 domain

The following domain was blocked by the firewall during workflow execution:

registry.npmjs.org

To allow these domains, add them to the network.allowed list in your workflow frontmatter:

network:
  allowed:
    - defaults
    - "registry.npmjs.org"

See Network Configuration for more information.

🔮 The oracle has spoken through Smoke Codex

github-actions · 2026-06-26T16:43:09Z

🔬 Smoke Test: API Proxy OpenTelemetry Tracing

Scenario	Result	Detail
1. Module Loading	✅	`otel.js` loads cleanly; exports `startRequestSpan`, `setTokenAttributes`, `setBudgetAttributes`, `endSpan`, `endSpanError`, `shutdown`, `isEnabled` + test helpers. `isEnabled()` returns `true` (FileSpanExporter fallback active when no OTLP endpoint set).
2. Test Suite	✅	`otel.test.js`: 39/39 passed; `otel-fanout.test.js`: 20/20 passed (59 total, 0 failures).
3. Env Var Forwarding	✅	`src/services/api-proxy-env-config.ts` forwards `GH_AW_OTLP_ENDPOINTS`, `OTEL_EXPORTER_OTLP_ENDPOINT`, `OTEL_EXPORTER_OTLP_HEADERS`, `GITHUB_AW_OTEL_TRACE_ID`, `GITHUB_AW_OTEL_PARENT_SPAN_ID`, and `OTEL_SERVICE_NAME` (default: `awf-api-proxy`) to the api-proxy container.
4. Token Tracker Integration	✅	`token-tracker-http.js` exposes `onUsage` callback (line 283/324); called after normalized usage extraction — OTEL hook point confirmed.
5. OTEL Diagnostics	⚪	No spans exported — api-proxy container not started in this sandbox (Docker-in-Docker not supported). Expected for static analysis runs; spans will appear in live integration tests.

Summary: All functional scenarios pass. Scenario 5 is a runtime-only check that requires a live container; its absence here is expected.

📡 OTel tracing validated by Smoke OTel Tracing

github-actions · 2026-06-26T16:43:22Z

@lpcox
GitHub MCP PRs: perf(contribution-check): cut token/tool overhead per #5558; Split squid config tests by concern ✅
GitHub.com connectivity: ✅
File write/read: ✅
Direct BYOK inference: ✅

Running in direct BYOK mode (AWF_AUTH_TYPE=github-oidc + AWF_AUTH_AZURE_* + COPILOT_PROVIDER_BASE_URL) via api-proxy → Azure OpenAI (Foundry, o4-mini-aw) authenticated via Microsoft Entra

Overall: PASS

🪪 BYOK (AOAI Entra) report filed by Smoke Copilot BYOK AOAI (Entra)

github-actions · 2026-06-26T16:43:39Z

Chroot Smoke Test Results

Runtime	Host Version	Chroot Version	Match?
Python	`Python 3.12.13`	`Python 3.12.3`	❌
Node.js	`v24.17.0`	`v22.23.0`	❌
Go	`go1.22.12`	`go1.22.12`	✅

Overall: ❌ Not all tests passed — Python and Node.js versions differ between host and chroot.

Tested by Smoke Chroot

github-actions · 2026-06-26T16:44:23Z

🏗️ Build Test Suite Results

Ecosystem	Project	Build/Install	Tests	Status
Bun	elysia	✅	1/1 passed	✅ PASS
Bun	hono	✅	1/1 passed	✅ PASS
C++	fmt	✅	N/A	✅ PASS
C++	json	✅	N/A	✅ PASS
Deno	oak	N/A	1/1 passed	✅ PASS
Deno	std	N/A	1/1 passed	✅ PASS
.NET	hello-world	✅	N/A	✅ PASS
.NET	json-parse	✅	N/A	✅ PASS
Go	color	✅	1/1 passed	✅ PASS
Go	env	✅	1/1 passed	✅ PASS
Go	uuid	✅	1/1 passed	✅ PASS
Java	gson	✅	1/1 passed	✅ PASS
Java	caffeine	✅	1/1 passed	✅ PASS
Node.js	clsx	✅	all passed	✅ PASS
Node.js	execa	✅	all passed	✅ PASS
Node.js	p-limit	✅	all passed	✅ PASS
Rust	fd	✅	1/1 passed	✅ PASS
Rust	zoxide	✅	1/1 passed	✅ PASS

Overall: 8/8 ecosystems passed — ✅ PASS

Generated by Build Test Suite for issue #5576 · 35.9 AIC · ⊞ 7.8K · ◷

github-actions · 2026-06-26T16:44:29Z

Smoke Test Results

❌ GitHub MCP Testing (Tools not found)
❌ GitHub.com Connectivity (Connection blocked)
✅ File Writing Testing
✅ Bash Tool Testing

Overall status: FAIL

Warning

Firewall blocked 1 domain

The following domain was blocked by the firewall during workflow execution:

localhost

To allow these domains, add them to the network.allowed list in your workflow frontmatter:

network:
  allowed:
    - defaults
    - "localhost"

See Network Configuration for more information.

💎 Faceted by Smoke Gemini

github-actions · 2026-06-26T16:44:54Z

perf(contribution-check): cut token/tool overhead per #5558
Split squid config tests by concern
GitHub MCP connectivity: ✅
github.com connectivity: ✅
File I/O: ✅
BYOK inference: ✅
Running in direct BYOK mode (COPILOT_PROVIDER_API_KEY + COPILOT_PROVIDER_BASE_URL) via api-proxy → Azure OpenAI (Foundry, o4-mini-aw)
Overall: PASS

@lpcox

🔑 BYOK (AOAI api-key) report filed by Smoke Copilot BYOK AOAI (api-key)

github-actions · 2026-06-26T16:45:12Z

Smoke Test Results — Services Connectivity

Check	Result
Redis PING	❌ Timeout (RC=124)
PostgreSQL pg_isready	❌ No response
PostgreSQL SELECT 1	❌ Timeout (RC=124)

Overall: FAIL — host.docker.internal is unreachable from this sandbox. All three connections timed out with no response.

🔌 Service connectivity validated by Smoke Services

Copilot AI review requested due to automatic review settings June 26, 2026 15:52

Copilot started reviewing on behalf of lpcox June 26, 2026 15:52 View session

Copilot AI reviewed Jun 26, 2026

View reviewed changes

lpcox added the ready-for-aw label Jun 26, 2026

lpcox temporarily deployed to aoai-model June 26, 2026 16:38 — with GitHub Actions Inactive

github-actions Bot added the smoke-claude label Jun 26, 2026

github-actions Bot mentioned this pull request Jun 26, 2026

[aw] Contribution Check failed #5579

Closed

github-actions Bot added the smoke-copilot-pat label Jun 26, 2026

github-actions Bot added the smoke-copilot label Jun 26, 2026

lpcox temporarily deployed to aoai-model June 26, 2026 16:42 — with GitHub Actions Inactive

github-actions Bot added the smoke-copilot-byok label Jun 26, 2026

github-actions Bot added the smoke-codex label Jun 26, 2026

github-actions Bot added the smoke-copilot-byok-aoai-entra label Jun 26, 2026

lpcox temporarily deployed to aoai-model June 26, 2026 16:43 — with GitHub Actions Inactive

github-actions Bot mentioned this pull request Jun 26, 2026

[aw] Smoke Codex is missing required tool #5580

Closed

github-actions Bot added the build-test label Jun 26, 2026

github-actions Bot added the smoke-copilot-byok-aoai-apikey label Jun 26, 2026

lpcox merged commit 3378a60 into main Jun 26, 2026
88 of 90 checks passed

lpcox deleted the perf/contribution-check-token-optim-5558 branch June 26, 2026 18:17

github-actions Bot mentioned this pull request Jun 26, 2026

fix(test): sync doc-maintainer test with max-turns 15 + prompt rewrite #5587

Merged

Uh oh!

Conversation

lpcox commented Jun 26, 2026

Changes

⚠️ Important correction to recommendation #2

strict mode

Verification

Risk note (max-turns)

Out of scope

Uh oh!

github-actions Bot commented Jun 26, 2026

✅ Coverage Check Passed

Overall Coverage

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Review details

Uh oh!

github-actions Bot commented Jun 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions Bot commented Jun 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions Bot commented Jun 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions Bot commented Jun 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions Bot commented Jun 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions Bot commented Jun 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions Bot commented Jun 26, 2026

Uh oh!

github-actions Bot commented Jun 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions Bot commented Jun 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions Bot commented Jun 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions Bot commented Jun 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions Bot commented Jun 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions Bot commented Jun 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions Bot commented Jun 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions Bot commented Jun 26, 2026

Smoke Test: Claude Engine Validation

Uh oh!

github-actions Bot commented Jun 26, 2026

🔥 Smoke Test: Copilot PAT — PASS

Uh oh!

github-actions Bot commented Jun 26, 2026

🔬 Smoke Test Results

Uh oh!

github-actions Bot commented Jun 26, 2026

Smoke Test: Copilot BYOK (Direct) Mode ✅

Uh oh!

github-actions Bot commented Jun 26, 2026

Uh oh!

github-actions Bot commented Jun 26, 2026

🔬 Smoke Test: API Proxy OpenTelemetry Tracing

Uh oh!

github-actions Bot commented Jun 26, 2026

Uh oh!

github-actions Bot commented Jun 26, 2026

github-actions Bot commented Jun 26, 2026 •

edited

Loading

github-actions Bot commented Jun 26, 2026 •

edited

Loading

github-actions Bot commented Jun 26, 2026 •

edited

Loading

github-actions Bot commented Jun 26, 2026 •

edited

Loading

github-actions Bot commented Jun 26, 2026 •

edited

Loading

github-actions Bot commented Jun 26, 2026 •

edited

Loading

github-actions Bot commented Jun 26, 2026 •

edited

Loading

github-actions Bot commented Jun 26, 2026 •

edited

Loading

github-actions Bot commented Jun 26, 2026 •

edited

Loading

github-actions Bot commented Jun 26, 2026 •

edited

Loading

github-actions Bot commented Jun 26, 2026 •

edited

Loading

github-actions Bot commented Jun 26, 2026 •

edited

Loading

github-actions Bot commented Jun 26, 2026 •

edited

Loading