Skip to content

fix: recompile red-team-benchmark lock file#4191

Merged
lpcox merged 1 commit into
mainfrom
fix/recompile-red-team-benchmark
Jun 2, 2026
Merged

fix: recompile red-team-benchmark lock file#4191
lpcox merged 1 commit into
mainfrom
fix/recompile-red-team-benchmark

Conversation

@lpcox
Copy link
Copy Markdown
Collaborator

@lpcox lpcox commented Jun 2, 2026

The activation job fails with:

ERR_CONFIG: Lock file '.github/workflows/red-team-benchmark.lock.yml' is outdated!

PR #4187 modified the .md source (added mounts and env vars) but only manually patched one line in the lock file, leaving the frontmatter hash stale. This recompiles properly.

Failing run: https://github.com/github/gh-aw-firewall/actions/runs/26799108413

The lock file was stale after PR #4187 changed the .md source without
a full recompile (frontmatter hash mismatch).

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
Copilot AI review requested due to automatic review settings June 2, 2026 04:55
@lpcox lpcox merged commit edcada6 into main Jun 2, 2026
43 of 44 checks passed
@lpcox lpcox deleted the fix/recompile-red-team-benchmark branch June 2, 2026 04:56
@github-actions
Copy link
Copy Markdown
Contributor

github-actions Bot commented Jun 2, 2026

✅ Coverage Check Passed

Overall Coverage

Metric Base PR Delta
Lines 96.35% 96.40% 📈 +0.05%
Statements 96.21% 96.25% 📈 +0.04%
Functions 98.27% 98.27% ➡️ +0.00%
Branches 90.42% 90.46% 📈 +0.04%
📁 Per-file Coverage Changes (1 files)
File Lines (Before → After) Statements (Before → After)
src/config-writer.ts 89.3% → 90.9% (+1.65%) 89.3% → 90.9% (+1.65%)

Coverage comparison generated by scripts/ci/compare-coverage.ts

@github-actions
Copy link
Copy Markdown
Contributor

github-actions Bot commented Jun 2, 2026

Smoke Test: Claude Engine

Result: PASS

💥 [THE END] — Illustrated by Smoke Claude

@github-actions
Copy link
Copy Markdown
Contributor

github-actions Bot commented Jun 2, 2026

fix: recompile red-team-benchmark lock file
fix(benchmark): mount adversarial_dojo into AWF container and pass API keys
✅ GitHub PR query
✅ Playwright
✅ File write
✅ Discussion
✅ Build
PASS

Warning

Firewall blocked 1 domain

The following domain was blocked by the firewall during workflow execution:

  • registry.npmjs.org

To allow these domains, add them to the network.allowed list in your workflow frontmatter:

network:
  allowed:
    - defaults
    - "registry.npmjs.org"

See Network Configuration for more information.

🔮 The oracle has spoken through Smoke Codex

@github-actions
Copy link
Copy Markdown
Contributor

github-actions Bot commented Jun 2, 2026

Smoke Test: Copilot BYOK (Offline) Mode

Test Result
GitHub MCP connectivity ✅ PR listed successfully
GitHub.com HTTP ⚠️ Pre-step data unavailable (template not expanded)
File write/read ⚠️ Pre-step data unavailable (template not expanded)
BYOK inference (agent → api-proxy → api.githubcopilot.com) ✅ Responding now

Running in BYOK offline mode (COPILOT_OFFLINE=true) via api-proxy → api.githubcopilot.com.
Author: @lpcox | Reviewer: @Copilot

Overall: PARTIAL — BYOK path confirmed ✅; pre-computed step data was not injected into the prompt.

🔑 BYOK report filed by Smoke Copilot BYOK

@github-actions
Copy link
Copy Markdown
Contributor

github-actions Bot commented Jun 2, 2026

🔬 Smoke Test Results

Test Status
GitHub MCP connectivity ✅ PR fetched: "fix: recompile red-team-benchmark lock file"
GitHub.com HTTP connectivity ⚠️ Pre-step data unavailable (template vars unexpanded)
File write/read ⚠️ Pre-step data unavailable (template vars unexpanded)

Overall: PARTIAL — MCP test passed; pre-computed test data was not injected (workflow template variables unexpanded).

Author: @lpcox

📰 BREAKING: Report filed by Smoke Copilot

Copy link
Copy Markdown
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Regenerates the compiled Agentic Workflows lock file for the weekly Red-Team Benchmark so the workflow passes the “stale lock file” validation and reflects the current compiled output.

Changes:

  • Recompiled .github/workflows/red-team-benchmark.lock.yml, updating gh-aw-metadata (frontmatter hash) and embedded generated blocks.
  • Updated the lock workflow’s pinned action manifest and step graph (e.g., new checkout sparse-checkout list, added setup steps, updated AWF install step, and adjusted AWF execution flags) as produced by the compiler.
Show a summary per file
File Description
.github/workflows/red-team-benchmark.lock.yml Regenerated compiled lock workflow to refresh metadata/hash and align the workflow content with the current compilation output.

Copilot's findings

Tip

Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

  • Files reviewed: 1/1 changed files
  • Comments generated: 1

Comment on lines +380 to +381
- name: Setup uv
uses: astral-sh/setup-uv@08807647e7069bb48b6ef5acd8ec9567f424441b # v8.1.0
@github-actions
Copy link
Copy Markdown
Contributor

github-actions Bot commented Jun 2, 2026

🏗️ Build Test Suite Results

Ecosystem Project Build/Install Tests Status
Bun elysia 1/1 passed ✅ PASS
Bun hono 1/1 passed ✅ PASS
C++ fmt N/A ✅ PASS
C++ json N/A ✅ PASS
Deno oak N/A 1/1 passed ✅ PASS
Deno std N/A 1/1 passed ✅ PASS
.NET hello-world N/A ✅ PASS
.NET json-parse N/A ✅ PASS
Go color 1/1 passed ✅ PASS
Go env 1/1 passed ✅ PASS
Go uuid 1/1 passed ✅ PASS
Java gson 1/1 passed ✅ PASS
Java caffeine 1/1 passed ✅ PASS
Node.js clsx all passed ✅ PASS
Node.js execa all passed ✅ PASS
Node.js p-limit all passed ✅ PASS
Rust fd 1/1 passed ✅ PASS
Rust zoxide 1/1 passed ✅ PASS

Overall: 8/8 ecosystems passed — ✅ PASS

Generated by Build Test Suite for issue #4191 · sonnet46 870.4K ·

@github-actions
Copy link
Copy Markdown
Contributor

github-actions Bot commented Jun 2, 2026

Smoke test passed

Warning

Firewall blocked 1 domain

The following domain was blocked by the firewall during workflow execution:

  • localhost

To allow these domains, add them to the network.allowed list in your workflow frontmatter:

network:
  allowed:
    - defaults
    - "localhost"

See Network Configuration for more information.

💎 Faceted by Smoke Gemini

@github-actions
Copy link
Copy Markdown
Contributor

github-actions Bot commented Jun 2, 2026

Smoke Test Results: FAIL ❌

Check Result
Redis PING ❌ Timeout/no response
PostgreSQL pg_isready ❌ No response on port 5432
PostgreSQL SELECT 1 ❌ Timeout/no response

host.docker.internal is not reachable from this environment. All three connectivity checks failed.

🔌 Service connectivity validated by Smoke Services

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants