fix(cooldown): close silent-bypass regressions from #25, #26, #27 by j7an · Pull Request #28 · j7an/shared-workflows

j7an · 2026-04-13T03:03:54Z

Summary

Closes three regressions in dependency-cooldown.yml observed on j7an/nexus-mcp#160 on 2026-04-12:

#25: silent cooldown bypass via @dependabot rebase, which sidesteps Dependabot's native cooldown so the workflow no longer enforces one despite its name
#26: stale security-review-needed label not removed on a dirty-then-clean re-scan
#27: astral-sh/setup-uv silently dropped by the extractor (and omitted from the Packages Scanned list) due to its YAML list-marker shape

⚠️ Behavior Changes (read before merging)

Default auto-merge now enforces a 7-day release-age gate. Consumers on @v2 with auto_merge: true will see Dependabot PRs sit in pending for up to 7 days instead of auto-merging immediately on day 0. Set cooldown_days: 0 in your caller workflow to restore pre-v2.0.2 behavior.
security-review-needed label is now reconciled on every scan. Automation that treated this label as persistent will now see it removed when a re-scan finds zero applicable advisories. Labels are preserved (not removed) when a scan encounters API errors, so transient failures don't silently clear warnings.
New cooldown-pending label (amber) applied when target versions are within the configured cooldown window.
New gate terminal states — the dependency-cooldown / gate context can now end at pending (cooldown-only block, default) or failure (cooldown-only block with fail_on_cooldown: true) in addition to the existing success and error states.

New Inputs

| Input | Type | Default | Description |
| --- | --- | --- | --- |
| `cooldown_days` | number | `7` | Minimum release age in days before auto-merge is allowed. Set to `0` to disable workflow-side release-age enforcement. |
| `fail_on_cooldown` | boolean | `false` | If `true`, cooldown blocks set the gate to `failure` instead of `pending`. Use when branch protection requires a hard-red blocker. |

Architecture

Extracted two standalone scripts from the inline workflow bash:

scripts/extract-deps.sh: parses the PR diff and emits TSV (name\tversion\tecosystem). Rewritten with bash-native regex to fix #27 (handles the YAML list-marker `- uses:` form). Bash 3.2 compatible.
scripts/check-release-age.sh: takes the dep TSV, queries the GitHub Releases and PyPI APIs, and emits a verdict TSV with per-row pass/fail/error verdicts. Two-attempt retry, fixture mode for tests, yanked detection.

The workflow itself (dependency-cooldown.yml) was restructured with:

New actions/checkout step pulling j7an/shared-workflows@github.workflow_sha with sparse-checkout
Three-branch gate state machine (advisory > cooldown > clean)
reconcile_label() helper that authoritatively adds/removes both labels on every scan (HAS_ERROR-aware — labels are preserved on API-error re-scans)
Release Age table in the scan comment with per-package age and earliest-unblock footer
Extraction Warning banner when the sanity-check detects a drift between raw diff count and extractor output

Test Coverage

New bats test suite with 8 tests (all passing):

1..8
ok 1 blocks sub-cooldown actions at COOLDOWN_DAYS=7 (regression for #25)
ok 2 passes everything at COOLDOWN_DAYS=0 (escape hatch)
ok 3 PyPI happy path returns pass for aged release
ok 4 yanked PyPI release fails regardless of age
ok 5 missing fixture (simulates 404) produces error verdict
ok 6 extracts all three actions from nexus-mcp#160 diff (regression for #27)
ok 7 extracts Python deps from requirements.txt diff
ok 8 empty diff produces empty output with exit 0

tests/extract-deps.bats: 3 tests against hand-crafted and real-captured diffs
tests/check-release-age.bats: 5 tests with fixture-mode API responses and a fixed NOW_EPOCH for determinism
tests/fixtures/extract-deps/nexus-mcp-160.diff: real captured diff from nexus-mcp#160 ("deps: bump the all-actions group across 1 directory with 3 updates"), used as the regression fixture
.github/workflows/ci-scripts.yml: new CI runner that triggers bats on changes to scripts/**, tests/**, ci-scripts.yml, or dependency-cooldown.yml

Root Cause of #27

The v2.0.1 regex ^\+\s+uses: did not match YAML list-marker-prefixed lines of the form + - uses: foo@sha # v1.0.0, which is exactly how astral-sh/setup-uv appears in nexus-mcp#160's diff (17 raw +...uses: lines, 2 of them - uses: shape for setup-uv, 15 plain uses: for the other two actions). The new regex ^\+[[:space:]]+(-[[:space:]]+)?uses: accepts both shapes.
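The difference between the two regex shapes can be reproduced in isolation. The following is a minimal sketch, not the code in scripts/extract-deps.sh: the helper names `matches_old` and `matches_new` and the two sample lines are made up for illustration, and the old pattern is written with `[[:space:]]` in place of `\s` so that it behaves the same inside bash's `[[ =~ ]]`.

```shell
#!/usr/bin/env bash
# Sketch only: compares the two regex shapes quoted in the PR description.
# matches_old / matches_new are hypothetical helper names.

old_re='^\+[[:space:]]+uses:'                  # v2.0.1 shape (\s spelled as [[:space:]])
new_re='^\+[[:space:]]+(-[[:space:]]+)?uses:'  # new shape with optional list marker

matches_old() { [[ $1 =~ $old_re ]]; }
matches_new() { [[ $1 =~ $new_re ]]; }

# Illustrative added-diff lines; foo@sha is a placeholder, not a real pin.
plain='+        uses: foo@sha # v1.0.0'
listed='+      - uses: foo@sha # v1.0.0'

for line in "$plain" "$listed"; do
  matches_old "$line" && old=match || old=miss
  matches_new "$line" && new=match || new=miss
  printf 'old=%s new=%s  %s\n' "$old" "$new" "$line"
done
```

Only the list-marker line should differ between the two columns: the old shape misses it and the new shape accepts it.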

Commits

17 fix: commits (all patch-bump under tag-release.yml's conventional-commit analyzer):

1f3f981 — extract-deps regression fixture for nexus-mcp#160
998554a — implement extract-deps.sh with bash-native parser
96dcc5e — add Python and empty-diff extract-deps tests
9ca390a — add check-release-age regression fixture
8beeb3d — implement check-release-age.sh with tier-1 GitHub/PyPI lookups
747ce1f — add edge-case tests for check-release-age
94f40fe — add bats test runner workflow
aab2e50 — wire extract-deps.sh into dependency-cooldown.yml
fc3f140 — add cooldown_days input and wire check-release-age.sh
d09b8fa — reconcile labels on every scan to fix stale-label bug
22120ca — combine advisory and cooldown gates in state machine
2790d8d — add Release Age and Extraction Warning comment sections
9c3fe65 — document cooldown_days, fail_on_cooldown, and cooldown-pending label
9afccfd — run reconcile_label before HAS_ERROR early-exit
053996e — add checkout, fix sanity-check dedup, guard reconcile removal on error
1ae4585 — filter local/docker in sanity check, fix date fallback order
df5d327 — also run bats on dependency-cooldown.yml changes

Test Plan

bats tests/ — 8/8 passing locally
YAML parses — python3 -c 'import yaml; yaml.safe_load(...)' clean on both dependency-cooldown.yml and ci-scripts.yml
shellcheck clean on both scripts
Commit prefix audit — all 17 commits use fix: prefix
CI green on this PR (ci-scripts.yml should run on this PR because it touches dependency-cooldown.yml)
Self-consumption smoke test — wait for the next Dependabot PR against shared-workflows to run the new workflow end-to-end
Replay verification — craft a test PR with nexus-mcp#160's bump list against a throwaway repo, confirm cooldown-pending label applied, gate pending, auto-merge not enabled

Deferred to Follow-ups (not blocking)

Release notes polish (callout for default behavior change, reconcile semantic)
Action name regex hardening (markdown-metacharacter rejection)
Negative age_days clamp on clock skew
Extract reconcile_label and sanity-check to testable shims
Add actionlint/shellcheck CI on .github/workflows/** changes

Review

This PR was developed under an extensive 3-pass parallel review process across 5 domains (security, correctness, API contract, portability, test quality). Final score: Security 9/10, Correctness 9/10, API Contract 8/10, Portability 9/10, Test Quality 6/10 → naive average 8.2/10, no remaining blockers.

Fixes #25
Fixes #26
Fixes #27

Captures the exact diff from j7an/nexus-mcp#160 that exposed issue #27 (astral-sh/setup-uv silently dropped). Test fails until the real extractor is implemented in the next commit. Refs #27

Root cause of #27: the v2.0.1 regex `^\+\s+uses:` in dependency-cooldown.yml does not match YAML list-marker prefixed lines of the form `+ - uses: foo@...`, which is exactly how astral-sh/setup-uv appears in nexus-mcp#160's diff.

Empirical verification against the captured fixture:

    $ grep -cE '^\+.*uses:' tests/fixtures/extract-deps/nexus-mcp-160.diff
    17
    $ grep -cE '^\+\s+uses:' tests/fixtures/extract-deps/nexus-mcp-160.diff
    15

The two dropped lines are both astral-sh/setup-uv (list-marker form). The other actions (harden-runner, trufflehog) appear without the `- ` prefix and were extracted correctly, so setup-uv was silently dropped while the workflow reported success.

The new bash-native parser uses [[ =~ ]] regex with an optional `(-[[:space:]]+)?` group that matches both shapes, plus:

1. Whitespace normalization via ${line%$'\r'} and [[:space:]] classes
2. Ecosystem-prefixed dedup keys (actions:foo vs pypi:foo) so a cross-ecosystem name collision doesn't drop a row
3. Strict mode (set -euo pipefail) and explicit malformed-input detection (exit 2 when input has no diff markers)
4. Bash 3.2 compatible dedup (newline-delimited sentinel string instead of `declare -A`) so the script runs on macOS system bash as well as Linux CI

Fixes #27
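The Bash 3.2 compatible dedup mentioned above can be sketched as follows. This is an illustration of the technique (a newline-delimited "seen" string checked with `case`, keyed by ecosystem and name), not the code in scripts/extract-deps.sh; the function name `dedup_rows` and the sample rows are assumptions.

```shell
#!/usr/bin/env bash
# Sketch: dedup without `declare -A`, so it also runs on bash 3.2.
# dedup_rows is a hypothetical name. It reads TSV rows
# (name<TAB>version<TAB>ecosystem) on stdin and prints the first
# occurrence of each ecosystem:name key. Assumes no field is empty.

dedup_rows() {
  local seen=$'\n' name version ecosystem key
  while IFS=$'\t' read -r name version ecosystem; do
    [ -n "$name" ] || continue
    key="${ecosystem}:${name}"
    case "$seen" in
      *$'\n'"$key"$'\n'*) continue ;;   # key already emitted, skip the row
    esac
    seen="${seen}${key}"$'\n'
    printf '%s\t%s\t%s\n' "$name" "$version" "$ecosystem"
  done
}

# Same name in two ecosystems: both survive; the exact repeat is dropped.
printf 'foo\t1.0.0\tactions\nfoo\t1.0.0\tactions\nfoo\t2.0.0\tpypi\n' | dedup_rows
```

Prefixing the key with the ecosystem is what keeps `actions:foo` and `pypi:foo` from colliding.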

Proves the extractor handles requirements.txt-style bumps and produces clean zero-row output on an empty diff. Complements the nexus-mcp#160 regression test for actions coverage. Refs #27

…#160

Captures hand-crafted GitHub API response fixtures for the three actions that appeared in j7an/nexus-mcp#160, with published_at values tuned to produce 3d/4d/14d ages relative to NOW_EPOCH=1775995200 (2026-04-12). Test fails until the real checker is implemented in the next commit. Refs #25

… lookups

Single-tier release-date lookup:
- Actions: gh api repos/<owner>/<repo>/releases/tags/v<version>
- PyPI: pypi.org/pypi/<pkg>/<version>/json (with yanked detection)

Two-attempt retry on transient failures. Fixture mode via AGE_FIXTURE_DIR for hermetic tests. COOLDOWN_DAYS=0 disables the gate entirely (each row emitted as pass with no API call). Bash 3.2 compatible (no associative arrays, no [[ -v ]]).

Refs #25
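The per-row verdict described above reduces to a small amount of integer arithmetic once the publish time is known. The sketch below is an assumption-laden illustration, not the body of scripts/check-release-age.sh: the function name `age_verdict`, its argument order, and the choice of `>=` as the pass threshold are all hypothetical, and it takes an already-parsed epoch instead of calling any API.

```shell
#!/usr/bin/env bash
# Sketch: release-age verdict from epochs (hypothetical helper).
# args: published_epoch now_epoch cooldown_days [yanked]
# prints: pass | fail | error

age_verdict() {
  local published=$1 now=$2 cooldown=$3 yanked=${4:-false}
  if [ "$cooldown" -eq 0 ]; then echo pass; return; fi   # escape hatch: gate disabled
  if [ -z "$published" ]; then echo error; return; fi    # lookup produced no date
  if [ "$yanked" = true ]; then echo fail; return; fi    # yanked fails regardless of age
  local age_days=$(( (now - published) / 86400 ))
  # Assumption: a release exactly cooldown days old passes.
  if [ "$age_days" -ge "$cooldown" ]; then echo pass; else echo fail; fi
}

NOW_EPOCH=1775995200   # the fixed test clock quoted in the fixture commit
age_verdict "$((NOW_EPOCH - 3 * 86400))"  "$NOW_EPOCH" 7   # 3 days old
age_verdict "$((NOW_EPOCH - 14 * 86400))" "$NOW_EPOCH" 7   # 14 days old
```

With a 7-day cooldown, the 3-day-old release is the one that blocks.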

Covers the COOLDOWN_DAYS=0 escape hatch, PyPI happy path with naive UTC upload_time format, yanked-release forced-fail, and 404-simulates-error. Refs #25

Runs on PRs touching scripts/** or tests/** with hardened runner and pinned action SHAs. Makes the extract-deps and check-release-age regressions catchable in CI without needing a live Dependabot PR. Refs #25, #27

Replaces the inline action+Python extraction block with a single shell-out to scripts/extract-deps.sh, then populates the legacy ACTIONS/PY_DEPS/ACTION_VERSIONS/PY_VERSIONS variables that the downstream advisory-scan loops still consume. Preserves PR-body version fallback for both ecosystems. Adds the extraction-count sanity-check guard (#27 acceptance criterion): when the diff line count and extractor row count disagree, the scan comment surfaces an EXTRACTION_WARNING that downstream comment-rendering code (added in Task 12) will display. Refs #27

Introduces two new workflow_call inputs:
- cooldown_days (default 7): enforces release age before auto-merge
- fail_on_cooldown (default false): opt-in hard-red gate for strict consumers

The phase-2 check runs after extraction and before the advisory scan, populating COOLDOWN_FAILURES for the gate state machine added in Task 11. AGE_TSV is preserved for the comment-body Release Age section added in Task 12. When COOLDOWN_DAYS=0, the script invocation is skipped entirely (escape hatch preserves pre-v2.0.2 behavior for consumers who can't adopt the new gate immediately).

Refs #25

Replaces the additive 'gh pr edit --add-label' block in the 'advisories found' branch with a reconcile_label() helper that runs on every scan regardless of verdict. The helper adds OR removes each label to match the current scan result, so a dirty-then-clean sequence (observed on j7an/nexus-mcp#160) leaves no stale labels.

Handles two labels generically:
- security-review-needed (red, B60205): advisory gate
- cooldown-pending (amber, FBCA04): release-age gate (consumed by Task 11 state machine)

Each label has its own color and description; the helper takes them as parameters so adding a third label later is one call, not a copy-paste.

Fixes #26
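The add-or-remove decision, together with the error guard that the PR summary describes (labels preserved on API-error re-scans), can be sketched as below. This is a simplified illustration, not the workflow's helper: `label_action` is a hypothetical name, the argument order is assumed, and the color and description parameters of the real reconcile_label() are omitted. The `gh pr edit --add-label` and `--remove-label` flags are standard GitHub CLI options.

```shell
#!/usr/bin/env bash
# Sketch: decide what to do with one label after a scan.

label_action() {
  # args: count has_error ; prints add | remove | keep
  local count=$1 has_error=${2:-}
  if [ "$count" -gt 0 ]; then
    echo add        # evidence found: label should be present
  elif [ -n "$has_error" ]; then
    echo keep       # zero count is not trustworthy after API errors
  else
    echo remove     # clean scan: label should be absent
  fi
}

reconcile_label() {
  # args: pr_number label count has_error (simplified signature)
  local pr=$1 label=$2
  case "$(label_action "$3" "${4:-}")" in
    add)    gh pr edit "$pr" --add-label "$label" ;;
    remove) gh pr edit "$pr" --remove-label "$label" ;;
    keep)   : ;;
  esac
}

label_action 0 ""    # clean scan
label_action 0 1     # zero findings, but the scan hit errors
```

Keeping the decision in a pure function separates it from the `gh` side effects, which is one way to approach the "testable shims" follow-up listed above.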

Replaces the advisory-only auto-merge decision with a three-state machine:

TOTAL>0 → gate=success, AUTO_MERGE_OK=false ('review needed')
COOLDOWN_FAILURES>0 → gate=pending (or failure if FAIL_ON_COOLDOWN=true), AUTO_MERGE_OK=false ('waiting for age')
both 0 → gate=success, AUTO_MERGE_OK=true ('ready for merge')

Auto-merge is now gated on AUTO_MERGE_OK=true, which is set only in the fully-clean case. This closes the silent-bypass path observed on j7an/nexus-mcp#160: even if advisories are clean, sub-cooldown versions block auto-merge until the next scan naturally ages past the threshold.

Advisory failures win the status-description tie-break (shorter, more actionable than the cooldown message). The comment body and labels (reconciled in Task 10) reflect both gates independently.

The HAS_ERROR early-exit branch in the comment builder is unchanged: scan errors still short-circuit to gate=error before the state machine runs.

Refs #25
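The three branches plus the error short-circuit can be written as one pure function. This is a sketch of the decision table in the commit message, not the workflow's inline bash: `decide_gate` and its argument order are hypothetical, and reporting AUTO_MERGE_OK=false on the error path is an assumption consistent with "set only in the fully-clean case".

```shell
#!/usr/bin/env bash
# Sketch: gate state machine (advisory > cooldown > clean).
# args: total cooldown_failures fail_on_cooldown has_error
# prints: "<gate_state> <auto_merge_ok>"

decide_gate() {
  local total=$1 cooldown_failures=$2 fail_on_cooldown=${3:-false} has_error=${4:-}
  if [ -n "$has_error" ]; then
    echo "error false"               # scan errors short-circuit first
  elif [ "$total" -gt 0 ]; then
    echo "success false"             # advisories found: review needed
  elif [ "$cooldown_failures" -gt 0 ]; then
    if [ "$fail_on_cooldown" = true ]; then
      echo "failure false"           # opt-in hard-red blocker
    else
      echo "pending false"           # default: waiting for age
    fi
  else
    echo "success true"              # fully clean: ready for merge
  fi
}

decide_gate 0 2 false   # clean advisories, two sub-cooldown versions
decide_gate 0 0 false   # fully clean
```

Because the advisory branch is checked before the cooldown branch, a PR with both problems reports the advisory outcome.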

Release Age table shows per-package published date, age in days, and pass/fail/error status. Footer computes the 'earliest unblock' timestamp as max(fail.published_at) + cooldown_days so consumers see a concrete wait time instead of an anxious-looking 'pending' state.

Extraction Warning section renders near the top of the comment when diff-line count and extractor output disagree, surfacing silent parsing drops (#27 acceptance criterion: extraction-count guard).

Date parsing for the unblock-time computation uses a multi-flag fallback chain (GNU date -d, GNU date -d with appended Z, BSD date -jf with and without Z stripping) to handle both GitHub published_at and PyPI upload_time formats portably.

Refs #25, #27
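The 'earliest unblock' formula, max(fail.published_at) + cooldown_days, is easy to check by hand with epochs. The sketch below assumes a three-column input (name, published epoch, verdict), which is a made-up layout for illustration and may not match the verdict TSV that scripts/check-release-age.sh emits; `earliest_unblock_epoch` is likewise a hypothetical name.

```shell
#!/usr/bin/env bash
# Sketch: earliest unblock = newest failing publish time + cooldown window.
# stdin: name<TAB>published_epoch<TAB>verdict   (assumed layout)
# arg:   cooldown_days
# Prints an epoch, or nothing when no row failed.

earliest_unblock_epoch() {
  awk -F'\t' -v days="$1" '
    $3 == "fail" && ($2 + 0) > max { max = $2 + 0 }
    END { if (max > 0) printf "%d\n", max + days * 86400 }
  '
}

NOW_EPOCH=1775995200
printf 'a\t%s\tfail\nb\t%s\tfail\nc\t%s\tpass\n' \
  "$((NOW_EPOCH - 3 * 86400))" \
  "$((NOW_EPOCH - 4 * 86400))" \
  "$((NOW_EPOCH - 14 * 86400))" \
  | earliest_unblock_epoch 7
```

With failures at 3 and 4 days old and a 7-day cooldown, the newest failure governs, so the PR unblocks 4 days after NOW_EPOCH.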

…ding label

Adds two new inputs to the Inputs table, rewrites the Cool-down configuration section to describe both enforcement layers (Dependabot native + workflow-side), adds a Labels subsection explaining the authoritative reconciliation semantic, and adds a Recommended section on scheduled re-scan for long-pending PRs. Refs #25, #26

Cross-task interaction found in final review: Task 10's reconcile_label block sat below the v2.0.1 HAS_ERROR early-exit branch, so an API-error re-scan (GHSA rate-limit, OSV timeout) exited with state=error before reconciling labels. A previously-dirty PR would keep its stale security-review-needed label across an error-path re-scan, which is exactly the failure mode #26 was filed about, under a narrower trigger.

Hoists the reconcile block to run immediately after TOTAL is computed but before the if/elif/else comment-builder branches (including the HAS_ERROR exit). Labels now reconcile on every scan, including error paths, honoring the README's 'reconciled on every scan' guarantee. The reconcile block's content is unchanged from Task 10; only its position moved. Header comment updated to reflect the reason for the position.

Refs #26

…removal on error

Three cross-task fixes from final parallel review:

1. Add actions/checkout step. Reusable workflows don't auto-checkout their own repo, so ${GITHUB_WORKSPACE}/scripts/... resolved to nothing at runtime and the entire fix silently failed. Checkout pulls j7an/shared-workflows at github.workflow_sha (same commit as the workflow YAML), with sparse-checkout of scripts/ for a minimal clone. Script invocation paths updated to shared-workflows/scripts/*.sh.

2. Fix extraction sanity-check to compare unique action names rather than raw line counts. Previous version compared grep -c (line count) to post-dedup extraction count, which always fired on real diffs bumping an action used across multiple workflow files, including the nexus-mcp#160 fixture this PR exists to defend (17 raw lines, 3 unique actions). New version dedups via sort -u and uses the extractor's exact regex shape (-[[:space:]]+)? to eliminate whitespace drift.

3. Guard reconcile_label's removal branch on empty HAS_ERROR. Task 13a hoisted reconcile above the HAS_ERROR exit to fix the 'stale label never removed on error re-scan' bug, but this introduced the inverse: a re-scan where all GHSA queries error leaves TOTAL=0, causing the reconciler to remove security-review-needed from a genuinely-dirty PR. The new guard makes reconcile additive-only when HAS_ERROR is set: labels can still be added on partial-scan evidence, but never removed unless the scan was clean enough to trust the zero-count.

Refs #25, #26, #27
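The unique-name comparison in fix 2 can be sketched as a small pipeline. This is an illustration rather than the workflow's inline bash: `expected_actions` is a hypothetical name, the sample diff lines are made up, and the `./` and `docker://` filter is included on the assumption that local and docker references should not count, as the follow-up commit describes.

```shell
#!/usr/bin/env bash
# Sketch: unique action names from the added `uses:` lines of a diff.
# stdin: unified diff text ; stdout: one unique name per line, sorted.

expected_actions() {
  grep -E '^\+[[:space:]]+(-[[:space:]]+)?uses:' \
    | sed -E 's/^\+[[:space:]]+(-[[:space:]]+)?uses:[[:space:]]*//; s/@.*$//' \
    | grep -vE '^(\./|docker://)' \
    | sort -u
}

# Illustrative diff fragment: one action used twice, one list-marker form,
# one local action, one docker reference.
printf '%s\n' \
  '+        uses: a/b@sha # v1' \
  '+      - uses: c/d@sha # v2' \
  '+        uses: a/b@sha # v1' \
  '+        uses: ./local-action' \
  '+        uses: docker://alpine:3' \
  | expected_actions
```

Counting the lines of this output, rather than the raw `grep -c` total, gives a number that can be compared directly against the extractor's deduplicated row count.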

… order

Two small fixes from the post-13b parallel review:

1. Sanity-check dedup pipeline now filters ./ and docker:// prefixes. Task 13b's deduped EXPECTED_ACTIONS counted all unique uses: lines in the diff, including local and docker action references that scripts/extract-deps.sh intentionally skips (lines 51-52). A PR adding a local action would trip the 'Extraction mismatch' warning despite correct extractor behavior. The new grep -v clause mirrors the extractor's skip semantics so the sanity-check stays honest.

2. Release Age unblock-epoch date conversion now tries GNU 'date -d @' first, then BSD 'date -r' as fallback. The previous order worked by accident on Ubuntu runners because GNU 'date -r <integer>' treats the argument as a file path (not an epoch), silently fails with non-existent file, and falls through. Brittle: a file named after the epoch integer in cwd would return that file's mtime instead. GNU-first order matches the actual production platform.

Refs #25, #27
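The GNU-first fallback order from fix 2 looks roughly like this. It is a minimal sketch, not the workflow's code: the function name `epoch_to_iso` and the output format string are assumptions.

```shell
#!/usr/bin/env bash
# Sketch: convert an epoch to an ISO-8601 UTC timestamp.
# Tries GNU `date -d @EPOCH` first, then BSD `date -r EPOCH`.

epoch_to_iso() {
  local epoch=$1 fmt='+%Y-%m-%dT%H:%M:%SZ'
  date -u -d "@${epoch}" "$fmt" 2>/dev/null \
    || date -u -r "$epoch" "$fmt" 2>/dev/null
}

epoch_to_iso 0
```

Trying `-d @` first means that on GNU coreutils the `-r` form, which would be read as a file path there, is never reached unless the first call fails.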

The ci-scripts.yml paths filter previously only triggered on scripts/**, tests/**, or its own file. Tasks 13b and 13c made several changes to dependency-cooldown.yml (checkout step, sanity-check dedup, reconcile_label HAS_ERROR guard, local/docker filter, date fallback order) — none of which triggered bats CI because the workflow file wasn't in the paths filter. Adding it ensures that any future change to dependency-cooldown.yml runs the extract-deps and check-release-age regression tests as a partial safety net. bats can't reach workflow-level inline bash, but it can catch script-adjacent regressions that the workflow invokes. Refs #25, #27

j7an added 17 commits April 12, 2026 13:13

fix(cooldown): add extract-deps regression fixture for nexus-mcp#160

1f3f981

fix(cooldown): add Python and empty-diff extract-deps tests

96dcc5e

fix(cooldown): add edge-case tests for check-release-age

747ce1f

fix(ci): add bats test runner for shared-workflows scripts

94f40fe

j7an marked this pull request as ready for review April 13, 2026 03:04

j7an merged commit 16b94fd into main Apr 13, 2026
5 checks passed

j7an deleted the fix/cooldown-regressions branch April 13, 2026 03:04

j7an merged 17 commits into main from fix/cooldown-regressions
