feat(ci): pin Claude workflows to Opus 4.7 1M, allow test-skip on doc fixes by shuheng-liu · Pull Request #208 · TensorAuto/OpenTau

shuheng-liu · 2026-04-29T21:13:31Z

What this does

Two improvements to the three Claude integration workflows.

1. Pin to Opus 4.7 with 1M-context, model controlled by a repo variable

All three workflows (claude-pr-review.yml, claude-implement-fixes.yml, extract-claude-lessons.yml) now pass --model via claude_args, evaluating to:

--model ${{ vars.CLAUDE_MODEL || 'claude-opus-4-7[1m]' }}

Default: claude-opus-4-7[1m] — Opus 4.7 with the 1M-context variant. Largest context, top-tier reasoning.
Override: set repo variable CLAUDE_MODEL (Settings → Secrets and variables → Actions → Variables) to any model alias the action accepts. Bumping models = changing one variable, no PR. Useful when Anthropic ships a newer Opus.
The alias claude-opus-4-7 (no date suffix) auto-tracks the latest snapshot of Opus 4.7. Cross-major-version bumps (4.7 → 4.8 → 5.x) still need an explicit update — but that's a once-a-quarter-ish thing, not a per-PR concern.

Previously only claude-pr-review had --model set; the other two ran on whatever the action's default was.

2. Test-skip discretion in `claude-implement-fixes`

Step 4 of the implement-fixes prompt was rigid: "Run the test command from CLAUDE.md before pushing." The CPU suite alone takes ~9 min and is wasted on doc-only / comment-only / formatting fixes, blowing each @claude fix cycle out to 15-20 min.

New step 4 gives Claude discretion + an audit trail:

May skip when the diff is purely documentation, comments, formatting, or string-literal text — the kind of change that cannot alter runtime behavior. Commit body must contain tests: skipped — <one-sentence reason>.
Must run otherwise (imports, function bodies, control flow, types, runtime configs, deps, build manifests). Commit body must contain tests: passed — <exact command run>. Scope to the changed subtree where possible (pytest tests/policies/test_pi05.py for a pi05 change) to keep the run fast.
"When in doubt, run" — explicit fallback to the safer default.
If tests fail: do NOT push; reply on the PR comment thread with the failure and stop.

The audit-trail line in the commit body lets reviewers second-guess if Claude underestimated risk.

How it was tested

actionlint clean on all three files (exit 0).
pre-commit (typos, gitleaks, zizmor, end-of-file-fixer, trailing-whitespace) all pass.
The [1m] model-ID syntax has not been verified against the action's --model parser yet. If the CLI rejects it, fall back to claude-opus-4-7 (no [1m]) by setting the CLAUDE_MODEL repo variable to that — no code change needed.
Test-skip discretion can only be observed end-to-end after merge, by leaving an @claude fix on a doc-only change and confirming the commit body contains tests: skipped and total runtime drops below ~3 minutes.

Note for the reviewer: This PR's own Claude PR Review check will fail with App token exchange failed: 401 Unauthorized — Workflow validation failed. This is expected: the action enforces a security check that rejects any PR which modifies the workflow file calling it. All other CI passes normally.

How to checkout & try? (for the reviewer)

```bash
gh pr checkout
git diff origin/main -- .github/workflows/claude-.yml
actionlint .github/workflows/claude-.yml
```

After merge:

(Optional) Set repo variable CLAUDE_MODEL to override the default.
To exercise the test-skip path: leave an @claude fix comment on any future doc-only PR; expect the run to finish in ~2-3 min with tests: skipped — ... in the commit body.

Checklist

I have added Google-style docstrings to important functions and ensured function parameters are typed.
My PR includes policy-related changes.
- If the above is checked: I have run the GPU pytests (pytest -m "gpu") and regression tests.

… fixes

… fixes (#208) (cherry picked from commit b9b3841)

feat(ci): pin Claude workflows to Opus 4.7 1M, allow test-skip on doc…

03a39c4

… fixes

shuheng-liu added the feature New feature or request label Apr 29, 2026

shuheng-liu self-assigned this Apr 29, 2026

shuheng-liu marked this pull request as ready for review April 29, 2026 21:21

shuheng-liu merged commit b9b3841 into main Apr 29, 2026
3 of 6 checks passed

shuheng-liu deleted the claude/model-and-test-bypass branch April 29, 2026 21:22

shuheng-liu mentioned this pull request Apr 29, 2026

fix(ci): gate extract-claude-lessons on head branch, not PR author #212

Merged

3 tasks

shuheng-liu added a commit that referenced this pull request May 2, 2026

feat(ci): pin Claude workflows to Opus 4.7 1M, allow test-skip on doc…

ef68a93

… fixes (#208) (cherry picked from commit b9b3841)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(ci): pin Claude workflows to Opus 4.7 1M, allow test-skip on doc fixes#208

feat(ci): pin Claude workflows to Opus 4.7 1M, allow test-skip on doc fixes#208
shuheng-liu merged 1 commit into
mainfrom
claude/model-and-test-bypass

shuheng-liu commented Apr 29, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

shuheng-liu commented Apr 29, 2026

What this does

1. Pin to Opus 4.7 with 1M-context, model controlled by a repo variable

2. Test-skip discretion in claude-implement-fixes

How it was tested

How to checkout & try? (for the reviewer)

Checklist

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

2. Test-skip discretion in `claude-implement-fixes`