Port Ricky review-depth skill updates by khaliqgant · Pull Request #63 · AgentWorkforce/skills

khaliqgant · 2026-06-01T12:27:17Z

Summary

Port the review-depth wording from Add adaptive generated workflow review depth ricky#145 into the shared writing-agent-relay-workflows and relay-80-100-workflow skills
Strengthen the shared review-depth contract with explicit light, standard, and deep review/fix paths, final-review gate dependencies, and required artifacts
Remove contradictory universal post-Codex wording from the affected skills and align /create-workflow with selected review-depth behavior
Bump agent-workforce-skills to 1.1.1, writing-agent-relay-workflows to 1.6.16, relay-80-100-workflow to 1.0.8, and create-workflow to 1.0.4

Validation

node -e 'JSON.parse(require("fs").readFileSync("prpm.json","utf8")); console.log("prpm.json OK")'
git diff --check
rg -n "after the Codex loop|post-Codex-fix review are green|mandatory Claude|mandatory sequential|dual review loops|mandatory dual review|Every workflow must include two|This applies even to small|post-Codex-fix path|Claude-then-Codex review/fix loops|mandatory final Claude-then-Codex" skills/writing-agent-relay-workflows/SKILL.md skills/relay-80-100-workflow/SKILL.md commands/create-workflow.md README.md prpm.json returned no matches
prpm publish --dry-run --package writing-agent-relay-workflows
prpm publish --dry-run --package relay-80-100-workflow
prpm publish --dry-run --package create-workflow

Note: full prpm publish --dry-run previously reported an existing unrelated frontmatter validation issue in skills/openclaw-orchestrator/SKILL.md; scoped dry-runs for the changed packages pass.

coderabbitai · 2026-06-01T12:27:30Z

📝 Walkthrough

Walkthrough

This PR updates the agent-workforce-skills package to version 1.1.1, replacing mandatory Claude-then-Codex review/fix loop requirements with a flexible review-depth selection model. Two relay workflow skills adopt the new framework: deeper-tier workflows require full Claude→Codex loops, while lighter tiers may scale down under specific constraints. Version bumps and metadata descriptions across README and prpm.json align with the skill documentation revisions.

Changes

Review-Depth Selection Model Rollout

Layer / File(s)	Summary
Version and metadata coordination `README.md`, `prpm.json`	Package version 1.1.0 → 1.1.1 at top level; `@agent-relay/writing-agent-relay-workflows` bumped 1.6.15 → 1.6.16 and `@agent-relay/relay-80-100-workflow` bumped 1.0.7 → 1.0.8. Collection metadata reasons updated to reference "review-depth … with test hardening" instead of "mandatory Claude-then-Codex" language.
Relay-80-100-Workflow skill update `skills/relay-80-100-workflow/SKILL.md`	Front-matter description shifted from mandatory Claude-then-Codex to review-depth fresh-eyes framing. "After all squads converge" section expanded to detail deep-tier Claude→fixer→Claude→Codex cycles and conditional light/standard-tier scaling when Ricky selects it, while preserving deterministic gates and one independent Claude review/fix pass.
Writing-agent-relay-workflows skill documentation `skills/writing-agent-relay-workflows/SKILL.md`	Extensive documentation overhaul: skill description, checklist item 4, "Review-Depth Fresh-Eyes Loops" section (replacing "Mandatory Fresh-Eyes Review Loops"), "Shadowed Squad Review Loop" steps 8–10, commit/PR boundary gating, role guidance, pipeline requirements, and "Common Mistakes" table all revised to distinguish deep-tier (mandatory full Claude→Codex) from lighter workflows (may scale down under constraints) while keeping deterministic tests and independent Claude review/fix on the critical path.

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~20 minutes

Possibly related PRs

AgentWorkforce/skills#42: Both PRs update the same workflow authoring/doc content and align prpm.json/README skill descriptions around the Claude→Codex "review/fix loop" requirements, with this PR revising wording/conditions from the "mandatory Claude-then-Codex" framing.
AgentWorkforce/skills#49: Both PRs modify the skills/writing-agent-relay-workflows/SKILL.md review/fix loop semantics, switching/clarifying Claude↔Codex "review-depth" loop requirements and related workflow gating.
AgentWorkforce/skills#46: Both PRs update the writing-agent-relay-workflows skill docs/metadata to adjust the required Claude↔Codex review/fix loop behavior and wording.

Poem

🐰 A Release in Review-Depth

Once mandatory paths were fixed in stone,
Claude → Codex loops, one way alone.
Now deeper tiers choose their own speed,
Light workflows flex by review-depth's creed.
Versioned, documented, and wise—
flexibility wins the CI prize! ✨

🚥 Pre-merge checks | ✅ 5

✅ Passed checks (5 passed)

Check name	Status	Explanation
Title check	✅ Passed	The title 'Port Ricky review-depth skill updates' accurately summarizes the main objective: porting review-depth wording changes from the ricky repository into shared skills.
Docstring Coverage	✅ Passed	No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check.
Linked Issues check	✅ Passed	Check skipped because no linked issues were found for this pull request.
Out of Scope Changes check	✅ Passed	Check skipped because no linked issues were found for this pull request.
Description check	✅ Passed	The pull request description directly aligns with the changeset, describing the porting of review-depth wording, version bumps, and validation steps performed.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

📝 Generate docstrings

Create stacked PR
Commit on current branch

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch codex/port-pr145-review-depth-skills

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

gemini-code-assist

Code Review

This pull request updates the documentation and package metadata to transition from a rigid "mandatory Claude-then-Codex" review loop to a flexible "review-depth" model. Under this new model, deep-tier workflows still require the full sequential Claude and Codex loops, while light and standard workflows can scale down based on the selected tier, provided that deterministic gates and at least one independent Claude review pass remain. The review feedback focuses on improving grammatical consistency, tense alignment (e.g., using "selects" instead of "selected"), and phrasing clarity across the updated markdown files.

gemini-code-assist · 2026-06-01T12:28:31Z

 4. A fresh self-review agent reads the actual files, AGENTS.md / CLAUDE.md, recent related work, and local conventions. It writes findings to disk.
 5. The implementer repairs valid findings, then deterministic gates rerun from captured output.
-6. After all squads converge, run the mandatory sequential fresh-eyes review/fix loops: Claude reviews the final diff and artifacts, a fixer repairs valid findings and adds or updates appropriate tests/proofs, Claude reviews the post-fix state again, then Codex repeats the same cycle from scratch over the post-Claude-fix state.
+6. After all squads converge, run the review-depth fresh-eyes review/fix loops. Deep-tier workflows require Claude to review the final diff and artifacts, a fixer to repair valid findings and add or update appropriate tests/proofs, Claude to review the post-fix state again, then Codex to repeat the same cycle from scratch over the post-Claude-fix state. Light and standard generated workflows may scale down only when Ricky selected that tier and deterministic gates plus at least one independent Claude review/fix pass remain mandatory.


To maintain consistent tense with the surrounding verbs ("remain", "may scale down"), consider using "selects" or "has selected" instead of the past-tense "selected".

Suggested change

6. After all squads converge, run the review-depth fresh-eyes review/fix loops. Deep-tier workflows require Claude to review the final diff and artifacts, a fixer to repair valid findings and add or update appropriate tests/proofs, Claude to review the post-fix state again, then Codex to repeat the same cycle from scratch over the post-Claude-fix state. Light and standard generated workflows may scale down only when Ricky selected that tier and deterministic gates plus at least one independent Claude review/fix pass remain mandatory.

6. After all squads converge, run the review-depth fresh-eyes review/fix loops. Deep-tier workflows require Claude to review the final diff and artifacts, a fixer to repair valid findings and add or update appropriate tests/proofs, Claude to review the post-fix state again, then Codex to repeat the same cycle from scratch over the post-Claude-fix state. Light and standard generated workflows may scale down only when Ricky selects that tier and deterministic gates plus at least one independent Claude review/fix pass remain mandatory.

gemini-code-assist · 2026-06-01T12:28:31Z

+## Review-Depth Fresh-Eyes Loops

-Every workflow must include two comprehensive fresh-eyes review/fix loops before final acceptance, commit, PR creation, or handoff: first Claude, then Codex. This applies even to small workflows and even when deterministic tests pass. Tests prove commands passed; the fresh-eyes loops make independent agents read the actual resulting files and artifacts as if they did not author them.
+Deep-tier workflows must include two comprehensive fresh-eyes review/fix loops before final acceptance, commit, PR creation, or handoff: first Claude, then Codex. Lighter generated workflows may use a scaled review depth only when Ricky classifies the spec as light or standard, deterministic tests and hard validation stay mandatory, and at least one independent Claude review/fix pass reads the resulting files and artifacts as if it did not author them.


The phrase "as if it did not author them" is slightly awkward because "it" grammatically refers to "pass" (which doesn't author files). It would be clearer to refer to the "reviewing agent" or "reviewer" instead.

Suggested change

Deep-tier workflows must include two comprehensive fresh-eyes review/fix loops before final acceptance, commit, PR creation, or handoff: first Claude, then Codex. Lighter generated workflows may use a scaled review depth only when Ricky classifies the spec as light or standard, deterministic tests and hard validation stay mandatory, and at least one independent Claude review/fix pass reads the resulting files and artifacts as if it did not author them.

Deep-tier workflows must include two comprehensive fresh-eyes review/fix loops before final acceptance, commit, PR creation, or handoff: first Claude, then Codex. Lighter generated workflows may use a scaled review depth only when Ricky classifies the spec as light or standard, deterministic tests and hard validation stay mandatory, and at least one independent Claude review/fix pass reads the resulting files and artifacts as if the reviewing agent did not author them.

gemini-code-assist · 2026-06-01T12:28:31Z

-8. Run the mandatory fresh-eyes review loops in sequence: Claude reviews the actual final diff and artifacts, a fixer repairs findings and hardens them with appropriate tests/proofs, Claude reviews the post-fix state again, then Codex repeats the same cycle from scratch over the post-Claude-fix state.
-9. Optional extra reviewers can be added for high-stakes work, but they do not replace the sequential Claude-then-Codex loops.
-10. Final signoff only happens after post-Codex-fix review and final deterministic gates prove the spec is complete, or a blocker artifact explains why it cannot be completed.
+8. Run the selected review-depth fresh-eyes loops. Deep-tier workflows run the full Claude loop followed by the Codex loop; light and standard generated workflows may scale down only when Ricky selected that tier.


To maintain consistent tense with the surrounding verbs ("run", "may scale down"), consider using "selects" or "has selected" instead of the past-tense "selected".

Suggested change

8. Run the selected review-depth fresh-eyes loops. Deep-tier workflows run the full Claude loop followed by the Codex loop; light and standard generated workflows may scale down only when Ricky selected that tier.

8. Run the selected review-depth fresh-eyes loops. Deep-tier workflows run the full Claude loop followed by the Codex loop; light and standard generated workflows may scale down only when Ricky selects that tier.

gemini-code-assist · 2026-06-01T12:28:31Z

-| Final acceptance before repair and dual review | Broken work can stop or commit without giving the team a final chance to fix it. Run repairable gates first, then the Claude-then-Codex review/fix loops, then final deterministic acceptance before commit/PR |
-| Skipping the mandatory dual review loops | Add sequential Claude-then-Codex fresh-eyes review/fix loops after repairable verification and before final acceptance, commit, PR creation, or handoff |
+| Final acceptance before repair and required review | Broken work can stop or commit without giving the team a final chance to fix it. Run repairable gates first, then the selected review-depth review/fix loop, then final deterministic acceptance before commit/PR |
+| Skipping required review-depth loops | Add the review/fix loop required for the generated review depth after repairable verification and before final acceptance, commit, PR creation, or handoff; deep tier requires sequential Claude-then-Codex fresh-eyes loops |


To maintain terminology consistency with the rest of the document (which uses "selected review-depth loops" or "selected review depth"), consider changing "generated review depth" to "selected review depth".

Suggested change

| Skipping required review-depth loops | Add the review/fix loop required for the generated review depth after repairable verification and before final acceptance, commit, PR creation, or handoff; deep tier requires sequential Claude-then-Codex fresh-eyes loops |

| Skipping required review-depth loops | Add the review/fix loop required for the selected review depth after repairable verification and before final acceptance, commit, PR creation, or handoff; deep tier requires sequential Claude-then-Codex fresh-eyes loops |

devin-ai-integration

Devin Review found 1 potential issue.

View 3 additional findings in Devin Review.

devin-ai-integration · 2026-06-01T12:30:28Z

 2. Pick the coordination shape deliberately: Conversation for non-trivial coordination, Pipeline only for linear one-shot handoffs.
 3. Use repairable validation gates: capture red output with `failOnError: false`, hand it to a repair owner, then rerun the same check.
-4. Run the mandatory fresh-eyes loops in order: Claude review/fix/final review/final fix, then Codex review/fix/final review/final fix.
+4. Run fresh-eyes review at the depth warranted by the spec: deep-tier workflows use Claude review/fix/final review/final fix followed by Codex review/fix/final review/final fix; lighter generated workflows may scale down only when deterministic gates, hard validation, and at least one independent Claude review/fix pass remain on the critical path.


🟡 Incomplete terminology migration leaves contradictory "mandatory" instruction in Pipeline table

Line 32 introduces the new review-depth concept: "lighter generated workflows may scale down." Line 1580 (also changed in this PR) updates the Pipeline section to say "the selected review-depth review/fix loops." However, line 93 in the same file still says "The mandatory final Claude-then-Codex review/fix loops still apply" in the Pipeline shape table — directly contradicting the new variable-depth concept. An AI agent reading this skill will receive conflicting instructions: the checklist says review depth is variable based on spec tier, but the Pipeline table says the full loops are always mandatory.

Contradicting line at line 93 (unchanged)

skills/writing-agent-relay-workflows/SKILL.md:93:

| Pipeline (one-shot DAG) | ... | Linear, well-specified transformations; deterministic data passing; no live agent-to-agent coordination during implementation. The mandatory final Claude-then-Codex review/fix loops still apply. |

This contradicts the changed line 32 and changed line 1580 which both use "review-depth" / "selected" language.

Prompt for agents

The PR migrates terminology from 'mandatory Claude-then-Codex' to 'review-depth' across the skill file, but missed updating line 93 in the Coordination Style table. Line 93 still reads 'The mandatory final Claude-then-Codex review/fix loops still apply' in the Pipeline shape's 'Use when' column. This should be updated to use the new review-depth language (e.g., 'The selected review-depth fresh-eyes loops still apply') to match line 32 (checklist item 4) and line 1580 (Pipeline subsection heading text) which both now use the variable-depth terminology. The file is skills/writing-agent-relay-workflows/SKILL.md and the problematic text is in the markdown table at line 93.

Was this helpful? React with 👍 or 👎 to provide feedback.

agent-relay-bot · 2026-06-01T12:31:17Z

Reviewed PR #63 and fixed the issues found.

Changes made:

Updated stale /create-workflow command wording to the new review-depth model.
Bumped /create-workflow metadata from 1.0.3 to 1.0.4 in prpm.json and README.md.
Fixed remaining unconditional Codex/post-Codex acceptance language so light/standard tiers are coherent.
Applied all bot review suggestions around tense, terminology, and ambiguous phrasing.

Local checks run:

Parsed prpm.json and prpm.lock.
Verified manifest package files exist and README versions align for changed packages.
Scanned focused files for stale old-contract phrases.
Checked changed files for trailing whitespace.

agent-relay-bot · 2026-06-01T12:31:23Z

✅ pr-reviewer applied fixes — committed and pushed 9bc4ff5 to this PR. The notes below describe what changed.

Reviewed PR #63 and fixed the issues found.

Changes made:

Updated stale /create-workflow command wording to the new review-depth model.
Bumped /create-workflow metadata from 1.0.3 to 1.0.4 in prpm.json and README.md.
Fixed remaining unconditional Codex/post-Codex acceptance language so light/standard tiers are coherent.
Applied all bot review suggestions around tense, terminology, and ambiguous phrasing.

Local checks run:

Parsed prpm.json and prpm.lock.
Verified manifest package files exist and README versions align for changed packages.
Scanned focused files for stale old-contract phrases.
Checked changed files for trailing whitespace.

agent-relay-bot

✅ pr-reviewer applied fixes — committed and pushed 9bc4ff5 to this PR. The notes below describe what changed.

Reviewed PR #63 and fixed the issues found.

Changes made:

Updated stale /create-workflow command wording to the new review-depth model.
Bumped /create-workflow metadata from 1.0.3 to 1.0.4 in prpm.json and README.md.
Fixed remaining unconditional Codex/post-Codex acceptance language so light/standard tiers are coherent.
Applied all bot review suggestions around tense, terminology, and ambiguous phrasing.

Local checks run:

Parsed prpm.json and prpm.lock.
Verified manifest package files exist and README versions align for changed packages.
Scanned focused files for stale old-contract phrases.
Checked changed files for trailing whitespace.

cubic-dev-ai

1 issue found across 4 files

_{Reply with feedback, questions, or to request a fix.

Re-trigger cubic}

agent-relay-bot · 2026-06-01T12:37:29Z

Reviewed PR #63 locally and fixed the docs consistency issues I found.

Changes made:

Renamed the stale unconditional “Mandatory Claude-Then-Codex” common-pattern section to “Deep-Tier Claude-Then-Codex”.
Clarified that the full Claude-then-Codex loop example applies when selected review depth is deep.
Replaced generic-skill references to “Ricky selects/classifies” with workflow-spec classification language in both affected skills.

Validation run:

jq empty prpm.json
Metadata/file/README version consistency check across all 22 package entries
Repo-wide stale phrase scan for old mandatory/unconditional review-loop wording

agent-relay-bot · 2026-06-01T12:37:37Z

✅ pr-reviewer applied fixes — committed and pushed 51cb9e4 to this PR. The notes below describe what changed.

Reviewed PR #63 locally and fixed the docs consistency issues I found.

Changes made:

Renamed the stale unconditional “Mandatory Claude-Then-Codex” common-pattern section to “Deep-Tier Claude-Then-Codex”.
Clarified that the full Claude-then-Codex loop example applies when selected review depth is deep.
Replaced generic-skill references to “Ricky selects/classifies” with workflow-spec classification language in both affected skills.

Validation run:

jq empty prpm.json
Metadata/file/README version consistency check across all 22 package entries
Repo-wide stale phrase scan for old mandatory/unconditional review-loop wording

agent-relay-bot

✅ pr-reviewer applied fixes — committed and pushed 51cb9e4 to this PR. The notes below describe what changed.

Reviewed PR #63 locally and fixed the docs consistency issues I found.

Changes made:

Renamed the stale unconditional “Mandatory Claude-Then-Codex” common-pattern section to “Deep-Tier Claude-Then-Codex”.
Clarified that the full Claude-then-Codex loop example applies when selected review depth is deep.
Replaced generic-skill references to “Ricky selects/classifies” with workflow-spec classification language in both affected skills.

Validation run:

jq empty prpm.json
Metadata/file/README version consistency check across all 22 package entries
Repo-wide stale phrase scan for old mandatory/unconditional review-loop wording

chore: port review-depth skill updates

6b313ef

gemini-code-assist Bot reviewed Jun 1, 2026

View reviewed changes

devin-ai-integration Bot reviewed Jun 1, 2026

View reviewed changes

chore: apply pr-reviewer fixes for #63

9bc4ff5

agent-relay-bot Bot reviewed Jun 1, 2026

View reviewed changes

cubic-dev-ai Bot reviewed Jun 1, 2026

View reviewed changes

Comment thread skills/writing-agent-relay-workflows/SKILL.md

chore: apply pr-reviewer fixes for #63

51cb9e4

agent-relay-bot Bot reviewed Jun 1, 2026

View reviewed changes

docs: clarify review-depth workflow contract

8d2d333

khaliqgant merged commit 4a367b0 into main Jun 1, 2026
2 checks passed

khaliqgant deleted the codex/port-pr145-review-depth-skills branch June 1, 2026 14:58

	8. Run the selected review-depth fresh-eyes loops. Deep-tier workflows run the full Claude loop followed by the Codex loop; light and standard generated workflows may scale down only when Ricky selected that tier.
	8. Run the selected review-depth fresh-eyes loops. Deep-tier workflows run the full Claude loop followed by the Codex loop; light and standard generated workflows may scale down only when Ricky selects that tier.

	\| Skipping required review-depth loops \| Add the review/fix loop required for the generated review depth after repairable verification and before final acceptance, commit, PR creation, or handoff; deep tier requires sequential Claude-then-Codex fresh-eyes loops \|
	\| Skipping required review-depth loops \| Add the review/fix loop required for the selected review depth after repairable verification and before final acceptance, commit, PR creation, or handoff; deep tier requires sequential Claude-then-Codex fresh-eyes loops \|

Conversation

khaliqgant commented Jun 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Validation

Uh oh!

coderabbitai Bot commented Jun 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Estimated code review effort

Possibly related PRs

Poem

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot Jun 1, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot Jun 1, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot Jun 1, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot Jun 1, 2026

Choose a reason for hiding this comment

Uh oh!

devin-ai-integration Bot left a comment

Choose a reason for hiding this comment

Uh oh!

devin-ai-integration Bot Jun 1, 2026

Choose a reason for hiding this comment

Uh oh!

agent-relay-bot Bot commented Jun 1, 2026

Uh oh!

agent-relay-bot Bot commented Jun 1, 2026

Uh oh!

agent-relay-bot Bot left a comment

Choose a reason for hiding this comment

Uh oh!

cubic-dev-ai Bot left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

agent-relay-bot Bot commented Jun 1, 2026

Uh oh!

agent-relay-bot Bot commented Jun 1, 2026

Uh oh!

agent-relay-bot Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

khaliqgant commented Jun 1, 2026 •

edited

Loading

coderabbitai Bot commented Jun 1, 2026 •

edited

Loading

cubic-dev-ai Bot left a comment •

edited

Loading