💡 Opus 4.8 Engine Upgrade: Dynamic Workflows + 4x Honesty for Review Quality #793
Replies: 2 comments
-
|
I agree with the approach, but do not want to enable the fast tier as we do not have any need for an Express PR review capability. The bottleneck in PR reviews is not the speed of the model. |
Beta Was this translation helpful? Give feedback.
-
|
📋 Initiative planned by the BMAD Scrum Master (Bob). Epic #835 — Opus 4.8 engine upgrade: validate the deep-review tier, finish the reference rollout, and keep cost data accurate 3 stories created (inert — labelled
Open questions for review:
Review the epic and its sub-issue DAG, adjust as needed, then add |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Summary
Upgrade engine.sh model references from Opus 4.7 to Opus 4.8 (shipped May 28, 2026) to leverage two key capabilities: Dynamic Workflows (model-internal parallel subagent orchestration for large diffs) and a 4x reduction in missed flawed code. Fast Mode pricing dropped from $30/$150 to $10/$50 per MTok, making speed-optimized Opus viable for latency-sensitive review tiers.
Market Signal
Opus 4.8 benchmarks (May 28, 2026 release):
User Signal
model-pricing.tsvprovides the cost data infrastructure for pricing updatesTechnical Opportunity
engine.shalready supports model selection per tier. Updating model references and adding Opus 4.8 pricing tomodel-pricing.tsvis a low-risk change. The Dynamic Workflows capability requires no code changes — it activates automatically when Opus 4.8 encounters complex multi-part tasks, improving deep-review quality without prompt engineering changes. Fast Mode viability at $10/$50 opens a new "express" tier option for latency-sensitive reviews (small hotfix PRs, security patches).Assessment
Adversarial Review
Strongest objection: Model upgrades risk regression — prompts were tuned for Opus 4.7 behavior. Dynamic Workflows is model-internal and not controllable, meaning review output structure could change unpredictably.
Rebuttal: The skill eval framework (#583, #585, #586) exists precisely to catch regressions from model upgrades. Run the triage and deep-review eval suites against Opus 4.8 before promoting. Dynamic Workflows is triggered by task complexity — simple reviews won't activate it. The 4x honesty improvement alone justifies the upgrade even without Dynamic Workflows.
Suggested Next Step
Run the triage and deep-review skill eval suites against Opus 4.8, compare quality scores to Opus 4.7 baseline, and update
model-pricing.tsvwith Opus 4.8 rates including the new Fast Mode pricing.Beta Was this translation helpful? Give feedback.
All reactions