Description
Integrate balance validation into the CI pipeline: run automated sweeps on commits, compare the results against a stored baseline, and detect regressions. This gives continuous feedback on how code changes affect balance.
Acceptance Criteria
- GitHub Actions workflow runs automated AI tournament sweeps on relevant commits (e.g., changes to simulation, content, or AI systems)
- Workflow uses reduced parameter grid for speed (e.g., 3 difficulty presets, 3 strategies, 5 seeds, 100 ticks)
- Baseline results stored and versioned in repository or artifact storage
- Regression detection compares new sweep results to baseline using statistical thresholds
- Workflow fails with actionable error message if significant balance regressions detected
- Reports published as workflow artifacts or PR comments showing win rate deltas and flagged issues
- At least 5 tests covering baseline management, regression detection, and workflow integration
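The regression detection criterion could be sketched as follows. This is a minimal illustration, not the project's implementation: the `WinRecord` schema, the strategy names, and both thresholds (`min_delta`, `z_threshold`) are assumptions, and the two-proportion z-test is only one plausible choice of statistical threshold.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class WinRecord:
    """Wins and games played for one strategy (hypothetical schema)."""
    wins: int
    games: int

    @property
    def rate(self) -> float:
        return self.wins / self.games


def two_proportion_z(baseline: WinRecord, current: WinRecord) -> float:
    """z statistic for the win rate difference (current minus baseline)."""
    pooled = (baseline.wins + current.wins) / (baseline.games + current.games)
    variance = pooled * (1 - pooled) * (1 / baseline.games + 1 / current.games)
    if variance == 0:
        # Both samples are all wins or all losses, so there is no difference.
        return 0.0
    return (current.rate - baseline.rate) / math.sqrt(variance)


def detect_regressions(baseline, current, min_delta=0.10, z_threshold=1.96):
    """Return (strategy, delta, z) for shifts that are both large and significant.

    Both thresholds are assumed defaults and would live in configuration.
    """
    flagged = []
    for name, base in sorted(baseline.items()):
        new = current.get(name)
        if new is None:
            continue  # strategy missing from the new sweep; handle separately
        delta = new.rate - base.rate
        z = two_proportion_z(base, new)
        if abs(delta) >= min_delta and abs(z) >= z_threshold:
            flagged.append((name, delta, z))
    return flagged


# Hypothetical data: "aggressive" jumps from 50% to 75%, "defensive" barely moves.
baseline = {"aggressive": WinRecord(50, 100), "defensive": WinRecord(50, 100)}
current = {"aggressive": WinRecord(75, 100), "defensive": WinRecord(52, 100)}
flagged = detect_regressions(baseline, current)
```

Requiring both a minimum delta and a significance threshold keeps small but noisy shifts from failing the build. A CI step could exit with a non-zero status when `flagged` is non-empty and print each entry as the actionable error message.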
Priority
Medium (high value; was blocked on 11.3.1, which is now complete)
Dependencies
- ✅ 11.1.1 (Batch Parameter Sweeps) - Completed
- ✅ 11.3.1 (Analysis and Balance Reporting) - Completed 2025-12-04
Responsible
Development Team
Next Steps
- Design CI sweep subset (reduced parameter grid for fast feedback)
- Implement baseline storage and versioning strategy
- Create GitHub Actions workflow with scheduled and manual triggers
- Add regression threshold configuration
- Implement PR comment reporting for balance impacts
- Test workflow with realistic code changes
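As a starting point for the first step, the reduced CI grid could be enumerated like this. Only the counts (3 difficulty presets, 3 strategies, 5 seeds, 100 ticks) come from the acceptance criteria; the preset and strategy names, the seed values, and the dictionary layout are placeholders.

```python
from itertools import product

# Placeholder names; replace with the project's actual presets and strategies.
DIFFICULTIES = ("easy", "normal", "hard")
STRATEGIES = ("aggressive", "defensive", "balanced")
SEEDS = range(1, 6)  # 5 fixed seeds so runs are reproducible across commits
TICKS = 100


def ci_sweep_grid():
    """Return one run configuration per (difficulty, strategy, seed) combination."""
    return [
        {"difficulty": d, "strategy": s, "seed": seed, "ticks": TICKS}
        for d, s, seed in product(DIFFICULTIES, STRATEGIES, SEEDS)
    ]


grid = ci_sweep_grid()
total_ticks = sum(run["ticks"] for run in grid)
```

With these values the grid has 3 × 3 × 5 = 45 runs, or 4,500 simulated ticks per sweep, which gives a rough budget for judging whether the workflow is fast enough to run on every relevant commit.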
References
See .pm/tracker.md task 11.5.1 for complete details and context.