feat(inference): add benchmark performance guardrails #219
TobiBu merged 4 commits into feat/posterior-predictive-outputs
Conversation
Pull request overview
Adds an inference “performance guardrails” utility to evaluate benchmark results against configurable runtime/objective thresholds, with tests and documentation so regressions can be detected early in CI or local benchmarking workflows.
Changes:
- Introduces `rubix.inference.performance_guardrails` with threshold dataclasses and pass/fail check helpers for optimization and VI benchmarks (a usage sketch follows this list).
- Exports the guardrail APIs from `rubix.inference` for public use and adds Sphinx docs coverage.
- Adds unit tests covering pass/fail scenarios for both optimization and VI guardrail checks.
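For orientation, a minimal usage sketch. Only the names `ObjectiveThresholds` and `check_ifu_optimization_guardrails` appear in this PR; the metrics shape, the call signature, and the result's `passed` attribute are assumptions, not the merged API.

```python
# Hypothetical sketch -- the metrics dict, call signature, and result shape
# are assumptions; only the imported names are confirmed by this PR.
from rubix.inference import (
    ObjectiveThresholds,
    check_ifu_optimization_guardrails,
)

# Example benchmark output; field names are illustrative.
metrics = {"final_loss": 4.2e-4, "best_loss": 3.1e-4}

# Fail the run if the best loss exceeds 1e-3.
thresholds = ObjectiveThresholds(max_best_loss=1e-3)

report = check_ifu_optimization_guardrails(metrics, thresholds)
if not report.passed:  # `passed` is referenced in the review comment below
    raise RuntimeError(f"Performance guardrails failed: {report}")
```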
Reviewed changes
Copilot reviewed 5 out of 5 changed files in this pull request and generated 1 comment.
| File | Description |
|---|---|
| `rubix/inference/performance_guardrails.py` | New guardrail checking utilities and threshold/result dataclasses. |
| `rubix/inference/__init__.py` | Re-exports the guardrail APIs at the package level. |
| `tests/test_inference_performance_guardrails.py` | Unit tests for guardrail pass/fail behavior. |
| `docs/rubix.inference.rst` | Adds the new module to the Sphinx API docs. |
| `docs/inference_workflows.rst` | Documents the intended usage pattern for guardrails in workflows. |
```python
from dataclasses import dataclass
from typing import Optional

@dataclass(frozen=True)
class ObjectiveThresholds:
    """Thresholds for objective quality checks."""

    max_final_loss: Optional[float] = None
    max_best_loss: Optional[float] = None
    max_final_objective: Optional[float] = None
    max_best_objective: Optional[float] = None
```
ObjectiveThresholds mixes optimization loss thresholds (max_final_loss/max_best_loss) with VI objective thresholds (max_final_objective/max_best_objective), but each guardrail checker only evaluates a subset. As written, setting the “wrong” fields for a given checker will be silently ignored and can incorrectly report passed=True. Consider splitting this into two threshold dataclasses (optimization vs VI), or add explicit validation in check_ifu_optimization_guardrails/check_vi_guardrails to raise a ValueError when irrelevant threshold fields are set.
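One possible shape of the validation option, as a sketch: the threshold field names follow the diff above, but the checker's real signature and the `metrics` argument are assumptions.

```python
def check_ifu_optimization_guardrails(metrics, thresholds: ObjectiveThresholds):
    # Reject VI-only thresholds instead of silently ignoring them.
    if (
        thresholds.max_final_objective is not None
        or thresholds.max_best_objective is not None
    ):
        raise ValueError(
            "max_final_objective/max_best_objective are VI thresholds and are "
            "not evaluated here; use check_vi_guardrails or leave them unset."
        )
    ...  # existing loss checks against max_final_loss / max_best_loss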
@copilot apply changes based on this feedback
feat(inference): add synthetic science recipe workflow
Agent-Logs-Url: https://github.com/AstroAI-Lab/rubix/sessions/13c7f31b-922d-4a76-8508-d6d549378715
Co-authored-by: TobiBu <7574273+TobiBu@users.noreply.github.com>
@copilot can you try again to apply changes based on the feedback in this PR?
The changes are applied in the commit above. This removes the silent-ignore issue: passing the wrong threshold type to a checker now results in a type mismatch rather than an incorrect pass/fail report. Tests and docs have been updated accordingly.
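A sketch of how the described split could look; the commit's actual class names are not shown in this thread, so these are illustrative.

```python
from dataclasses import dataclass
from typing import Optional

# Separate frozen dataclasses per checker: passing VI thresholds to the
# optimization checker becomes a visible type mismatch rather than a no-op.
@dataclass(frozen=True)
class OptimizationThresholds:
    max_final_loss: Optional[float] = None
    max_best_loss: Optional[float] = None

@dataclass(frozen=True)
class VIThresholds:
    max_final_objective: Optional[float] = None
    max_best_objective: Optional[float] = None

def check_vi_guardrails(metrics: dict, thresholds: VIThresholds):
    ...

# A static checker (e.g. mypy) now flags this misuse:
# check_vi_guardrails(metrics, OptimizationThresholds(max_best_loss=1e-3))
```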
Merged commit 30e4be3 into feat/posterior-predictive-outputs