feat(experiments): wire eval_model config field to evaluator construction by bug-ops · Pull Request #2124 · bug-ops/zeph

bug-ops · 2026-03-22T13:38:08Z

Summary

When experiments.eval_model is set (e.g. openai/gpt-4o, claude/claude-opus-4-6), a dedicated judge provider is created for the evaluator so the judge is independent from the agent under test
Falls back to the agent's primary provider when eval_model is not configured (existing behavior preserved, no breaking changes)
Wired in both paths: agent loop (/experiment start) and standalone CLI (--experiment-run)
Removes the TODO(#eval-model) comment from experiment_cmd.rs

Changes

ExperimentState: new eval_provider: Option<AnyProvider> field (feature-gated)
AppBuilder::build_eval_provider(): creates provider from eval_model spec using existing create_summary_provider (supports ollama/<model>, claude, openai, compatible/<name>)
Agent::with_eval_provider(): builder method to inject the eval provider
runner.rs: wired in agent build path and run_experiment_session

Test plan

All 6366 existing tests pass (cargo nextest run --workspace --features full --lib --bins)
cargo clippy --workspace --features full -- -D warnings clean
cargo +nightly fmt --check clean
Feature is enabled = false by default — no production impact

…tion (#2113) When experiments.eval_model is set, create a dedicated judge provider so the evaluator is independent from the agent under test. Falls back to the primary provider when eval_model is unset (existing behavior preserved). - Add eval_provider field to ExperimentState (feature-gated) - Add AppBuilder::build_eval_provider() using create_summary_provider - Add Agent::with_eval_provider() builder method - Wire eval provider in both agent (/experiment start) and --experiment-run paths - Remove the TODO(#eval-model) comment from experiment_cmd.rs

bug-ops enabled auto-merge (squash) March 22, 2026 13:38

github-actions bot added enhancement New feature or request size/M Medium PR (51-200 lines) documentation Improvements or additions to documentation rust Rust code changes core zeph-core crate and removed size/M Medium PR (51-200 lines) labels Mar 22, 2026

bug-ops force-pushed the feat-issue-2113-wire-eval-model-to-evaluator branch from ede64f7 to f0004d0 Compare March 22, 2026 13:43

github-actions bot added the size/M Medium PR (51-200 lines) label Mar 22, 2026

bug-ops merged commit f6bc748 into main Mar 22, 2026
25 checks passed

bug-ops deleted the feat-issue-2113-wire-eval-model-to-evaluator branch March 22, 2026 13:51

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(experiments): wire eval_model config field to evaluator construction#2124

feat(experiments): wire eval_model config field to evaluator construction#2124
bug-ops merged 1 commit intomainfrom
feat-issue-2113-wire-eval-model-to-evaluator

bug-ops commented Mar 22, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

bug-ops commented Mar 22, 2026

Summary

Changes

Test plan

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant