Summary
Define Autoresearch as a metric-experiment preset in the RunContract Harness.
Parent roadmap: #114
Track: C - Preset evolution
Related: #118, #167, #172
Problem
Autoresearch is the bounded experiment/optimization preset, but its RunContract-facing contract is not explicit. The harness needs a clear preset boundary for metric selection, benchmark execution, experiment ledger evidence, stopping rules, and anti-Goodhart safeguards.
Scope
- Define the Autoresearch metric-experiment preset contract.
- Specify required inputs: goal, benchmark or metric target, constraints, evidence standard, stop conditions, and acceptable trade-offs.
- Specify required artifacts such as contract, benchmark/checks, experiment ledger, ideas, decision report, and verification report.
- Define how objective/evaluation signals can inform Autoresearch without becoming hidden hard-blocking authority.
- Identify follow-up implementation slices if existing Autoresearch behavior needs alignment.
Non-goals
- No neural/RL learning implementation.
- No unbounded autonomous research loop.
- No score-only completion authority.
- No runtime plugin/module retirement behavior.
- No command rename or storage-root migration.
Acceptance criteria
Verification
Summary
Define Autoresearch as a metric-experiment preset in the RunContract Harness.
Parent roadmap: #114
Track: C - Preset evolution
Related: #118, #167, #172
Problem
Autoresearch is the bounded experiment/optimization preset, but its RunContract-facing contract is not explicit. The harness needs a clear preset boundary for metric selection, benchmark execution, experiment ledger evidence, stopping rules, and anti-Goodhart safeguards.
Scope
Non-goals
Acceptance criteria
Verification
docs/runcontract-harness-evaluator.md.