feat: add calibrated scoring with writer/judge model separation #27
Issue: #7
This commit contains uncommitted work left by a prior builder session that failed or was interrupted before completion. The new builder session will start from a clean working tree and can reference this commit for context.
❌ Changes Requested
This PR was created from a failed builder run (exit code 1, commit labeled "stale work from previous builder attempt"). While the type definitions and dimension constants are a good foundation, several issues need addressing before approval.
Issues
1. Missing
Closes #7
Changes
Commits
7117c2a [prior-run-checkpoint] stale work from previous builder attempt
Test plan