Skip to content

docs: standardize model-judge calibration reporting#12

Merged
aryeko merged 1 commit into
mainfrom
docs/calibration-reporting-v015
Jul 4, 2026
Merged

docs: standardize model-judge calibration reporting#12
aryeko merged 1 commit into
mainfrom
docs/calibration-reporting-v015

Conversation

@aryeko

@aryeko aryeko commented Jul 4, 2026

Copy link
Copy Markdown
Contributor

Summary

  • add shared manual pointwise model-judge calibration and reporting guidance
  • clarify covered/partial/missing/contradicted/unknown interpretation and expected-bad fixture handling
  • update eval-kit skills to report deterministic evidence first and keep raw provider bundles local unless curated
  • bump package/version references for v0.1.5

Verification

  • pnpm install --frozen-lockfile
  • pnpm check
  • git diff --check
  • rg sweeps for stale v0.1.4 current examples, default-config toggling guidance, and calibration wording

Notes

Docs/skills-only release. No runtime behavior, schemas, prompts, fixtures, or consumer semantics changed.

@aryeko aryeko merged commit 272bad8 into main Jul 4, 2026
1 check passed
@aryeko aryeko deleted the docs/calibration-reporting-v015 branch July 4, 2026 01:45
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant