v0.3.0
v0.3.0
Adds three eval-quality features:
scriptobjective assertions for deterministic repo-owned oracle commands, gated by explicit--allow-scripts.- Prompt/assertion leakage lint in
validateandaudit-manifest; use--strict-leakageto fail validation. skill-benchmark judge --judge-cmd ...for pluggable judge backends that emitjudge-results.jsonlcompatible with existing benchmark merging.
Also includes tests and README updates.