v0.3.1
What's Changed
- docs(news): V2 leaderboard ships + V2Trace dataset announce by @reacher-z in #149
- docs: add 'What are you looking for?' audience-targeted entry grid by @reacher-z in #144
- docs: scoring logic — 2-stage rubric, judge prompt, reproducibility by @reacher-z in #148
- Docs/add logo by @Perry2004 in #151
- fix: README scoring.md links → eval/scoring.md by @reacher-z in #150
- docs: update readme FAQ by @Perry2004 in #156
- docs(news): announce TIGER-Lab/ClawBench canonical Space + collection by @reacher-z in #155
- fix: remove ASPCA-related tasks by @Perry2004 in #161
- docs: v0.3.1 patch changelog by @Perry2004 in #162
- build: v0.3.1 patch release by @Perry2004 in #163
Full Changelog: https://github.com/reacher-z/ClawBench/blob/main/CHANGELOG.md