Argument engine for qvra.
bench exists to create constrained comparison, reproducible measurement, and technical pressure.
It is where claims get tested in public.
Artifacts in bench should provide one or more of:
- benchmark harnesses
- reproducible comparisons
- methodology notes
- challenger slots
- before/after evaluation
- reference measurements worth citing
bench is not:
- marketing theater
- vague faster-than claims without method
- screenshots of results with no way to reproduce them
- prestige signaling through numbers alone
Anything promoted into bench should satisfy these:
- explicit scope
- explicit comparison target or baseline
- explicit method
- outputs that can be checked
- a reason the result matters
bench is one of the gravitational anchors.
It creates discussion, links, credibility, retesting, and technical argument under pressure.
Related routes:
runfor utility surfaces users can touch directlylabfor experiments that may later deserve measurementshowfor visible demos of what is being measuredpulsefor the public rhythm of what changed
No benchmark belongs here unless the method is strong enough that disagreement can become productive instead of theatrical.
suites/line-count— reproducible comparison betweenwc -land pure Python line counting on generated input
Active anchor surface.