Two proactive review reflexes, validated with an eval-driven loop. The skill's leverage on a strong base model is consistency and proactivity, not new knowledge — so these are behavioral forcing functions.
Added
- The clutter audit — a countable method for density judgment (border-nesting depth, signals-per-state, always-on markers, chrome-vs-data ratio, the removal test). "Feels busy" becomes named, specific cuts.
- Responsive design — breakpoints and the floor — the breakpoint ladder (wide / standard / narrow / too-small) and mechanics (relative units, load-bearing priority,
SIGWINCH, 80×24 minimum + too-small message). SKILL.mdapplies both reflexes to any layout review even when the user only asked about something else — the most-missed behavior in practice. Review checklist + decision flow reference them so they fire from every entry point.evals/evals.json— the design-review eval set used to validate the change.
See CHANGELOG.md.