v0.5.0
What's New
Counterfactual Eval Harness (CLI)
memor eval-counterfactual --project <name>— win/tie/loss scoring vs no-memory baseline using LLM judge- Builds cases from real session data, measures do-no-harm rate
- Developer tool requiring
ANTHROPIC_API_KEY— not on the dashboard (stays true to "no API keys" product promise)
ROI Trend Sparkline (Dashboard)
- SVG area sparkline in the Token ROI hero section showing daily tool call reduction %
- Gradient fill, dashed grid lines, hover dots with tooltips — Robinhood-style fintech aesthetic
- Backed by
get_roi_trend()grouping turn_metrics by day
Tests
- 227 tests passing (+12 new: 8 counterfactual eval, 4 ROI trend)
Full Changelog: v0.4.1...v0.5.0