MGBench v0.1.0

ostinatocc released this 22 Jun 07:22

· 5 commits to main since this release

ad04a8f

MGBench v0.1.0 freezes the first public memory-governance benchmark release.

Highlights:

608 frozen deterministic scenarios across 8 governance suites.
Deterministic scoring with no LLM judge dependency.
Reference reports for Aionis, raw memory, no memory, Mem0, Supermemory, Graphiti, and Tencent Agent Memory where reports are available.
Public adapter and benchmark contracts for external memory-system evaluations.

Current public reports:

reports/mgbench-v0.1-current.md
reports/mgbench-v0.1-current.json

Interpretation boundary:

MGBench measures memory governance, not patch success.
Filtered competitor modes are labeled external_host when lifecycle/filter knowledge is supplied by the caller.
Raw per-suite metrics are primary evidence; aggregate MGBench score is a compact ranking for the current manifest.

Assets 2