Replay-certified self-modification PoC: public-evidence admission gate with generated comparator covers, lower-tail checks, safe/unsafe growth fallback, and target-witness diagnostics.
python auditing benchmarking research replay ai-agents risk-evaluation llm ollama agent-evaluation gemma3 self-modification
-
Updated
Apr 2, 2026 - Python