Pressure-test research claims with falsifiable evidence plans, adversarial checks, frozen verifiers, and proof ledgers.
benchmarking research verification research-tool codex autonomous-agents scientific-method ai-research falsification research-automation evals research-agent deepseek agent-skills claude-code agent-workflows proof-ledger
-
Updated
May 25, 2026 - JavaScript