ReasonForge — Interactive Test-Time Compute Laboratory. Live scaling curves, MCTS+PRM, Tree-of-Thoughts, Best-of-N verifier, Self-Refine reward-hacking — all on the Game-of-24 benchmark.
mcts tree-of-thoughts inference-scaling test-time-compute reasoning-models claude-code ai-built daily-webapp game-of-24
-
Updated
Apr 15, 2026 - JavaScript