This repo documents presentable experiments in the reinforcement learning (RL) training of players for the hironaka project (https://github.com/honglu2875/hironaka).
Blowup trees of some checkpoints of AlphaZero agents
Small scale, against choosefirst v0
Small scale with varying batch sizes
AlphaZero-style RL (notes)
The blowup tree of the E8 surface singularity using the Zeillinger policy