Skip to content

Releases: DHDev0/Muzero

V1.1

19 Jan 07:38
08af790
Compare
Choose a tag to compare
  • Manage illegal move with negative reward

v1

07 Jan 05:35
Compare
Choose a tag to compare
v1
  • mcts stability on prior on multi sim
  • multi processor and gpu
  • support for all type of torch float
  • expand compatibility to atari and mujoco for 1D and 2D observation
  • add lstm,conv-lstm,transformer Model
  • fix bug on previous backprob
  • fix weight initialization on linear layer
  • add priority to actualize replay buffer from training