Skip to content

v0.2.3

Choose a tag to compare

@PaParaZz1 PaParaZz1 released this 04 Jan 06:43
· 603 commits to main since this release

API Change

  1. move actor_head_type to action_space (which is related DDPG/TD3/SAC)
  2. add multiple seeds in CLI: ding -m serial -c cartpole_dqn_config.py -s 0 -s 1 -s 2
  3. add new replay buffer (which separates algorithm and storage), user can refer to buffer
  4. add new main pipeline for async/parallel framework tutorial

Env (dizoo)

  1. add multi-agent mujoco env (#146)
  2. add delay reward mujoco env (#145)
  3. fix port conflict in gym_soccer (#139)

Algorithm

  1. MASAC algorithm (#112)
  2. TREX IRL algorithm (#119) (#144)
  3. H-PPO hybrid action space algorithm (#140)
  4. residual link in R2D2 (#150)
  5. gumbel softmax (#169)
  6. move actor_head_type to action_space field

Feature

  1. new main pipeline and async/parallel framework (#142) (#166) (#168)
  2. refactor buffer, separate algorithm and storage (#129)
  3. cli in new pipeline(ditask) (#160)
  4. add multiprocess tblogger, fix circular reference problem (#156)
  5. add multiple seed cli
  6. polish eps_greedy_multinomial_sample in model_wrapper (#154)

Fix

  1. R2D3 abs priority problem (#158) (#161)
  2. multi-discrete action space policies random action bug (#167)
  3. doc generate bug with enum_tools (#155)

Style

  1. more comments about R2D2 (#149)
  2. add doc about how to migrate a new env link
  3. add doc about env tutorial in dizoo link
  4. add conda auto release (#148)
  5. udpate zh doc link
  6. update kaggle tutorial link

New Repo

  1. awesome-model-based-RL: A curated list of awesome Model-Based RL resources
  2. DI-smartcross: Decision AI in Traffic Light Control

Contributors: @PaParaZz1 @sailxjx @puyuan1996 @Will-Nie @Weiyuhong-1998 @LikeJulia @RobinC94 @LuciusMos @mingzhang96 @shgqmrf15 @zjowowen