v0.2.3
API Change
- move
actor_head_typetoaction_space(which is related DDPG/TD3/SAC) - add multiple seeds in CLI:
ding -m serial -c cartpole_dqn_config.py -s 0 -s 1 -s 2 - add new replay buffer (which separates algorithm and storage), user can refer to buffer
- add new main pipeline for async/parallel framework tutorial
Env (dizoo)
- add multi-agent mujoco env (#146)
- add delay reward mujoco env (#145)
- fix port conflict in gym_soccer (#139)
Algorithm
- MASAC algorithm (#112)
- TREX IRL algorithm (#119) (#144)
- H-PPO hybrid action space algorithm (#140)
- residual link in R2D2 (#150)
- gumbel softmax (#169)
- move actor_head_type to action_space field
Feature
- new main pipeline and async/parallel framework (#142) (#166) (#168)
- refactor buffer, separate algorithm and storage (#129)
- cli in new pipeline(ditask) (#160)
- add multiprocess tblogger, fix circular reference problem (#156)
- add multiple seed cli
- polish eps_greedy_multinomial_sample in model_wrapper (#154)
Fix
- R2D3 abs priority problem (#158) (#161)
- multi-discrete action space policies random action bug (#167)
- doc generate bug with enum_tools (#155)
Style
- more comments about R2D2 (#149)
- add doc about how to migrate a new env link
- add doc about env tutorial in dizoo link
- add conda auto release (#148)
- udpate zh doc link
- update kaggle tutorial link
New Repo
- awesome-model-based-RL: A curated list of awesome Model-Based RL resources
- DI-smartcross: Decision AI in Traffic Light Control
Contributors: @PaParaZz1 @sailxjx @puyuan1996 @Will-Nie @Weiyuhong-1998 @LikeJulia @RobinC94 @LuciusMos @mingzhang96 @shgqmrf15 @zjowowen