v0.2.3

PaParaZz1 released this 04 Jan 06:43

· 603 commits to main since this release

c6947cd

API Change

move actor_head_type to action_space (which is related DDPG/TD3/SAC)
add multiple seeds in CLI: ding -m serial -c cartpole_dqn_config.py -s 0 -s 1 -s 2
add new replay buffer (which separates algorithm and storage), user can refer to buffer
add new main pipeline for async/parallel framework tutorial

Env (dizoo)

add multi-agent mujoco env (#146)
add delay reward mujoco env (#145)
fix port conflict in gym_soccer (#139)

Algorithm

MASAC algorithm (#112)
TREX IRL algorithm (#119) (#144)
H-PPO hybrid action space algorithm (#140)
residual link in R2D2 (#150)
gumbel softmax (#169)
move actor_head_type to action_space field

Feature

new main pipeline and async/parallel framework (#142) (#166) (#168)
refactor buffer, separate algorithm and storage (#129)
cli in new pipeline(ditask) (#160)
add multiprocess tblogger, fix circular reference problem (#156)
add multiple seed cli
polish eps_greedy_multinomial_sample in model_wrapper (#154)

Fix

R2D3 abs priority problem (#158) (#161)
multi-discrete action space policies random action bug (#167)
doc generate bug with enum_tools (#155)

Style

more comments about R2D2 (#149)
add doc about how to migrate a new env link
add doc about env tutorial in dizoo link
add conda auto release (#148)
udpate zh doc link
update kaggle tutorial link

New Repo

awesome-model-based-RL: A curated list of awesome Model-Based RL resources
DI-smartcross: Decision AI in Traffic Light Control

Contributors: @PaParaZz1 @sailxjx @puyuan1996 @Will-Nie @Weiyuhong-1998 @LikeJulia @RobinC94 @LuciusMos @mingzhang96 @shgqmrf15 @zjowowen

Contributors

sailxjx, mingzhang96, and 9 other contributors

Assets 2