v0.3.1
API Change
- Substitute gym.wrappers.RecordVideo for gym.wrappers.Monitor to save video replay
- Substitute policy/bc.py for policy/il.py and update relevant serial_pipeline and unittest
- Polish all the configurations in dizoo with our new config guideline
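
For code that previously wrapped environments with gym.wrappers.Monitor, the first change above could look roughly like the sketch below. It assumes a gym version that ships gym.wrappers.RecordVideo (0.20 or later). The helper names every_n_episodes and make_recorded_env, the environment id, and the folder are hypothetical illustrations, not part of DI-engine.

```python
from typing import Callable


def every_n_episodes(n: int) -> Callable[[int], bool]:
    """Build an episode trigger that records episode 0, n, 2n, ... (hypothetical helper)."""
    if n <= 0:
        raise ValueError("n must be positive")

    def trigger(episode_id: int) -> bool:
        return episode_id % n == 0

    return trigger


def make_recorded_env(env_id: str, video_folder: str, every: int = 10):
    """Create a gym env whose episodes are saved as video (hypothetical helper).

    gym is imported lazily so the trigger above can be used without gym installed.
    """
    import gym
    from gym.wrappers import RecordVideo

    env = gym.make(env_id)
    # Old style (removed from recent gym releases):
    #   env = gym.wrappers.Monitor(env, video_folder, video_callable=..., force=True)
    # New style: RecordVideo takes an episode_trigger instead of video_callable.
    return RecordVideo(
        env,
        video_folder=video_folder,
        episode_trigger=every_n_episodes(every),
        name_prefix="replay",
    )
```

A call such as make_recorded_env("CartPole-v1", "./video", every=5) would then record every fifth episode. Actual encoding depends on the video backend installed alongside gym.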
Env (dizoo)
- polish and standardize dizoo config (#252) (#255) (#249) (#246) (#262) (#261) (#266) (#273) (#263) (#280) (#259) (#286) (#277) (#290) (#289) (#299)
- add GRF academic env and config (#281)
- update env interface of GRF (#258)
- update D4RL offline RL env and config (#285)
- polish PomdpAtariEnv (#254)
Algorithm
- DREX Inverse RL algorithm (#218)
Feature
- separate mq and parallel modules, add redis (#247)
- rename env variables; fix attach_to parameter (#244)
- env implementation check (#275)
- adjust and set the max column number of tabulate in log (#296)
- speed up GTrXL forward method + GRU unittest (#253) (#292)
- add drop_extra option for sample collect
Fix
- add act_scale in DingEnvWrapper; fix envpool env manager (#245)
- auto_reset=False and env_ref bug in env manager (#248)
- data type and deepcopy bug in RND (#288)
- share_memory bug and multi_mujoco env (#279)
- some bugs in GTrXL (#276)
- update gym_vector_env_manager and add more unittest (#241)
- mdpolicy random collect bug (#293)
- gym.wrapper save video replay bug
- collect abnormal step format bug and add unittest
Test
- add buffer benchmark & socket test (#284)
Style
Contributors: @PaParaZz1 @sailxjx @puyuan1996 @Will-Nie @davide97l @hiha3456 @zjowowen @Weiyuhong-1998 @LuciusMos @kxzxvbk @lixl-st @YinminZhang @song2181 @Hcnaeg @norman26625 @jayyoung0802 @RobinC94 @HansBug