Code for [NeurIPS'2019 Spotlight] Policy Continuation with Hindsight Inverse Dynamics
policy-gradient
reinforcement-learning-algorithms
hindsight-experience-replay
hindsight-policy-gradients
neurips-2019
supervised-learning-rl
-
Updated
Jan 7, 2020 - Jupyter Notebook