Code to reproduce Deep 'Model-Based Reinforcement Learning via Estimated Uncertainty and Conservative Policy Optimization'.
- Install MuJoCo at ~/.mujoco and copy the license key to ~/.mujoco/mjkey.txt.
- Clone this RL-POMBU.
- Create a conda environment by running
conda env create -f environmen.yml
cd shells
./run_cheetah.sh
- There is some obsolete code. We will refactor the code in winter vacation.
- We provide a new version of the paper with the appendix in arxiv.