POMBU

Code to reproduce Deep 'Model-Based Reinforcement Learning via Estimated Uncertainty and Conservative Policy Optimization'.

Installation

Install MuJoCo at ~/.mujoco and copy the license key to ~/.mujoco/mjkey.txt.
Clone this RL-POMBU.
Create a conda environment by running

conda env create -f environmen.yml

Run

cd shells
./run_cheetah.sh

Remarks

There is some obsolete code. We will refactor the code in winter vacation.
We provide a new version of the paper with the appendix in arxiv.