Stochastic Lower Bound Optimization
This is the TensorFlow implementation of the paper "Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees". A PyTorch version will be released later.
- OpenAI Baselines
- rllab (commit number
- MuJoCo (1.5)
- TensorFlow (>= 1.9)
- NumPy (>= 1.14.5)
- Python 3.6
Before running, please make sure that baselines are available in your Python environment.
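One way to check availability before launching a run is to ask Python whether each package can be located, without importing it. The following is a minimal sketch, not part of this repository: the helper name find_missing and the top-level module names in REQUIRED_MODULES are assumptions based on the requirements listed above.

```python
import importlib.util

# Assumed top-level module names for the dependencies listed above.
REQUIRED_MODULES = ["baselines", "rllab", "tensorflow", "numpy"]


def find_missing(modules):
    """Return the module names that cannot be located on the current Python path."""
    return [name for name in modules if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = find_missing(REQUIRED_MODULES)
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All required modules were found.")
```

If a module is reported missing, installing it or adding its checkout directory to PYTHONPATH is the usual remedy.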
python main.py -c configs/algos/slbo.yml configs/envs/half_cheetah.yml -s log_dir=/tmp
If you want to change hyper-parameters, you can either modify the corresponding yml file or override them temporarily by appending, for example, model.hidden_sizes='[1000,1000]' to the command line.
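To illustrate what a dotted override such as model.hidden_sizes='[1000,1000]' means, the sketch below applies key=value strings to a nested configuration dictionary. It is a hypothetical illustration of the idea, not the parsing code used by main.py: the functions parse_value and apply_overrides and the default values in config are assumptions made for the example.

```python
import ast


def parse_value(text):
    """Parse a Python literal such as [1000,1000] or 0.5; fall back to the raw string."""
    try:
        return ast.literal_eval(text)
    except (ValueError, SyntaxError):
        return text


def apply_overrides(config, overrides):
    """Apply 'a.b.c=value' strings to a nested dict in place and return it."""
    for item in overrides:
        dotted_key, _, raw = item.partition("=")
        *parents, leaf = dotted_key.split(".")
        node = config
        for part in parents:
            # Create intermediate sections when they do not exist yet.
            node = node.setdefault(part, {})
        node[leaf] = parse_value(raw)
    return config


# Hypothetical defaults standing in for values loaded from a yml file.
config = {"model": {"hidden_sizes": [500, 500]}, "log_dir": None}

# The shell removes the single quotes, so the program sees [1000,1000].
apply_overrides(config, ["log_dir=/tmp", "model.hidden_sizes=[1000,1000]"])
print(config)
```

Assuming overrides follow the -s flag in the same way log_dir does, the full command would read: python main.py -c configs/algos/slbo.yml configs/envs/half_cheetah.yml -s log_dir=/tmp model.hidden_sizes='[1000,1000]'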
See LICENSE for additional details.