Code for the paper "Exploration by Random Network Distillation"
Latest commit ad0b919 Oct 31, 2018

Exploration by Random Network Distillation

Yuri Burda*, Harri Edwards*, Amos Storkey, Oleg Klimov
*equal contribution

University of Edinburgh

Installation and Usage

The following command should train an RND agent on Montezuma's Revenge:

python --gamma_ext 0.999

To use more than one GPU or machine, launch the same command under MPI. For example, mpiexec -n 8 python --num_env 128 --gamma_ext 0.999 should use 1024 parallel environments (8 MPI processes with 128 environments each) to collect experience on an 8-GPU machine.
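The environment count in the MPI example follows from simple multiplication. The sketch below is only an illustration, not code from this repository: total_parallel_envs is a hypothetical helper, and it assumes each MPI process creates its own --num_env environments, which is what the 8 x 128 = 1024 figure above implies.

```python
def total_parallel_envs(num_mpi_processes: int, num_env_per_process: int) -> int:
    """Return the total number of environments across all MPI processes.

    Hypothetical helper: assumes every MPI process (the -n argument to
    mpiexec) creates its own --num_env environments.
    """
    if num_mpi_processes < 1 or num_env_per_process < 1:
        raise ValueError("both counts must be positive integers")
    return num_mpi_processes * num_env_per_process


# The example from the text: mpiexec -n 8 ... --num_env 128
print(total_parallel_envs(8, 128))  # 1024
```

Under the same assumption, halving --num_env to 64 while keeping 8 processes would give 512 environments in total.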

Blog post and videos