GitHub - zikunukiz/train-procgen: Reinforcement Learning Generalization with Surprise Minimization

Algorithm Repo

Reinforcement Learning Generalization with Surprise Minimization

Algorithms (ppo2, ppo2_normal, ppo2_cvae) are in a separate repo forked from OpenAI Baselines: https://github.com/chenziku/baselines/tree/master/baselines/ppo2

Installing Dependencies and Setting Up

For fast experiments, run the following on Google Colab:

pip install procgen
git clone https://github.com/chenziku/train-procgen.git
pip uninstall -y imgaug
pip install 'imgaug<0.2.7,>=0.2.5'
pip install -e train-procgen
git clone https://github.com/chenziku/baselines.git
pip install tensorflow-gpu==1.15
pip install mpi4py
pip uninstall -y tensorflow_probability
pip install tensorflow_probability==0.8.0
pip install gputil
cd baselines

Training from scratch

Specify the training algorithm (ppo2, ppo2_normal, or ppo2_cvae) before the learn function in train.py or test.py.
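One way to make this choice explicit is to select the algorithm module by name. The sketch below is illustrative only: it assumes each algorithm lives in its own module under baselines.ppo2 (for example baselines/ppo2/ppo2_cvae.py) and exposes a learn function like upstream ppo2. Both points are assumptions about the fork, and the helper names are hypothetical.

```python
import importlib

# Algorithm names as listed in this README; the module layout is an assumption.
ALGORITHMS = ("ppo2", "ppo2_normal", "ppo2_cvae")


def algorithm_module_name(algorithm):
    """Map an algorithm name to its assumed module path in the baselines fork."""
    if algorithm not in ALGORITHMS:
        raise ValueError(
            "unknown algorithm {!r}; expected one of {}".format(algorithm, ALGORITHMS)
        )
    return "baselines.ppo2." + algorithm


def load_learn(algorithm):
    """Import the chosen module and return its learn function.

    Requires the baselines fork to be installed; assumes the module
    defines learn(), as upstream baselines.ppo2.ppo2 does.
    """
    module = importlib.import_module(algorithm_module_name(algorithm))
    return module.learn
```

In train.py or test.py, something like learn = load_learn("ppo2_cvae") would then stand in for a hard-coded import.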

Run the following for training from scratch (200 levels in easy mode on CoinRun, starting from level 0):

python -m train-procgen.train --env_name coinrun --distribution_mode easy --num_levels 200
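To make the level arguments concrete, the sketch below shows which level seeds a run can sample. It assumes the flags are passed through to Procgen's num_levels and start_level environment options, where num_levels=0 means unlimited levels. The helper names are hypothetical and not part of this repo.

```python
def level_seed_range(start_level, num_levels):
    """Return the level seeds an environment can sample, or None if unlimited.

    Follows the Procgen convention: num_levels=0 means "unlimited levels",
    otherwise seeds run from start_level to start_level + num_levels - 1.
    """
    if num_levels < 0:
        raise ValueError("num_levels must be >= 0")
    if num_levels == 0:
        return None
    return range(start_level, start_level + num_levels)


def procgen_kwargs(distribution_mode, start_level, num_levels):
    """Collect the Procgen environment options implied by the command-line flags."""
    return {
        "distribution_mode": distribution_mode,
        "start_level": start_level,
        "num_levels": num_levels,
    }


# Training command above: 200 levels in easy mode, starting from level 0.
train_seeds = level_seed_range(start_level=0, num_levels=200)
train_options = procgen_kwargs("easy", start_level=0, num_levels=200)

# With procgen installed, these options could be passed to the gym interface,
# e.g. gym.make("procgen:procgen-coinrun-v0", **train_options).
```

Because the training seeds run from 0 to 199, a test run that starts at level 1000 evaluates on levels the policy never saw during training.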

For PPO + VAE, you can also set the model and VAE paths in train_load.py, then run the following to train from a loaded policy and VAE (vae280/560):

python -m train-procgen.train_load --env_name coinrun --distribution_mode easy --num_levels 200

Run the following to test (starting from level 1000):

python -m train-procgen.test --env_name bossfight --distribution_mode easy --start_level 1000

See https://github.com/openai/train-procgen for other options.

References

OpenAI Procgen Benchmark: https://openai.com/blog/procgen-benchmark/

Train Procgen: https://github.com/openai/train-procgen

Surprise Minimization (SMiRL): https://bair.berkeley.edu/blog/2019/12/18/smirl/ https://arxiv.org/abs/1912.05510
