A PyTorch + Ray implementation of the training pipeline from the original SPIRAL paper (https://arxiv.org/abs/1804.01118).
Requires 2 GPUs: one for policy learning and one for discriminator learning.
Note that this training pipeline targets a single machine; population-based training (PBT) of hyperparameters is not implemented.
- Install https://github.com/deepmind/spiral following its instructions (the libmypaint environment is required)
- Copy all the Python scripts here into spiral/
- Download some data; see real_image_loader.py for the expected dataset location and format
- Adjust hyperparameters in config.py
- Run python spiral_torch.py
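For orientation, here is a hypothetical sketch of the kind of knobs config.py exposes. The field names are illustrative, not the actual ones; the numeric values echo the numbers mentioned elsewhere in this README (2 GPUs, 64 batches, 10 timesteps, MNIST digit 4, 15000 steps):

```python
# Illustrative sketch only -- the real config.py may use different names.
config = {
    "policy_gpu": 0,          # GPU used by the policy learner
    "discriminator_gpu": 1,   # GPU used by the discriminator learner
    "n_batches": 64,          # batches per policy training step
    "n_timesteps": 10,        # painting timesteps per episode
    "dataset": "mnist",       # see real_image_loader.py for the format
    "digit": 4,               # which MNIST class to train on
    "training_steps": 15000,  # total policy training steps
}
```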
Results after 15000 policy training steps on MNIST digit 4 (each training step consumes n_batches * n_timesteps = 64 * 10 frames):
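For scale, the quoted numbers imply the following frame counts (simple arithmetic, not a measured result):

```python
# Frames consumed per policy training step, from the numbers above.
n_batches = 64
n_timesteps = 10
frames_per_step = n_batches * n_timesteps

# Total frames over the full run of 15000 policy training steps.
total_steps = 15000
total_frames = frames_per_step * total_steps
print(frames_per_step, total_frames)
```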
- In the original paper, the discriminator trains faster than the policy because of the network structure. In this implementation, the discriminator trains faster because the policy spends most of its time waiting for batches from the painter agents.
- The policy trains on each trajectory only once, so training is on-policy (this is also why the policy spends so long waiting for batches). The paper, however, describes the training as off-policy.
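The single-use trajectory flow described above can be sketched as a queue between painter agents and the policy learner. This is an illustration only: the names are made up, and the real implementation coordinates via ray rather than a thread-backed queue.

```python
import queue
import threading

# Illustrative sketch: painter agents push trajectories into a queue and
# the policy learner pops each one exactly once, so no trajectory is ever
# replayed -- this is what keeps the training on-policy.
trajectories = queue.Queue(maxsize=8)

def painter_agent(n):
    for i in range(n):
        trajectories.put(f"trajectory-{i}")  # rollout from the current policy

def policy_learner(n):
    seen = []
    for _ in range(n):
        traj = trajectories.get()  # blocks while waiting for painters
        seen.append(traj)          # train on it once, then discard it
    return seen

t = threading.Thread(target=painter_agent, args=(4,))
t.start()
consumed = policy_learner(4)
t.join()
print(consumed)  # each trajectory appears exactly once, in FIFO order
```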
- https://github.com/werner-duvaud/muzero-general: where I learned about ray.
- https://github.com/eriklindernoren/PyTorch-GAN/blob/master/implementations/wgan_gp/wgan_gp.py: the WGAN-GP implementation I used.
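For context, WGAN-GP adds a gradient penalty that pushes the critic's gradient norm toward 1 at points interpolated between real and fake samples. Below is a minimal NumPy illustration of the penalty term using a linear critic, where the gradient is known in closed form; it explains the formula and is not code from this repo or from the linked implementation.

```python
import numpy as np

# For a linear critic D(x) = w . x, grad_x D(x) = w everywhere, so the
# WGAN-GP penalty  lambda * (||grad||_2 - 1)^2  reduces to
# lambda * (||w||_2 - 1)^2 at every interpolated point.
rng = np.random.default_rng(0)
w = np.array([3.0, 4.0])  # ||w||_2 = 5
lam = 10.0                # the gradient-penalty weight used in the paper

real = rng.normal(size=(8, 2))
fake = rng.normal(size=(8, 2))
eps = rng.uniform(size=(8, 1))
interp = eps * real + (1 - eps) * fake  # random points between real and fake

grad = np.tile(w, (len(interp), 1))     # gradient of the linear critic
grad_norm = np.linalg.norm(grad, axis=1)
penalty = lam * np.mean((grad_norm - 1.0) ** 2)
print(penalty)  # 10 * (5 - 1)^2 = 160.0
```

In the real PyTorch implementation the critic is nonlinear, so the gradient at each interpolated point is obtained with autograd rather than in closed form.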