ppo-dice

Please use the hyperparameters from this README. With other hyperparameters, things might not work (it's RL after all)!

This repo contains a PyTorch implementation of the paper:

Stable Policy Optimization via Off-Policy Divergence Regularization. Ahmed Touati, Amy Zhang, Joelle Pineau and Pascal Vincent. UAI 2020.

@article{touati2020stable,
  title={Stable Policy Optimization via Off-Policy Divergence Regularization},
  author={Touati, Ahmed and Zhang, Amy and Pineau, Joelle and Vincent, Pascal},
  journal={arXiv preprint arXiv:2003.04108},
  year={2020}
}

Requirements

Python 3 (it might work with Python 2, but I didn't test it)
PyTorch
OpenAI baselines

To install the requirements, run:

# PyTorch
conda install pytorch torchvision -c soumith

# Baselines for Atari preprocessing
git clone https://github.com/openai/baselines.git
cd baselines
pip install -e .

# Other requirements
pip install -r requirements.txt
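
After installing, a quick check that the main dependencies can be found may save a failed training run. The snippet below is a minimal sketch and not part of this repo: the helper name `missing_modules` is hypothetical, and it assumes the packages listed under Requirements are importable as `torch` and `baselines`. It only looks the modules up on the import path and does not import them.

```python
import importlib.util

# Assumption: PyTorch and OpenAI baselines expose these top-level module names.
REQUIRED_MODULES = ("torch", "baselines")


def missing_modules(names=REQUIRED_MODULES):
    """Return the module names that cannot be found on the current import path."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules()
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All required modules were found.")
```

If a module is reported missing, re-run the corresponding install step above in the same environment.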

Training

Atari

 ./run_local_atari.sh

DeepMind Control

 ./run_local.sh

LICENSE

Attribution-NonCommercial 4.0 International
