LOLA_DiCE

Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)

Quick results:

Results on IPD using DiCE

[lr_in=0.3, lr_out=0.2, lr_v=0.1, batch_size=128, len_rollout=150, use_baseline=True]

Results on IPD using DiCE and opponent modelling

[lr_in=0.3, lr_out=0.2, lr_v=0.1, batch_size=128, len_rollout=150, use_baseline=True] (It seems that 2 lookaheads is the most stable model with this set of hyperparameters)

Results on IPD using exact gradients

[lr_in=0.3, lr_out=0.2, batch_size=128, len_rollout=150]

Results on IPD using exact gradients and opponent modelling

[lr_in=0.3, lr_out=0.2, batch_size=128, len_rollout=150]

Authors version:

The authors of the paper have their own version (Tensorflow) available here: https://github.com/alshedivat/lola

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
envs		envs
images		images
LICENSE		LICENSE
README.md		README.md
ipd_DiCE.py		ipd_DiCE.py
ipd_DiCE_om.py		ipd_DiCE_om.py
ipd_exact.py		ipd_exact.py
ipd_exact_om.py		ipd_exact_om.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LOLA_DiCE

Quick results:

Results on IPD using DiCE

Results on IPD using DiCE and opponent modelling

Results on IPD using exact gradients

Results on IPD using exact gradients and opponent modelling

Authors version:

About

Releases

Packages

Languages

License

alexis-jacq/LOLA_DiCE

Folders and files

Latest commit

History

Repository files navigation

LOLA_DiCE

Quick results:

Results on IPD using DiCE

Results on IPD using DiCE and opponent modelling

Results on IPD using exact gradients

Results on IPD using exact gradients and opponent modelling

Authors version:

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages