Randomized-Ensembled-Double-Q-learning-REDQ-

PyTorch implementation of Randomized Ensembled Double Q-learning (REDQ). This repository contains a notebook version and a script version for running either REDQ or SAC.

For more information about REDQ, check out the paper or my Medium article about it. In the future I will add REDQ to my Soft-Actor-Critic-and-Extensions repository so it can be combined with several other performance-increasing extensions such as PER, D2RL, or Munchausen RL.

Dependencies

Trained and tested on:

  • Python 3.6
  • PyTorch 1.7.0
  • NumPy 1.15.2
  • gym 0.10.11
  • pybulletgym
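
Since a requirements.txt is still on the to-do list (see below), installation roughly amounts to the following pip command (standard PyPI package names; pybulletgym is usually installed from its GitHub repository rather than from PyPI):

pip install torch==1.7.0 numpy==1.15.2 gym==0.10.11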

How to use:

The training script combines all extensions; each add-on can be enabled simply by setting the corresponding flag.

python train.py -info redq

To train plain SAC, simply set the REDQ-specific parameters to N=2, M=2, G=1:

python train.py --N 2 --M 2 --G 1 -info sac
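
To make the flags concrete, here is a minimal sketch of the critic update that N, M and G control, assuming illustrative helper names (replay_buffer.sample, actor.sample) that are not necessarily the repo's actual code: an ensemble of N critics, a TD target built from the minimum over a randomly drawn subset of M target critics, and G gradient steps per environment step (the update-to-data ratio).

import random
import torch
import torch.nn.functional as F

def redq_critic_update(critics, target_critics, optimizers, replay_buffer, actor,
                       N=10, M=2, G=20, batch_size=256, gamma=0.99, alpha=0.2):
    # G gradient steps per environment step (the update-to-data ratio).
    for _ in range(G):
        states, actions, rewards, next_states, dones = replay_buffer.sample(batch_size)
        with torch.no_grad():
            next_actions, next_log_probs = actor.sample(next_states)
            # Randomly pick M of the N target critics for the in-target minimum.
            idx = random.sample(range(N), M)
            q_next = torch.min(
                torch.stack([target_critics[i](next_states, next_actions) for i in idx]),
                dim=0).values
            target = rewards + gamma * (1 - dones) * (q_next - alpha * next_log_probs)
        # All N critics are regressed toward the same shared target.
        for q_net, opt in zip(critics, optimizers):
            loss = F.mse_loss(q_net(states, actions), target)
            opt.zero_grad()
            loss.backward()
            opt.step()

With N=2, M=2, G=1 this collapses to the familiar clipped double-Q update of SAC, which is why the command above reproduces plain SAC.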

Observe training results

tensorboard --logdir=runs
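
The runs directory is presumably produced by PyTorch's SummaryWriter; a minimal sketch of that logging pattern (tag names and values are placeholders, not necessarily those used in the repo):

from torch.utils.tensorboard import SummaryWriter

writer = SummaryWriter("runs/redq")  # creates the runs/ folder that tensorboard reads
for step, episode_reward in enumerate([10.0, 25.0, 40.0]):  # dummy values for illustration
    writer.add_scalar("Reward", episode_reward, global_step=step)
writer.close()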

Results

REDQ was trained with N=5, M=2, G=5 (the REDQ paper actually recommends N=10, M=2, G=20); the smaller values were chosen for faster training. Because G gradient updates are performed for every environment step, wall-clock time grows roughly with G: with N=10 and G=20, training takes ~10x longer than regular SAC. If someone finds a way to speed up training, please let me know.

Pendulum (training curve)

LunarLanderContinuous (training curve; REDQ-212 is the regular SAC run with N=2, M=2, G=1)

ToDos:

  • Currently this REDQ version only supports a subsample size of M=2 for the in-target minimum; the repository will be updated over time.
  • Do comparison runs for REDQ and SAC [currently running for PyBullet environments such as cheetah and hopper]
  • Improve training speed (wall-clock time)
  • Add requirements.txt

Author

  • Sebastian Dittert

Feel free to use this code for your own projects or research.

@misc{REDQ,
  author = {Dittert, Sebastian},
  title = {PyTorch Implementation of Randomized-Ensembled-Double-Q-learning-REDQ-},
  year = {2020},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/BY571/Randomized-Ensembled-Double-Q-learning-REDQ-}},
}