Soft Actor-Critic (SAC) implementation in PyTorch

This is PyTorch implementation of Soft Actor-Critic (SAC) [ArXiv].

If you use this code in your research project please cite us as:

@misc{pytorch_sac,
  author = {Yarats, Denis and Kostrikov, Ilya},
  title = {Soft Actor-Critic (SAC) implementation in PyTorch},
  year = {2020},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/denisyarats/pytorch_sac}},
}

Requirements

We assume you have access to a gpu that can run CUDA 9.2. Then, the simplest way to install all required dependencies is to create an anaconda environment and activate it:

conda env create -f conda_env.yml
source activate pytorch_sac

Instructions

To train an SAC agent on the cheetah run task run:

python train.py env=cheetah_run

This will produce exp folder, where all the outputs are going to be stored including train/eval logs, tensorboard blobs, and evaluation episode videos. One can attacha tensorboard to monitor training by running:

tensorboard --logdir exp

Results

An extensive benchmarking of SAC on the DM Control Suite against D4PG. We plot an average performance of SAC over 3 seeds together with p95 confidence intervals. Importantly, we keep the hyperparameters fixed across all the tasks. Note that results for D4PG are reported after 10^8 steps and taken from the original paper.

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
agent		agent
config		config
data		data
figures		figures
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
conda_env.yml		conda_env.yml
logger.py		logger.py
replay_buffer.py		replay_buffer.py
train.py		train.py
utils.py		utils.py
video.py		video.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

agent

agent

config

config

data

data

figures

figures

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

conda_env.yml

conda_env.yml

logger.py

logger.py

replay_buffer.py

replay_buffer.py

train.py

train.py

utils.py

utils.py

video.py

video.py

Repository files navigation

Soft Actor-Critic (SAC) implementation in PyTorch

Requirements

Instructions

Results

About

Releases

Packages

Languages

License

denisyarats/pytorch_sac

Folders and files

Latest commit

History

Repository files navigation

Soft Actor-Critic (SAC) implementation in PyTorch

Requirements

Instructions

Results

About

Topics

Resources

License

Stars

Watchers

Forks

Languages