Skip to content

PyTorch Implementations of Standard Deep RL Algorithms (including REINFORCE, A2C, PPO)

Notifications You must be signed in to change notification settings

siddk/rl-kitchen-sink

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Reinforcement Learning Kitchen Sink

PyTorch Implementations of Standard Deep RL Algorithms (including REINFORCE, A2C, PPO). Each of the below algorithms are implemented for both OpenAI Gym Classic Control Tasks (e.g. Cartpole, MountainCar), as well as the Atari suite.

This repository exists mostly as a means to illustrate basic RL algorithms via extremely readable and well-documented PyTorch implementations. This repository also stores hyperparameters and learning curves (with confidence intervals!) for each task.

This code is inspired in equal parts from the OpenAI Baselines and Ilya Kostrikov's PyTorch-RL Repository.

REINFORCE

Advantage Actor-Critic (A2C)

Proximal Policy Optimization

About

PyTorch Implementations of Standard Deep RL Algorithms (including REINFORCE, A2C, PPO)

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages