Skip to content

PyTorch RL Algorithms Implementations on OpenAI Gym Environments

Notifications You must be signed in to change notification settings

BKHMSI/RL-Playground

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

RL Algorithms using PyTorch on OpenAI Gym

You will find here implementations of several deep reinforcement learning (RL) algorithms using PyTorch. I am going to evaluate and compare each on one or more environment from OpenAI Gym. The purpose of this repository is to help kickstart my journey in RL + document my learning experience. I hope it might be useful for other people starting as well. :)

I am planning to write a blog post to accompany this repo, so stay tuned!

Lunar Lander v2 CartPole v0

Implementations

Algorithm Features Solved* (Episodes**) Paper
  • Baseline
  • Causality
  • CartPole-v0
  • LunarLander-v2
Williams 1992
  • Huber Loss
  • Gradient Clipping
  • Polyak Averaging
  • CartPole-v0 (783)
  • LunarLander-v2 (344)
Minh et al. 2013
  • Same as DQN
  • CartPole-v0 (626)
  • LunarLander-v2 (375)
van Hasselt et al. 2015
  • Same as DQN
  • Proportional Prioritization
  • CartPole-v0 (538)
  • LunarLander-v2 (278)
Schaul et al. 2016
  • Same as DQN + PER
  • CartPole-v0 (698)
  • LunarLander-v2 (275)
Wang et al. 2016
  • Generalized Advantage Estimation
  • PongDeterministic-v4
Minh el al. 2016
  • Rainbow
Hessel et al. 2017
  • and many more...
*These are the environments I attempted to solve using my code so far. The algorithms are certainly capable of solving more (check the attached papers for details). I will be trying them on more diverse environments in the future to evaluate my implementation.
**The average number of episodes it took to solve the environment across 10 runs with different seeds

Configurations

Each implementation has its own yaml config file to easily change model and environment parameters.

About

PyTorch RL Algorithms Implementations on OpenAI Gym Environments

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages