Deep reinforcement learning in PyTorch

This repo provides straightforward implementations of common DRL algorithms.

Algorithm list:

DQN: Deep Q-network
PG: Policy gradient algorithm
A2C: Advantage actor-critic
PPO: Proximal policy optimisation
DDPG: Deep deterministic policy gradient
TD3: Twin delayed DDPG
SAC: Soft actor-critic
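
As a rough orientation to the list above, the sketch below shows two learning targets that recur across these algorithms: the discounted Monte Carlo return, which a plain policy gradient uses to weight log-probabilities, and the one-step bootstrapped TD target, which DQN regresses its Q-values towards. It is a minimal illustration in plain Python and is not taken from this repo; the function names `discounted_returns` and `td_target` and the default `gamma` are assumptions made for the example.

```python
def discounted_returns(rewards, gamma=0.99):
    """Return G_t = r_t + gamma * G_{t+1} for every step of one episode."""
    returns = []
    running = 0.0
    # Walk the episode backwards so each return reuses the one after it.
    for reward in reversed(rewards):
        running = reward + gamma * running
        returns.append(running)
    returns.reverse()
    return returns


def td_target(reward, next_value, done, gamma=0.99):
    """One-step bootstrapped target: r + gamma * V(s'), with no bootstrap at episode end."""
    # For DQN, next_value would be the target network's max Q-value in the next state.
    bootstrap = 0.0 if done else next_value
    return reward + gamma * bootstrap


if __name__ == "__main__":
    print(discounted_returns([1.0, 1.0, 1.0], gamma=0.5))
    print(td_target(1.0, 2.0, done=False, gamma=0.5))
```

In a PyTorch implementation the same arithmetic would typically be applied to batched tensors sampled from a rollout or a replay buffer. The other algorithms in the list refine these targets, for example TD3 bootstraps from the smaller of two critic estimates and SAC adds an entropy term.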

The document IntroToDRL.pdf provides an introduction to deep reinforcement learning and the important formulas behind the algorithms.

Training Rewards

[Figures: training reward curves on CartPole and Pendulum]

Structure

The single_file/ folder contains a working, self-contained example of each algorithm. The modular/ folder contains the same algorithms, split into modular components.

