Skip to content

A explaintable and modified version of udacity DRL homework

Notifications You must be signed in to change notification settings

zhengsizuo/DRL_udacity

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

44 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

DRL_udacity

Welcome to Deep Reinforment Learning world!
This is an explaintable and modified version of udacity DRL homework~

  • DQN: modified from Udacity repo, tested on Breakout-v0 env.
  • PPO: wrote by myself, tested on Pendulum-v0 and BipedalWalker-v2 envs.
  • policy gradient: REINFORCE with baseline and entropy loss, tested on CartPole-v0
  • monte-carlo: modified version, tested on BlackJack env.
  • Temporal Difference: modified version, tested on CliffWalking-v0

About

A explaintable and modified version of udacity DRL homework

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Languages