Skip to content

stjordanis/Deep-RL-Bootcamp-Labs

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Solutions to the Deep RL Bootcamp labs

  • Prelab: Set up your computer for all labs.
  • Lab 1: Markov Decision Processes. You will implement value iteration, policy iteration, and tabular Q-learning and apply these algorithms to simple environments including tabular maze navigation (FrozenLake) and controlling a simple crawler robot.
  • Lab 2: Introduction to Chainer. You will implement deep supervised learning using Chainer, and apply it to the MNIST dataset.
  • Lab 3: Deep Q-Learning. You will implement the DQN algorithm and apply it to Atari games.
  • Lab 4: Policy Optimization Algorithms. You will implement various policy optimization algorithms, including policy gradient, natural policy gradient, trust-region policy optimization (TRPO), and asynchronous advantage actor-critic (A3C). You will apply these algorithms to classic control tasks, Atari games, and roboschool locomotion environments.

About

Solutions to the Deep RL Bootcamp labs

Resources

Code of conduct

Security policy

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 46.5%
  • Jupyter Notebook 36.7%
  • JavaScript 8.7%
  • HTML 3.6%
  • CSS 2.6%
  • Shell 1.9%