Skip to content

hhaji/Reinforcement-Learning

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

86 Commits
 
 
 
 

Repository files navigation

Reinforcement-Learning


Index:


Course Overview:

In this course, you will learn the foundations of Reinforcement Learning. To realize the dreams and impact of AI requires autonomous systems that learn to make good decisions. Reinforcement learning is one powerful paradigm for doing so, and it is relevant to an enormous range of tasks, including robotics, game playing, consumer modeling and healthcare.   

Main TextBook:

Book 1

Main TextBook:

Slides and Papers:

Recommended Slides & Papers:


  1. Introduction to Reinforcement Learning

Required Reading:
Suggested Reading:
Additional Resources:
  1. Exploration and Exploitation

Required Reading:
Suggested Reading:
Additional Resources:
  1. Finite Markov Decision Processes

Required Reading:
Suggested Reading:
Additional Resources:
  1. Dynamic Programming

Required Reading:
Suggested Reading:

To get more familiar with dynamic programing, I recommend to read the following blogs:

Additional Resources:
  1. Monte Carlo Methods

Required Reading:
Suggested Reading:
  1. Temporal-Diference Learning

Required Reading:
Suggested Reading:
Additional Resources:
  1. n-step Bootstrapping

Required Reading:
  1. Planning and Learning with Tabular Methods

Required Reading:
Suggested Reading:
  1. On-policy Prediction with Approximation

Required Reading:
Suggested Reading:
  1. On-policy Control with Approximation

Required Reading:
Suggested Reading:
  1. Off-policy Methods with Approximation

Required Reading:
  1. Eligibility Traces

Required Reading:
  1. Policy Gradient Methods

Required Reading:
  1. Deep Reinforcement Learning

Required Reading:
Suggested Reading:
Additional Resources:
  1. Applications

Required Reading:
Additional Resources:
  1. Useful Toolkits and Libraries

Required Reading:
  • Toolkit: Gym is a toolkit for developing and comparing reinforcement learning algorithms. It supports teaching agents everything from walking to playing games like Pong or Pinball.
Suggested Reading:
Additional Resources:

Additional Resources:

Class Time and Location:

Saturday and Monday

Recitation and Assignments:

Tuesday

Projects:

Projects are programming assignments that cover the topic of this course. Any project is written by Jupyter Notebook. Projects will require the use of Python 3.7, as well as additional Python libraries.

Google Colab:

Google Colab is a free cloud service and it supports free GPU!

Fascinating Guides For Machine Learning:

Latex:

The students can include mathematical notation within markdown cells using LaTeX in their Jupyter Notebooks.

  • A Brief Introduction to LaTeX PDF
  • Math in LaTeX PDF
  • Sample Document PDF
  • TikZ: A collection Latex files of PGF/TikZ figures (including various neural networks) by Petar Veličković.

Grading:

  • Projects and Midterm – 50%
  • Endterm – 50%

ُThree Exams:

  • First Midterm Examination:
  • Second Midterm Examination:
  • Final Examination:

Prerequisites:

General mathematical sophistication; and a solid understanding of Algorithms, Linear Algebra, and Probability Theory, at the advanced undergraduate or beginning graduate level, or equivalent.

Linear Algebra:

Probability and Statistics:

Topics:

Have a look at some assignments of Stanford students (Reinforcement Learning to get some general inspiration.

Account:

It is necessary to have a GitHub account to share your projects. It offers plans for both private repositories and free accounts. Github is like the hammer in your toolbox, therefore, you need to have it!

Academic Honor Code:

Honesty and integrity are vital elements of the academic works. All your submitted assignments must be entirely your own (or your own group's).

We will follow the standard of Department of Mathematical Sciences approach:

  • You can get help, but you MUST acknowledge the help on the work you hand in
  • Failure to acknowledge your sources is a violation of the Honor Code
  • You can talk to others about the algorithm(s) to be used to solve a homework problem; as long as you then mention their name(s) on the work you submit
  • You should not use code of others or be looking at code of others when you write your own: You can talk to people but have to write your own solution/code

Questions?

I will be having office hours for this course on Saturday (09:00 AM--10:00 AM). If this is not convenient, email me at hhaji@sbu.ac.ir or talk to me after class.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published