Advanced course on Reinforcement Learning.
- Overview of Reinforcement Learning and its applications
- Markov Decision Processes (MDPs) and Bellman Equations
- Q-Learning and SARSA algorithms
- TD Learning
- TD prediction
- SARSA and TD control
- First-Visit Monte Carlo and Every-Visit Monte Carlo methods
- On-Policy and Off-Policy methods
- Importance Sampling
- Introduction to function approximation for Reinforcement Learning
- Overview of Deep Reinforcement Learning
- Hands-on experience with Gymnasium environment
- Final project: students will work on a real-world Reinforcement Learning problem using the techniques and tools learned in the course
- Course conclusion and future directions in Reinforcement Learning research
- "Reinforcement Learning: An Introduction" by Richard S. Sutton and Andrew G. Barto (2018)
- Python implementation based on the book "Reinforcement Learning: An Introduction" by Richard S. Sutton and Andrew G. Barto (2018)
- "Deep Reinforcement Learning Hands-On" by Maxim Lapan (2018)
- Code for the book "Deep Reinforcement Learning Hands-On" by Maxim Lapan (2018)
- Gymnasium environment (https://gymnasium.farama.org/)
- TensorFlow documentation (https://www.tensorflow.org/guide)
- PyTorch documentation (https://pytorch.org/docs/stable/index.html)
- AlphaGo Documentary
- Monte Carlo Tree Search Another Introduction