
Deep Reinforcement Learning Textbook

A collection of comprehensive notes on Deep Reinforcement Learning, based on UC Berkeley's CS 285 (prev. CS 294-112), taught by Professor Sergey Levine.

  • Compile the LaTeX source code into a PDF locally (e.g., with pdflatex or latexmk).
  • Alternatively, you can download this repo as a zip file, upload it to Overleaf, and start editing online.
  • This repo is linked to my Overleaf editor, so it is regularly updated.
  • Please let me know if you have any questions or suggestions. Reach me at harryhzhang@berkeley.edu.

Introduction

In recent years, deep reinforcement learning (DRL) has emerged as a transformative paradigm, bridging the domains of artificial intelligence, machine learning, and robotics to enable the creation of intelligent, adaptive, and autonomous systems. This textbook is designed to provide a comprehensive, in-depth introduction to the principles, techniques, and applications of deep reinforcement learning, empowering students, researchers, and practitioners to advance the state of the art in this rapidly evolving field. Since the first DRL class I took was Prof. Levine's CS 294-112, this book's organization and materials are strongly based on the slides and syllabus of CS 294-112 (now CS 285).

The primary objective of this textbook is to offer a systematic and rigorous treatment of DRL, from foundational concepts and mathematical formulations to cutting-edge algorithms and practical implementations. We strive to strike a balance between theoretical clarity and practical relevance, providing readers with the knowledge and tools needed to develop novel DRL solutions for a wide array of real-world problems.

The textbook is organized into several parts, each dedicated to a specific aspect of DRL:

  1. Fundamentals: This part covers the essential background material in reinforcement learning, including Markov decision processes, value functions, and fundamental algorithms such as Q-learning and policy gradients (a minimal Q-learning sketch appears below).
  2. Deep Learning for Reinforcement Learning: Here, we delve into the integration of deep learning techniques with reinforcement learning, discussing topics such as function approximation, representation learning, and the use of deep neural networks as function approximators.
  3. Advanced Techniques and Algorithms: This part presents state-of-the-art DRL algorithms, such as Deep Q-Networks (DQN), Proximal Policy Optimization (PPO), and Soft Actor-Critic (SAC), along with their theoretical underpinnings and practical considerations.
  4. Exploration and Exploitation: We explore strategies for balancing exploration and exploitation in DRL, examining methods such as intrinsic motivation, curiosity-driven learning, and Bayesian optimization (see the count-based bonus sketch below).
  5. Real-World Applications: This section showcases the application of DRL to various domains, including robotics, computer vision, natural language processing, and healthcare, highlighting the challenges and opportunities in each area.

Throughout the textbook, we supplement the theoretical exposition with practical examples, case studies, and programming exercises, allowing readers to gain hands-on experience in implementing DRL algorithms and applying them to diverse problems. We also provide references to relevant literature, guiding the reader towards further resources for deepening their understanding and pursuing advanced topics.
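To make the fundamentals in Part 1 concrete, here is a minimal tabular Q-learning sketch on a toy chain MDP. The environment, hyperparameters, and reward scheme below are illustrative assumptions made for this README, not material taken from the notes.

    import random

    N_STATES = 5                       # states 0..4 on a chain; state 4 is terminal
    ACTIONS = (0, 1)                   # 0 = move left, 1 = move right
    GAMMA, ALPHA, EPS = 0.9, 0.1, 0.1  # discount, learning rate, exploration rate

    def step(s, a):
        # Toy dynamics (assumed): reaching the last state pays +1 and ends the episode.
        s2 = max(0, s - 1) if a == 0 else min(N_STATES - 1, s + 1)
        done = s2 == N_STATES - 1
        return s2, (1.0 if done else 0.0), done

    Q = {(s, a): 0.0 for s in range(N_STATES) for a in ACTIONS}

    def greedy(s):
        # Break ties randomly so the untrained agent does not get stuck.
        best = max(Q[(s, a)] for a in ACTIONS)
        return random.choice([a for a in ACTIONS if Q[(s, a)] == best])

    for _ in range(500):
        s, done = 0, False
        while not done:
            # Epsilon-greedy exploration, then the standard Q-learning update.
            a = random.choice(ACTIONS) if random.random() < EPS else greedy(s)
            s2, r, done = step(s, a)
            target = r if done else r + GAMMA * max(Q[(s2, b)] for b in ACTIONS)
            Q[(s, a)] += ALPHA * (target - Q[(s, a)])
            s = s2

    print({s: greedy(s) for s in range(N_STATES - 1)})  # learned greedy policy

After training, the greedy policy moves right in every non-terminal state; the same update rule underlies DQN once the table is replaced with a neural network.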
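Likewise, for the exploration methods in Part 4, a simple form of intrinsic motivation is a count-based exploration bonus added to the environment reward. The coefficient BETA and the 1/sqrt(N(s)) decay schedule are conventional illustrative choices, not prescriptions from the notes.

    import math
    from collections import defaultdict

    BETA = 0.1                      # assumed bonus scale; tune per task
    visit_counts = defaultdict(int)

    def shaped_reward(state, extrinsic_reward):
        # Reward rarely visited states; the bonus decays as visits accumulate.
        visit_counts[state] += 1
        return extrinsic_reward + BETA / math.sqrt(visit_counts[state])

Feeding shaped_reward(s2, r) into the Q-learning update above encourages the agent to visit novel states even before any extrinsic reward has been observed.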

We envision this textbook as a valuable resource for students, researchers, and practitioners seeking a solid grounding in deep reinforcement learning, as well as a springboard for future innovation and discovery in this exciting and dynamic field. It is our hope that this work will contribute to the ongoing growth and development of DRL, facilitating the creation of intelligent systems that can learn, adapt, and thrive in complex, ever-changing environments.

We extend our deepest gratitude to our colleagues, reviewers, and students, whose invaluable feedback and insights have helped shape this textbook. We also wish to acknowledge the pioneering researchers whose contributions have laid the foundation for DRL and inspired us to embark on this journey.

Update Log

  • Aug 26, 2020: Started adding Fall 2020 materials
  • Aug 28, 2020: Fixed typos in Intro. Credit: Warren Deng.
  • Aug 30, 2020: Added more explanation to the imitation learning chapter.
  • Sep 13, 2020: Added advanced PG material to the PG chapter and fixed typos in PG.
  • Sep 14, 2020: AC chapter reformatted, typos fixed, more analysis on A2C.
  • Sep 16, 2020: Chapter 10.1 KL div typo fix. Credit: Cong Wang.
  • Sep 19, 2020: Chapter 3.7.1 parenthesis typo fix. Credit: Yunkai Zhang.
  • Sep 23, 2020: Q-learning chapter fixes.
  • Sep 26, 2020: More explanation and fix to the advanced PG chapter (specifically intuition behind TRPO).
  • Sep 28, 2020: Typos fixed and more explanation added in Optimal Control. The typos were pointed out in Professor Levine's lecture.
  • Oct 6, 2021: Model-based RL chapter fixed. Added Distillation subsection.
  • Nov 20, 2021: Fixed typos in DDPG, Online Actor Critic, and PG theory. Credit: Javier Leguina.
  • Apr 2, 2023: Fixed typos in VAE and PG theory. Credit: wangcongrobot.
