Skip to content

Commit

Permalink
Merge remote-tracking branch 'origin/master'
Browse files Browse the repository at this point in the history
  • Loading branch information
Webbah committed Jun 28, 2023
2 parents 3bb2d64 + 1cca3e3 commit 26d9237
Show file tree
Hide file tree
Showing 16 changed files with 413 additions and 2,053 deletions.
10 changes: 5 additions & 5 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -83,11 +83,11 @@ All exercises are based on Python 3.9 and site-packages according to the require
* [Tutorial template](../master/exercises/templates/ex03)
* [Tutorial solution](../master/exercises/solutions/ex03)
04. Drive Through the Race Track with Monte Carlo Learning
* [Tutorial video](https://www.youtube.com/watch?v=TSwWlfZXDWw)
* [Tutorial video](https://youtu.be/TSwWlfZXDWw)
* [Tutorial template](../master/exercises/templates/ex04)
* [Tutorial solution](../master/exercises/solutions/ex04)
05. Drive even Faster Using Temporal-Difference Learning
* [Tutorial video](https://www.youtube.com/watch?v=zXdyABW8Hb8)
* [Tutorial video](https://youtu.be/zXdyABW8Hb8)
* [Tutorial template](../master/exercises/templates/ex05)
* [Tutorial solution](../master/exercises/solutions/ex05)
06. Stabilizing the Inverted Pendulum by Tabular Multi-Step Methods
Expand All @@ -103,18 +103,18 @@ All exercises are based on Python 3.9 and site-packages according to the require
* [Tutorial template](../master/exercises/templates/ex08)
* [Tutorial solution](../master/exercises/solutions/ex08)
09. Evaluate the Performance of Given Agents in the Mountain Car Problem Using Function Approximation
* [Tutorial video](https://www.youtube.com/watch?v=zCv29hVyxNk)
* [Tutorial video](https://www.youtube.com/watch?v=AY7fvqnjmGU)
* [Tutorial template](../master/exercises/templates/ex09)
* [Tutorial solution](../master/exercises/solutions/ex09)
10. Escape from the Mountain Car Valley Using Semi-Gradient Sarsa & Least Square Policy Iteration
* [Tutorial video](https://www.youtube.com/watch?v=FrMSB7Dgp7c&feature=youtu.be)
* [Tutorial video](https://www.youtube.com/watch?v=IPxare_FmlE)
* [Tutorial template](../master/exercises/templates/ex10)
* [Tutorial solution](../master/exercises/solutions/ex10)
11. Landing on the Moon with REINFORCE and Actor-Critic Methods
* [Tutorial video](https://www.youtube.com/watch?v=LeVDStyEjAo)
* [Tutorial template](../master/exercises/templates/ex11)
* [Tutorial solution](../master/exercises/solutions/ex11)
12. Shoot to the moon with DDPG & PPO
12. Shoot for the moon with DDPG & PPO
* [Tutorial video](https://www.youtube.com/watch?v=4RyX7L-MbsU)
* [Tutorial template](../master/exercises/templates/ex12)
* [Tutorial solution](../master/exercises/solutions/ex12)
Expand Down
78 changes: 39 additions & 39 deletions exercises/README.md
Original file line number Diff line number Diff line change
@@ -1,49 +1,49 @@
# Exercise Content
01. Basics of Python for Scientific Computing
* [Tutorial video](https://www.youtube.com/watch?v=MJXVQXkOEAA&feature=youtu.be)
* [Tutorial template](../../master/exercises/templates/ex01)
* [Tutorial solution](../../master/exercises/solutions/ex01)
01. Basics of Python for Scientific Computing
* [Tutorial video](https://www.youtube.com/watch?v=MJXVQXkOEAA&feature=youtu.be) (only 2022 edition available due to technical outage)
* [Tutorial template](./templates/ex01)
* [Tutorial solution](./solutions/ex01)
02. Manually Solving Basic Markov Chain, Reward and Decision Problems
* [Tutorial video](https://www.youtube.com/watch?v=3PPIHd3lO4k)
* [Tutorial template](../../master/exercises/templates/ex02)
* [Tutorial solution](../../master/exercises/solutions/ex02)
* [Tutorial video](https://www.youtube.com/watch?v=d38-TmkEZxQ)
* [Tutorial template](./templates/ex02)
* [Tutorial solution](./solutions/ex02)
03. The Beer-Bachelor and Dynamic Programming (the Shortest Beer Problem)
* [Tutorial video](https://www.youtube.com/watch?v=Z9QTRtJfZaM&feature=youtu.be)
* [Tutorial template](../../master/exercises/templates/ex03)
* [Tutorial solution](../../master/exercises/solutions/ex03)
* [Tutorial video](https://www.youtube.com/watch?v=Z9QTRtJfZaM&feature=youtu.be) (only 2022 edition available due to technical outage)
* [Tutorial template](./templates/ex03)
* [Tutorial solution](./solutions/ex03)
04. Drive Through the Race Track with Monte Carlo Learning
* [Tutorial video](https://www.youtube.com/watch?v=GlL1d4Ivo_g&t=880s)
* [Tutorial template](../../master/exercises/templates/ex04)
* [Tutorial solution](../../master/exercises/solutions/ex04)
* [Tutorial video](https://www.youtube.com/watch?v=TSwWlfZXDWw)
* [Tutorial template](./templates/ex04)
* [Tutorial solution](./solutions/ex04)
05. Drive even Faster Using Temporal-Difference Learning
* [Tutorial video](https://www.youtube.com/watch?v=eTFugChiOaE&feature=youtu.be)
* [Tutorial template](../../master/exercises/templates/ex05)
* [Tutorial solution](../../master/exercises/solutions/ex05)
06. Stabilizing the Inverted Pendulum by Tabular n-Step Methods
* [Tutorial video](https://www.youtube.com/watch?v=4ACrTBFRw50)
* [Tutorial template](../../master/exercises/templates/ex06)
* [Tutorial solution](../../master/exercises/solutions/ex06)
* [Tutorial video](https://www.youtube.com/watch?v=zXdyABW8Hb8)
* [Tutorial template](./templates/ex05)
* [Tutorial solution](./solutions/ex05)
06. Stabilizing the Inverted Pendulum by Tabular Multi-Step Methods
* [Tutorial video](https://www.youtube.com/watch?v=GwbfODvSpX8)
* [Tutorial template](./templates/ex06)
* [Tutorial solution](./solutions/ex06)
07. Boosting the Inverted Pendulum by Integrating Learning & Planning (Dyna Framework)
* [Tutorial video](https://www.youtube.com/watch?v=KXNu7xDZ9Bg&feature=youtu.be)
* [Tutorial template](../../master/exercises/templates/ex07)
* [Tutorial solution](../../master/exercises/solutions/ex07)
* [Tutorial video](https://www.youtube.com/watch?v=FvpIQN4mj2M)
* [Tutorial template](./templates/ex07)
* [Tutorial solution](./solutions/ex07)
08. Predicting the Operating Behavior of a Real Electric Drive Systems with Supervised Learning
* [Tutorial video](https://www.youtube.com/watch?v=zxC87jRnT08)
* [Tutorial template](../../master/exercises/templates/ex08)
* [Tutorial solution](../../master/exercises/solutions/ex08)
* [Tutorial video](https://www.youtube.com/watch?v=Aivh5ykeJ2Q)
* [Tutorial template](./templates/ex08)
* [Tutorial solution](./solutions/ex08)
09. Evaluate the Performance of Given Agents in the Mountain Car Problem Using Function Approximation
* [Tutorial video](https://www.youtube.com/watch?v=zCv29hVyxNk&t=2162s)
* [Tutorial template](../../master/exercises/templates/ex09)
* [Tutorial solution](../../master/exercises/solutions/ex09)
* [Tutorial video](https://www.youtube.com/watch?v=AY7fvqnjmGU)
* [Tutorial template](./templates/ex09)
* [Tutorial solution](./solutions/ex09)
10. Escape from the Mountain Car Valley Using Semi-Gradient Sarsa & Least Square Policy Iteration
* [Tutorial video](https://www.youtube.com/watch?v=FrMSB7Dgp7c&feature=youtu.be)
* [Tutorial template](../../master/exercises/templates/ex10)
* [Tutorial solution](../../master/exercises/solutions/ex10)
11. Improve the Value-Based Mount Car Solution using Sarsa(Lambda)
* [Tutorial video](https://www.youtube.com/watch?v=bhZGpuh5-6M)
* [Tutorial template](../../master/exercises/templates/ex11)
* [Tutorial solution](../../master/exercises/solutions/ex11)
12. Landing on the Moon with REINFORCE and Actor-Critic Methods
* [Tutorial video](https://www.youtube.com/watch?v=IPxare_FmlE)
* [Tutorial template](./templates/ex10)
* [Tutorial solution](./solutions/ex10)
11. Landing on the Moon with REINFORCE and Actor-Critic Methods
* [Tutorial video](https://www.youtube.com/watch?v=LeVDStyEjAo)
* [Tutorial template](../../master/exercises/templates/ex12)
* [Tutorial solution](../../master/exercises/solutions/ex12)
* [Tutorial template](./templates/ex11)
* [Tutorial solution](./solutions/ex11)
12. Shoot for the moon with DDPG & PPO
* [Tutorial video](https://www.youtube.com/watch?v=4RyX7L-MbsU)
* [Tutorial template](./templates/ex12)
* [Tutorial solution](./solutions/ex12)
Loading

0 comments on commit 26d9237

Please sign in to comment.