Skip to content

Files

Latest commit

Sudharsan RavichandiranSudharsan Ravichandiran
Sudharsan Ravichandiran
and
Sudharsan Ravichandiran
Apr 1, 2021
05ead4a · Apr 1, 2021

History

History

05. Understanding Temporal Difference Learning

5. Understanding Temporal Difference Learning

  • 5.1. TD Learning
  • 5.2. TD Prediction
    • 5.2.1. TD Prediction Algorithm
  • 5.3. Predicting the Value of States in a Frozen Lake Environment
  • 5.4. TD Control
  • 5.5. On-Policy TD Control - SARSA
  • 5.6. Computing Optimal Policy using SARSA
  • 5.7. Off-Policy TD Control - Q Learning
  • 5.8. Computing the Optimal Policy using Q Learning
  • 5.9. The Difference Between Q Learning and SARSA
  • 5.10. Comparing DP, MC, and TD Methods