Skip to content

Dynamic programming. Value iteration methods. Monte Carlo controls. Q-learning.

Notifications You must be signed in to change notification settings

Banyc/reinforcement_learning

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

20 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Value Iteration

  • prerequisites:
    • finite Markov decision processes
    • the agent knows environment dynamics accurately
      • $p(r, s' \mid s, a)$ for every variable

Monte Carlo

  • prerequisites:
    • Markov decision processes
    • episodic tasks

About

Dynamic programming. Value iteration methods. Monte Carlo controls. Q-learning.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages