Tour De Flags Maze solved by deep reinforcement learning (Q-learning) technique. The Tour De Flags maze game is similar to the classical Mouse/Cheese maze game, except that the mouse is replaced by an agent whose mission is to collect several flags before arriving to the target cell (were the "Cheese" used to be in the previous maze game). For simplicity sake we will assume that the agent always starts from cell (0,0) and the destination cell is always at the bottom right cell of the maze. A more elaborate description: http://www.samyzaf.com/ML/tdf/tdf.html
-
Notifications
You must be signed in to change notification settings - Fork 5
samyzaf/tdfmaze
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
Tour De Flags Maze solved by deep reinforcement learning technique (Q-learning)
Resources
Stars
Watchers
Forks
Releases
No releases published