This is a python based simulation for single reinforcement learning agents supporting:
- Q learning
- Q-Landa Learning
- Sarasa Learning
- ADP Learning
- TD Learning
- TD-Landa Learning
Requierments:
-Python 2.7
-Numpy
-Matplotlib
Environment:
[ 0 ,1 ,2 ,3 ,4 ,5
--------
[ 6 ,7 ,8 ,9 ,10,|11
12,13,14,15,16,|17
--
18,19,20|,21,22,23
24,25,26|,27,28,29
30,31,32,33,34,35]]
Usage:
from Agent import *
agent = Agent()
agent.qlearn(iteration, gama) # learning using qlearn
agent.sarsa_learn(200, 0.5, 0.9) # learning using sarsa
agent.qlanda_learn(200, 0.5, 0.9) # learning using q-landa
agent.show_policy() # shows learned policy
agent.use_policy(21) # shows direction to the goal based on learned policy and initial location 21