An agent learns an optimal policy in a grid world using a reinforcement learning algorithm.
I used the OpenCV library to implement this project.
How to install OpenCV: here
If the libraries are installed, compile and run the project:
make
./rl
The agent learns the optimal policy only after a large number of iterations.
N stands for north
W stands for west
S stands for south
E stands for east
The point below the policy shows the relative score for that state.
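
To illustrate how a policy like this can be learned and displayed, here is a minimal sketch in Python. It is not the project's own code. This README does not say which reinforcement learning algorithm the project uses, so the sketch assumes tabular Q-learning. The 4x4 grid, the reward values, the hyperparameters, and the function names (step, train, greedy_action, render) are all hypothetical. Each state is printed as its best action letter (N, S, W, E) with the highest Q-value for that state underneath, which is one possible reading of "relative score".

```python
import random

# Hypothetical 4x4 grid: the agent starts top-left, the goal is bottom-right.
ROWS, COLS = 4, 4
START, GOAL = (0, 0), (3, 3)
# Action letters follow the README legend; values are (row, col) offsets.
ACTIONS = {"N": (-1, 0), "S": (1, 0), "W": (0, -1), "E": (0, 1)}


def step(state, action):
    """Apply one move. Moves off the grid leave the agent where it is."""
    dr, dc = ACTIONS[action]
    row = min(max(state[0] + dr, 0), ROWS - 1)
    col = min(max(state[1] + dc, 0), COLS - 1)
    nxt = (row, col)
    # Assumed rewards: +1 for reaching the goal, a small cost per other move.
    reward = 1.0 if nxt == GOAL else -0.04
    return nxt, reward, nxt == GOAL


def greedy_action(q, state):
    """Return the action letter with the highest Q-value in this state."""
    return max(q[state], key=q[state].get)


def train(episodes=2000, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Tabular Q-learning with an epsilon-greedy behaviour policy."""
    rng = random.Random(seed)
    q = {(r, c): {a: 0.0 for a in ACTIONS}
         for r in range(ROWS) for c in range(COLS)}
    for _ in range(episodes):
        state = START
        for _ in range(200):  # cap the number of steps per episode
            if rng.random() < epsilon:
                action = rng.choice(list(ACTIONS))
            else:
                action = greedy_action(q, state)
            nxt, reward, done = step(state, action)
            # The goal is terminal, so there is no bootstrap term after it.
            target = reward if done else reward + gamma * max(q[nxt].values())
            q[state][action] += alpha * (target - q[state][action])
            state = nxt
            if done:
                break
    return q


def render(q):
    """Build a text grid: a row of policy letters, then their scores below."""
    lines = []
    for r in range(ROWS):
        letters, scores = [], []
        for c in range(COLS):
            if (r, c) == GOAL:
                letters.append(f"{'goal':^7}")
                scores.append(" " * 7)
            else:
                letters.append(f"{greedy_action(q, (r, c)):^7}")
                scores.append(f"{max(q[(r, c)].values()):^7.2f}")
        lines.append("".join(letters))
        lines.append("".join(scores))
    return "\n".join(lines)


if __name__ == "__main__":
    print(render(train()))
```

Lowering the episodes argument shows why many iterations are needed: states far from the goal only get useful values once the reward has propagated back through their neighbours.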