taxi-v2

An agent for the OpenAI Gym Taxi-v2 environment, implemented with temporal-difference (TD) methods.

Solution to a Udacity deep reinforcement learning assignment.

Solves the Taxi problem described in the paper https://arxiv.org/pdf/cs/9905014.pdf using TD methods.

Achieves a score of 9.1 in 4000 trials.
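
This README does not say which TD method the agent uses, so the following is only a minimal sketch of one common choice, tabular Q-learning with an epsilon-greedy policy. The class name `QLearningAgent`, the method names `select_action` and `step`, and the hyperparameter values are assumptions made for illustration; they are not taken from agent.py.

```python
import random
from collections import defaultdict


class QLearningAgent:
    """Hypothetical tabular Q-learning agent (one TD control method among several)."""

    def __init__(self, n_actions, alpha=0.1, gamma=1.0, epsilon=0.1, seed=0):
        self.n_actions = n_actions
        self.alpha = alpha      # step size (assumed value)
        self.gamma = gamma      # discount factor (assumed value)
        self.epsilon = epsilon  # exploration rate (assumed value)
        self.rng = random.Random(seed)
        # Action values default to 0.0 for states that have not been seen yet.
        self.Q = defaultdict(lambda: [0.0] * n_actions)

    def select_action(self, state):
        """Epsilon-greedy: explore with probability epsilon, else act greedily."""
        if self.rng.random() < self.epsilon:
            return self.rng.randrange(self.n_actions)
        values = self.Q[state]
        best = max(values)
        # Break ties between equally good actions at random.
        return self.rng.choice([a for a, v in enumerate(values) if v == best])

    def step(self, state, action, reward, next_state, done):
        """Q-learning update: move Q(s, a) toward r + gamma * max_a' Q(s', a')."""
        if done:
            target = reward  # no bootstrapping from a terminal state
        else:
            target = reward + self.gamma * max(self.Q[next_state])
        self.Q[state][action] += self.alpha * (target - self.Q[state][action])


def td_update_demo():
    """Apply a single update by hand and return the new Q value."""
    agent = QLearningAgent(n_actions=2, alpha=0.5, gamma=0.9)
    agent.Q["s1"] = [0.0, 2.0]
    agent.step("s0", 0, reward=1.0, next_state="s1", done=False)
    # target = 1.0 + 0.9 * 2.0 = 2.8, so Q moves halfway from 0.0 to 2.8
    return agent.Q["s0"][0]


if __name__ == "__main__":
    print(td_update_demo())  # approximately 1.4
```

In an episode loop, `select_action` would be called on each observed state and `step` on each resulting transition. Replacing the `max` in the target with the value of the action actually taken next would give Sarsa instead, another TD method.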

Files:
__pycache__
README.md
agent.py
main.py
monitor.py
