taxi-v2 Implementation for OpenAI taxi-v2 (Using temporal-difference methods) Udacity Deep reinforcement learning assignment solution. Solves the problem in paper https://arxiv.org/pdf/cs/9905014.pdf using TD methods Achieves 9.1 score in 4000 trials