Taxi-v2 openAI gym

Running Value Iteration on the Taxi environment

I barrowed heavily from allanbreyes github.com/allanbreyes/gym-solutions/blob/master/analysis/mdp.py The goal was to learn a lot about value iteration, and I've achieved that. My code is simpler than Allen's, so it's worth a look if you are still learning. Otherwise check out his original, it covers policy iteration and multiple openAI environments.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
README.md		README.md
taxi_test5.py		taxi_test5.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

taxi_test5.py

taxi_test5.py

Repository files navigation

Taxi-v2 openAI gym

About

Releases

Packages

Languages

gary-butler/Taxi-v2-openAI-gym

Folders and files

Latest commit

History

README.md

README.md

taxi_test5.py

taxi_test5.py

Repository files navigation

Taxi-v2 openAI gym

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages