Environments in Offline RL

Implementations of the Gridworld, ModelWin, and ModelFail environments for experimenting with offline RL. The implementations are modeled on NChainEnv, found at [https://github.com/openai/gym/blob/master/gym/envs/toy_text/nchain.py].

Please see [https://arxiv.org/pdf/1604.00923.pdf] for the description of these environments.

Grid World: Implemented in GridWorld.py. See [https://github.com/maximecb/gym-minigrid] for an existing implementation.
ModelWin: Implemented in ModelWin.py. step() runs one step in ModelWin. Be sure to call reset() when done==True.
ModelFail: Implemented in ModelFail.py. step() runs one step in ModelFail. Be sure to call reset() when done==True.
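
The step()/reset() contract described above can be sketched as a data-collection loop for offline RL. This is a minimal sketch under stated assumptions: ToyEnv is a hypothetical stand-in with placeholder dynamics, not the repository's ModelWin or ModelFail, and the four-value return of step() (observation, reward, done, info) is assumed from the classic Gym interface that NChainEnv follows. The actual classes may differ.

```python
import random


class ToyEnv:
    """Hypothetical stand-in with a classic Gym-style interface.

    NOT the ModelWin/ModelFail dynamics; it only illustrates the
    step()/reset() contract.
    """

    def __init__(self, horizon=5):
        self.horizon = horizon  # episode length (assumed, for illustration)
        self.t = 0
        self.state = 0

    def reset(self):
        # Start a new episode and return the initial observation.
        self.t = 0
        self.state = 0
        return self.state

    def step(self, action):
        # Placeholder dynamics: action 1 flips the state, action 0 keeps it.
        self.state ^= action
        reward = 1.0 if self.state == 1 else 0.0
        self.t += 1
        done = self.t >= self.horizon
        return self.state, reward, done, {}


def collect_transitions(env, num_steps, seed=0):
    """Log (obs, action, reward, next_obs, done) tuples under a random policy."""
    rng = random.Random(seed)
    data = []
    obs = env.reset()
    for _ in range(num_steps):
        action = rng.randrange(2)  # uniform random behavior policy
        next_obs, reward, done, _ = env.step(action)
        data.append((obs, action, reward, next_obs, done))
        # Call reset() whenever the episode has ended (done == True).
        obs = env.reset() if done else next_obs
    return data


dataset = collect_transitions(ToyEnv(horizon=5), num_steps=12)
```

To try the same loop against the repository's environments, replace ToyEnv with the environment class defined in ModelWin.py or ModelFail.py, assuming it exposes the same step() and reset() signatures.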
