RL code
- Value iteration (Env.: 5x5 grid world)
- Policy iteration (Env.: 5x5 grid world)
- OS: Windows 10
- Set up the development environment using Anaconda (Python 3.5) or pip
conda create -n py35 python=3.5 anaconda
activate py35
- python 3.5.5
- tensorflow 1.0.0
- keras 2.0.3
- gym 0.9.4
- gym-maze 0.4
pip install tensorflow==1.0.0
pip install msgpack
pip install keras==2.0.3
python -m pip install --upgrade pip
pip install gym==0.9.4
cd C:\Users\{YOUR_PATH}\rl_env_collection
git clone https://github.com/tuzzer/gym-maze.git or git clone https://github.com/MattChanTK/gym-maze.git
cd C:\Users\{YOUR_PATH}\rl_env_collection\gym-maze
python setup.py install
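
After the install steps above, it can help to confirm that the packages in the active environment match the pinned versions. The following is a minimal sketch, not part of the original repository: the helper names `installed_version` and `find_mismatches` are hypothetical, and the distribution names (in particular `gym-maze`) are assumed to match what pip registered.

```python
# Pinned versions copied from the version list in this README.
PINNED = {
    "tensorflow": "1.0.0",
    "keras": "2.0.3",
    "gym": "0.9.4",
    "gym-maze": "0.4",  # assumed distribution name; adjust if pip lists it differently
}


def installed_version(name):
    """Return the installed version string of a distribution, or None if absent."""
    try:
        # Available in the standard library from Python 3.8 onward.
        from importlib.metadata import version, PackageNotFoundError
    except ImportError:
        # Older interpreters (such as Python 3.5) fall back to setuptools.
        import pkg_resources
        try:
            return pkg_resources.get_distribution(name).version
        except pkg_resources.DistributionNotFound:
            return None
    try:
        return version(name)
    except PackageNotFoundError:
        return None


def find_mismatches(pinned, lookup=installed_version):
    """Map each package whose installed version differs from its pin
    to a tuple (expected, found); found is None when not installed."""
    mismatches = {}
    for name, expected in pinned.items():
        found = lookup(name)
        if found != expected:
            mismatches[name] = (expected, found)
    return mismatches


if __name__ == "__main__":
    for name, (expected, found) in sorted(find_mismatches(PINNED).items()):
        print("{}: expected {}, found {}".format(name, expected, found))
```

Run it with the py35 environment activated. It prints one line per package that is missing or at a different version, and prints nothing when everything matches.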