ME 5406 Deep Learning for Robotics Project1

Problem Statement

Consider a frozen lake with (four) holes covered by patches of very thin ice. Suppose that a robot is to glide on the frozen surface from one location(i.e., the top left corner) to another (bottom right corner) in order to pick up a frisbee, as illustrated below.

3 Model Free RL Algorithms:

Q learning
SARSA
First-visit Monte Carlo control

Requirements

Use conda install --file requirements.txt to install the following requirements:

matplotlib
pillow
pandas
tk

(you can refer to requirements.txt)

How to Run

You can simply set the arguments to specify the training process and run the code in main.py. The default arguments are: args.algorithm="Q_learning"; args.grid_size=4; args.learning_rate=0.01; arg.gamma=0.9; args.epsilon=0.9. You can also simply run the training process of the 3 algorithms in Q_learning.py, SARSA.py and Monte_carlo.py, respectively, the training parameters can be set in Parameters.py.
You can also use the terminal to run this code:

conda create -n WZQ python==3.6
conda activate WZQ  
conda install --file requirements.txt
python main.py --algorithm "Q_learning" --map_size 4

Experiment Results (4 × 4 grid map)

Q learning

SARSA

Monte Carlo First-visit

Final Optimal Policy
- Q learning, SARSA and Monte Carlo, respectively

Acknowledgement

If my code helps you learn reinforcement learning, please give me a star :)

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
.idea		.idea
Q_table		Q_table
__pycache__		__pycache__
render_images		render_images
results		results
.gitattributes		.gitattributes
Environment.py		Environment.py
Monte_carlo.py		Monte_carlo.py
Parameters.py		Parameters.py
Q_learning.py		Q_learning.py
README.md		README.md
SARSA.py		SARSA.py
main.py		main.py
requirements.txt		requirements.txt

lovecicinnus/ME5406-project1

Folders and files

Latest commit

History

Repository files navigation

ME 5406 Deep Learning for Robotics Project1

Problem Statement

3 Model Free RL Algorithms:

Requirements

How to Run

Experiment Results (4 × 4 grid map)

Acknowledgement

About

Topics

Resources

Stars

Watchers

Forks

Languages