Q-learning-Exploration

CSE 190 Final Project by Le Lu & Huajun Zhang

In this project, we are interested in implementing Q-learning as an extension to PA3. Similar to MDP, we want to find an optimal policy for each grid in a given map. However, this time probabilistic models are not known and have to be learned. This can be a common case where robot has no idea how accurate its movements are and Q-learning ensures the optimization of the final policy list even when robot takes undesired movements in some iterations.

To run the program:

python learning.py

by default, the algorithm will run with configuration.json. To apply the other configuration(s), simply rename.

*For the sake of clarity and simplicity, we did not use ROS for simulation. *

Youtube link

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
Le Lu_Huajun Zhang_190Final.pdf		Le Lu_Huajun Zhang_190Final.pdf
README.md		README.md
configuration.json		configuration.json
configuration_2_10_10_1.json		configuration_2_10_10_1.json
learning.py		learning.py
read_config.py		read_config.py
robotmover.py		robotmover.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Q-learning-Exploration

CSE 190 Final Project by Le Lu & Huajun Zhang

To run the program:

About

Releases

Packages

Languages

raphaellu/Q-learning-Exploration

Folders and files

Latest commit

History

Repository files navigation

Q-learning-Exploration

CSE 190 Final Project by Le Lu & Huajun Zhang

To run the program:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages