Implementing Deep-Q-Learning to train a bot to navigate an environment with obstacles

dqnTurtlebot

This project implements deep Q-learning for a TurtleBot in a Gazebo environment using ROS. Results of the learning process, including the final model and result plots, are saved in the dqnmodels folder. Hyperparameters can be changed and compared by editing the hyperParameterList variable in the turt_q_learn_hypers.py file. Each hyperparameter combination is saved and plotted separately, with a combined plot at the top level. The project also supports loading a previously trained model: place a TensorFlow Keras model with the filename model in the load_model folder and set the hyperParameterList["Load Model"] value to True.

  1. Launch Gazebo environment by running roslaunch turt_q_learn brick_world.launch
  2. Run DQN script with rosrun turt_q_learn turt_q_learn_hypers.py

The original goal of this project was to apply deep Q-learning to train a neural network to drive a simulated bot from the turtlebot3 package through a Gazebo 3-D physics simulation environment while avoiding obstacles. The code for this module is contained in the turt_q_learn.py file in the scripts folder. In addition to the deep Q-learning algorithm, the script is structured so that a user can easily compare and contrast different types of models, or alter specific environment variables.
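At the heart of any deep Q-learning loop is the Bellman target used to train the network. The sketch below shows that target computation for a batch of transitions; it is illustrative only, and the function name, array shapes, and discount value are assumptions rather than the repository's actual code.

```python
import numpy as np

GAMMA = 0.99  # discount factor (assumed value, not taken from the repo)

def q_targets(q_current, q_next, actions, rewards, dones, gamma=GAMMA):
    """Bellman targets for a batch of transitions.

    q_current: (batch, n_actions) Q-values predicted for the visited states
    q_next:    (batch, n_actions) Q-values predicted for the successor states
    actions:   (batch,) index of the action taken in each transition
    rewards:   (batch,) reward received
    dones:     (batch,) 1.0 if the episode ended on this transition, else 0.0
    """
    targets = q_current.copy()
    max_next = q_next.max(axis=1)
    # Terminal transitions bootstrap nothing beyond the immediate reward.
    targets[np.arange(len(actions)), actions] = (
        rewards + gamma * max_next * (1.0 - dones)
    )
    return targets
```

Only the entry for the action actually taken is updated; the other columns keep the network's own predictions, so the squared-error loss is driven solely by the visited action.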

The environment variables are listed at the top of the script. The model parameters are contained in a hyperParameterList variable, which is used to generate a dictionary for every possible combination of the hyperparameters selected. The models and graphs detailing the results of each parameter combination are automatically saved in the dqnmodels folder. A user can also reuse a previously built model by adding a model file to the load_model folder and setting the Load Model value in the hyperParameterList to True.
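Expanding a list of candidate values per hyperparameter into one dictionary per combination can be done with a Cartesian product. The sketch below shows one minimal way to do this; the parameter names and values are illustrative assumptions, not the repository's actual defaults.

```python
from itertools import product

# Candidate values per hyperparameter (names and values are assumptions).
hyperParameterList = {
    "Learning Rate": [0.001, 0.0005],
    "Discount": [0.99],
    "Load Model": [False],
}

def combinations(params):
    """Expand {name: [values]} into one dict per value combination."""
    keys = list(params)
    return [dict(zip(keys, values))
            for values in product(*(params[k] for k in keys))]
```

Each resulting dictionary can then drive one full training run, so every combination is trained, saved, and plotted independently.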

A video including a demonstration of the learning process in action, as well as a walkthrough of the code is located here.

The project was then extended to include a pathfinding module. Using a map generated by the turtlebot3_slam ROS package as input, a user can set 'start' and 'goal' coordinates. The script generates a path using either the wavefront or A* pathfinding algorithm, selected by calling the pathFinder.gradDesc or pathFinder.aStar function respectively. The path is then converted to a set of waypoints, translated into Gazebo environment coordinates, and fed to a simple path-walking algorithm that drives the bot through the waypoints.
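As a rough idea of what the A* option computes, the sketch below runs A* on an occupancy grid (0 = free, 1 = obstacle) with a Manhattan heuristic. It is a generic stand-in under assumed names and data layout, not the repository's pathFinder.aStar implementation.

```python
import heapq

def a_star(grid, start, goal):
    """Shortest 4-connected path on an occupancy grid, or None if unreachable."""
    rows, cols = len(grid), len(grid[0])
    h = lambda p: abs(p[0] - goal[0]) + abs(p[1] - goal[1])  # Manhattan heuristic
    open_set = [(h(start), 0, start, [start])]  # (f, g, cell, path so far)
    best_g = {start: 0}
    while open_set:
        _, g, node, path = heapq.heappop(open_set)
        if node == goal:
            return path
        for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            nxt = (node[0] + dr, node[1] + dc)
            if (0 <= nxt[0] < rows and 0 <= nxt[1] < cols
                    and grid[nxt[0]][nxt[1]] == 0):
                ng = g + 1
                if ng < best_g.get(nxt, float("inf")):
                    best_g[nxt] = ng
                    heapq.heappush(open_set, (ng + h(nxt), ng, nxt, path + [nxt]))
    return None  # goal unreachable
```

The returned list of grid cells plays the role of the waypoint set described above; a real pipeline would then scale and offset each cell into Gazebo world coordinates before handing it to the path-walking routine.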
