Q-learning navigation system

Lidar Sensor Version

Train an agent to navigate and collect bananas in a large, square world.

A reward of +1 is provided for collecting a yellow banana, and a reward of -1 is provided for collecting a blue banana. Thus, the goal of your agent is to collect as many yellow bananas as possible while avoiding blue bananas.
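The reward scheme above can be illustrated with a minimal sketch. The function name `episode_score` and the string labels `"yellow"` and `"blue"` are hypothetical and chosen only for illustration; the environment itself reports numeric rewards.

```python
# Reward values as described above: +1 for yellow, -1 for blue.
YELLOW_REWARD = 1
BLUE_REWARD = -1


def episode_score(collected):
    """Sum the rewards for a sequence of collected banana colours.

    `collected` is an iterable of the (hypothetical) labels "yellow" or "blue".
    """
    rewards = {"yellow": YELLOW_REWARD, "blue": BLUE_REWARD}
    return sum(rewards[colour] for colour in collected)
```

For example, collecting two yellow bananas and one blue banana gives a score of 1.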

The state space has 37 dimensions and contains the agent's velocity, along with ray-based perception of objects around the agent's forward direction. Given this information, the agent has to learn how to best select actions. Four discrete actions are available, corresponding to:

0 - move forward.
1 - move backward.
2 - turn left.
3 - turn right.
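One common way to choose among these four actions during Q-learning is an epsilon-greedy rule. The sketch below assumes a hypothetical list `q_values` with one estimated action value per action; how those values are computed (for example, by a neural network) is outside its scope, and it is not necessarily how this repository's agent is written.

```python
import random

# Action indices as listed above.
ACTIONS = {0: "forward", 1: "backward", 2: "left", 3: "right"}
STATE_SIZE = 37  # velocity plus ray-based perception


def select_action(q_values, epsilon, rng=random):
    """Epsilon-greedy selection over the four discrete actions.

    With probability `epsilon` a random action is explored; otherwise the
    action with the highest estimated value is chosen.
    """
    if len(q_values) != len(ACTIONS):
        raise ValueError("expected one Q-value per action")
    if rng.random() < epsilon:
        return rng.randrange(len(ACTIONS))
    return max(range(len(q_values)), key=lambda a: q_values[a])
```

With `epsilon` set to 0 the choice is purely greedy; larger values trade exploitation for exploration.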

Camera Version

In this version, the only difference is that the state is an 84 x 84 RGB image corresponding to the agent's first-person view.
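Image states are often rearranged before being passed to a convolutional network. The sketch below is one possible preprocessing step and rests on two assumptions not stated in the source: that pixel values arrive as integers in the range 0 to 255, and that the model expects a channels-first layout. The helper name `to_channels_first` is hypothetical.

```python
HEIGHT, WIDTH, CHANNELS = 84, 84, 3  # 84 x 84 RGB image, as described above


def to_channels_first(image):
    """Convert an H x W x C nested list of 0-255 ints (assumed) to
    a C x H x W nested list of floats scaled to [0, 1]."""
    if len(image) != HEIGHT or any(len(row) != WIDTH for row in image):
        raise ValueError("expected an 84 x 84 image")
    return [
        [[image[y][x][c] / 255.0 for x in range(WIDTH)] for y in range(HEIGHT)]
        for c in range(CHANNELS)
    ]
```

In practice an array library would do this far more efficiently; plain lists are used here only to keep the sketch free of dependencies.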

