Skip to content

This is an implementation of Deep Reinforcement Learning for a navigation task. Specifically, DQN algorithm with experience replay method is used to solve the task.

Notifications You must be signed in to change notification settings

SHIVOH/Deep-Reinforcement-Learning-My-First-DQN-Agent

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

16 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Deep Reinforcement Learning Agent (DQN algorithm)

This is an implementation of Deep Reinforcement Learning for a navigation task. Specifically, DQN algorithm with experience replay method is used to solve the task.

Details of the environment

THe environment is a Unity Environment which consists of a square surface with Yellow and Blue Bananas scattered around.

Task of the agent

The agent needs to collect as many yellow bananas as possible while avoiding the blue bananas.

Actions possible for agent

  • move forward
  • move backward
  • turn left
  • turn right

Reward for the agent

  • Yellow banana +1 reward
  • Blue banana -1 reward

When is environment said to be solved ?

The banana collection is an episodic game. Idea is to maximise the total score in an episode. The environment is said to be solved if the agent learns to secure an average score of at least +13 points over 100 consecutive episodes.

How to get started ?

My Solution

image

About

This is an implementation of Deep Reinforcement Learning for a navigation task. Specifically, DQN algorithm with experience replay method is used to solve the task.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published