Skip to content

Latest commit

 

History

History
25 lines (25 loc) · 2.54 KB

File metadata and controls

25 lines (25 loc) · 2.54 KB

Deep Reinforcement Learning Agent (DQN algorithm)

This is an implementation of Deep Reinforcement Learning for a navigation task. Specifically, DQN algorithm with experience replay method is used to solve the task.

Details of the environment

THe environment is a Unity Environment which consists of a square surface with Yellow and Blue Bananas scattered around.

Task of the agent

The agent needs to collect as many yellow bananas as possible while avoiding the blue bananas.

Actions possible for agent

  • move forward
  • move backward
  • turn left
  • turn right

Reward for the agent

  • Yellow banana +1 reward
  • Blue banana -1 reward

When is environment said to be solved ?

The banana collection is an episodic game. Idea is to maximise the total score in an episode. The environment is said to be solved if the agent learns to secure an average score of at least +13 points over 100 consecutive episodes.

How to get started ?

My Solution

image