markub3327/Dueling-DQN-with-AutoEncoder

Signal novelty detection as an intrinsic reward for robotics

Paper: Signal Novelty Detection as an Intrinsic Reward for Robotics

AutoEncoder Architecture

An autoencoder is a deep neural network that can be used to detect anomalies in the signals received from a robot's sensors. It is trained on previously observed states sampled from the experience replay buffer (RB), while prediction is run on the currently observed state. This makes it possible to detect new, previously unobserved states early, which helps the agent explore the environment and reach previously unvisited parts of its state space.
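The idea above can be sketched in a few lines of NumPy. This is a minimal illustration, not the repository's implementation: it assumes that novelty is measured as the mean squared reconstruction error of the autoencoder, and the names `TinyAutoEncoder`, `novelty`, and `run_demo`, the layer sizes, the learning rate, and the synthetic "replay buffer" are all hypothetical choices made for the example.

```python
import numpy as np


class TinyAutoEncoder:
    """One-hidden-layer autoencoder trained by plain gradient descent.

    Hypothetical stand-in for the real model; sizes and learning rate are assumptions.
    """

    def __init__(self, state_dim, latent_dim, lr=0.1, seed=0):
        rng = np.random.default_rng(seed)
        self.w_enc = rng.normal(0.0, 0.1, size=(state_dim, latent_dim))
        self.w_dec = rng.normal(0.0, 0.1, size=(latent_dim, state_dim))
        self.lr = lr

    def reconstruct(self, states):
        return np.tanh(states @ self.w_enc) @ self.w_dec

    def train_step(self, states):
        """One gradient step on the mean squared reconstruction error.

        Returns the loss measured before the weights are updated.
        """
        hidden = np.tanh(states @ self.w_enc)
        error = hidden @ self.w_dec - states
        scale = 2.0 / error.size
        grad_dec = (hidden.T @ error) * scale
        grad_pre = (error @ self.w_dec.T) * (1.0 - hidden ** 2)
        grad_enc = (states.T @ grad_pre) * scale
        self.w_dec -= self.lr * grad_dec
        self.w_enc -= self.lr * grad_enc
        return float(np.mean(error ** 2))

    def novelty(self, states):
        """Per-state reconstruction error, assumed here to be the intrinsic reward."""
        states = np.atleast_2d(states)
        return np.mean((self.reconstruct(states) - states) ** 2, axis=1)


def run_demo():
    rng = np.random.default_rng(1)
    # Synthetic replay buffer: only the first two state dimensions ever vary.
    replay_buffer = rng.uniform(-1.0, 1.0, size=(256, 4)) * np.array([1.0, 1.0, 0.01, 0.01])
    model = TinyAutoEncoder(state_dim=4, latent_dim=2)

    initial_loss = model.train_step(replay_buffer)
    final_loss = initial_loss
    for _ in range(2000):
        final_loss = model.train_step(replay_buffer)

    # A state that moves along dimensions the buffer has never exercised.
    novel_state = np.array([0.0, 0.0, 1.5, 1.5])
    familiar = float(model.novelty(replay_buffer).mean())
    novel = float(model.novelty(novel_state)[0])
    return initial_loss, final_loss, familiar, novel


if __name__ == "__main__":
    init, final, familiar, novel = run_demo()
    print(f"loss: {init:.4f} -> {final:.4f}")
    print(f"intrinsic reward, familiar states: {familiar:.4f}")
    print(f"intrinsic reward, novel state:     {novel:.4f}")
```

In this sketch, states resembling those in the buffer are reconstructed well and receive a small reward, while a state outside the buffer's distribution is reconstructed poorly and receives a larger one.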

AutoEncoder

Results

The results were obtained with a Dueling Deep Q-Network (DQN) agent. All tested tasks were learned without any extrinsic reward from the environment, using only this novelty-based intrinsic reward.

  • Acrobot-v1 AutoEncoder

  • CartPole-v1 AutoEncoder

  • LunarLander-v2 AutoEncoder

  • MountainCar-v0 AutoEncoder
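For reference, the two ingredients named in the results, a dueling Q-value head and a learning signal built only from the intrinsic reward, can be sketched as follows. The dueling aggregation is the standard form, Q(s, a) = V(s) + A(s, a) - mean over a' of A(s, a'). The one-step TD target, the discount factor `gamma`, and the function names are assumptions made for illustration. Whether the repository uses a target network or other DQN variants is not stated here.

```python
import numpy as np


def dueling_q_values(value, advantages):
    """Combine state values V(s) and advantages A(s, a) into Q(s, a).

    value: shape (batch,), advantages: shape (batch, n_actions).
    """
    value = np.asarray(value, dtype=float).reshape(-1, 1)
    advantages = np.asarray(advantages, dtype=float)
    return value + advantages - advantages.mean(axis=1, keepdims=True)


def td_targets(intrinsic_reward, next_q_values, done, gamma=0.99):
    """One-step TD target that uses only the intrinsic (novelty) reward.

    No extrinsic environment reward enters the target; gamma is an assumed value.
    """
    intrinsic_reward = np.asarray(intrinsic_reward, dtype=float)
    next_q_values = np.asarray(next_q_values, dtype=float)
    done = np.asarray(done, dtype=float)
    return intrinsic_reward + gamma * (1.0 - done) * next_q_values.max(axis=1)


if __name__ == "__main__":
    q_next = dueling_q_values(value=[1.0], advantages=[[1.0, 2.0, 3.0]])
    print(q_next)
    print(td_targets(intrinsic_reward=[0.5], next_q_values=q_next, done=[0], gamma=0.9))
```

Subtracting the mean advantage keeps the value and advantage streams identifiable, so the mean of Q over actions equals V(s).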