Skip to content

viraat/hindsight-experience-replay

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

18 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

hindsight-experience-replay

Reproducing results from the Hindsight Experience Replay Paper in PyTorch

Details

  • implemented the bit flip environment
    • Task: given a starting string of n bits and a target string of same length, flip the bits in the start string till the target string is achieved. The number of flips allowed is equal to the number of bits in the string - n
  • implemented a deep q-network with one hidden layer of 256 nodes
  • implemented hindsight-experience-replay with goal selection rule of s_T, i.e. the new goal is the last state achieved in the sequence of flips (in the file dqn-her.ipynb)
  • implemented a baseline DQN network without hindsight-experience-replay (in the file dqn.ipynb)

Results

  • Compared success rate across number of episodes for bit length n=6,7,8 for both DQN and DQN+HER (could not do higher bit length as no GPU)

6dqn 6her 7dqn 7her 8dqn 8her

Only HER Results

10her 10herloss

To do

  • Higher bit length using Google Colab notebook

About

Reproducing results from the Hindsight Experience Replay paper in PyTorch

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published