
Latent Learning with Go-Explore

This project replicates the findings of Edward Tolman's latent learning experiment using reinforcement learning. The goal is to investigate whether reinforcement learning can reproduce the findings of classical psychological experiments. All the code and documentation can be found in the main notebook in the repository.

The Latent Learning Experiment

In 1948, psychologist Edward Tolman published a seminal paper in Psychological Review [1] introducing the idea of latent learning. The experiment involved rats navigating a maze to find food. The rats were divided into three groups: one that received no reward during training, one that received a reward at the end of each training trial, and one that received a reward only after a delay.

The results showed that once a reward was eventually introduced, the rats that had received no reward during training (the "latent learning" group) performed just as well as the rats that had been rewarded at the end of each training trial. This suggests that the latent learning group had learned the layout of the maze during training, despite receiving no immediate reward for doing so.
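In the RL replication below, these three conditions map naturally onto reward schedules. Here is a minimal sketch; the function name, arguments, and the `switch_trial` parameter (standing in for the point at which reward is introduced for the delayed group) are illustrative, not the notebook's actual code:

```python
def reward(condition, trial, reached_goal, switch_trial=10):
    """Reward on reaching the goal under each training condition.

    `switch_trial` is an illustrative stand-in for the trial at which
    reward is introduced for the delayed-reward group.
    """
    if not reached_goal:
        return 0.0
    if condition == "immediate":
        return 1.0  # rewarded at the end of every training trial
    if condition == "delayed":
        return 1.0 if trial >= switch_trial else 0.0
    return 0.0      # no-reward ("latent learning") condition
```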

Replicating Latent Learning with Go-Explore

Learning without rewards, or with sparse rewards, is a widely studied problem in reinforcement learning, and the Go-Explore algorithm from OpenAI [2] tackles it head-on, performing remarkably well on Atari's infamous Montezuma's Revenge. I've implemented a bare-bones version of the algorithm for a very simple maze environment. The rat-like agent is trained in a manner analogous to the rats in Tolman's experiment, under the same three conditions: no reward, immediate reward, and delayed reward. The results of the simulation show that the agent trained with no reward (the "latent learning" condition) performs just as well as (even slightly better than) the agent trained with an immediate reward once a reward is eventually introduced.
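For reference, here is a minimal sketch of Go-Explore's exploration phase as it applies to a deterministic grid maze. It assumes a Gym-style environment (`reset`/`step`/`action_space`) and treats the agent's grid position as the archive cell; the names are illustrative and not taken from the notebook.

```python
import random
from collections import defaultdict

def cell_of(state):
    # In a small grid maze, the agent's (row, col) position can serve
    # directly as Go-Explore's "cell" (discretized state).
    return tuple(state)

def explore(env, iterations=1000, rollout_len=20):
    start = env.reset()
    # Archive: cell -> action sequence that first reached it.
    archive = {cell_of(start): []}
    visits = defaultdict(int)  # how often each cell was selected

    for _ in range(iterations):
        # 1) Select a cell to return to, favoring rarely selected ones.
        cell = min(archive, key=lambda c: (visits[c], random.random()))
        visits[cell] += 1

        # 2) "Go": return to the cell by replaying its stored trajectory
        #    (valid here because the maze is deterministic).
        env.reset()
        trajectory = list(archive[cell])
        for action in trajectory:
            env.step(action)

        # 3) "Explore": act randomly from there, archiving new cells.
        for _ in range(rollout_len):
            action = env.action_space.sample()
            state, _, done, _ = env.step(action)
            trajectory.append(action)
            c = cell_of(state)
            if c not in archive:
                archive[c] = list(trajectory)
            if done:
                break
    return archive
```

The key design choice is the archive: because the maze is deterministic, a stored action sequence is enough to "return" to any discovered cell before exploring further, which is what lets the agent map the maze without any reward signal.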

[Figure: simulation results for the three training conditions]

Conclusion

This project demonstrates that reinforcement learning can be used to replicate the findings of classical psychological experiments such as Tolman's latent learning experiment. The results of the simulation support the idea that RL agents, much like Tolman's rats, can explore their environment and learn even in the absence of reinforcement, and can then demonstrate the learned behavior once reinforcement is introduced.

References

  1. Tolman, E. C. (1948). Cognitive maps in rats and men. Psychological Review, 55(4), 189–208.
  2. Ecoffet, A., Huizinga, J., Lehman, J., Stanley, K. O., & Clune, J. (2021). First return, then explore. Nature, 590(7847), 580–586.

Acknowledgements

The maze environment has been created using mazelab.

This project was undertaken for the Systems Design, Integration, and Control class at Universitat Pompeu Fabra.
