This repository contains a PyTorch implementation of VIME: Variational Information Maximizing Exploration. It is based off the following repository.
It uses standard policy gradient methods
- Include other environments
- Need continuous policy option
- Q learning
- CEM
- Check parameters against VIME
- Finish logging / plotting
- Saving / loading models
- Plot states visited
- Consider sine function learning as gym environment # vime