PER is one of the most important components of Rainbow DQN, and the original paper also suggested applying it to supervised learning. The paper reports faster learning and better generalization but did not make an exhaustive investigation.
The paper also investigated importance sampling of mini-batches and showed that it increases learning speed.
However, despite these benefits the approach has not been widely adopted in supervised learning, perhaps due to implementation friction in current frameworks like PyTorch and TensorFlow: the PyTorch data loaders were not designed with this use case in mind.
This is a quick minimal experiment with the focus of investigating PER/importance sampling in supervised learning.
The standard PyTorch MNIST example is used as a base, with the number of training examples reduced to only 500; otherwise the problem is too easy.
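A minimal sketch of the reduced training set. A synthetic `TensorDataset` stands in for MNIST here to keep the snippet self-contained; the actual experiment would load `torchvision.datasets.MNIST` instead, and the seed and index choice are assumptions.

```python
import torch
from torch.utils.data import TensorDataset, Subset

# Stand-in for the full MNIST training set (60000 1x28x28 images).
full = TensorDataset(torch.randn(60000, 1, 28, 28),
                     torch.randint(0, 10, (60000,)))

# Keep a random subset of 500 examples so the task is no longer trivial.
g = torch.Generator().manual_seed(0)
idx = torch.randperm(len(full), generator=g)[:500]
train = Subset(full, idx.tolist())
```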
- Standard random sampling without replacement
- Importance sampling with replacement
- Uniform sampling with replacement
- Compare standard and importance sampling
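The with-replacement variants above can be expressed with PyTorch's built-in `WeightedRandomSampler`. The sketch below draws mini-batch indices proportional to per-example priorities and computes the importance-sampling weights that correct the induced bias; the priorities (random here, loss-based in practice) and the `alpha`/`beta` hyperparameters are assumptions, not values from the experiment.

```python
import torch
from torch.utils.data import TensorDataset, DataLoader, WeightedRandomSampler

N = 500
data = TensorDataset(torch.randn(N, 1, 28, 28), torch.randint(0, 10, (N,)))

# Per-example priorities, e.g. the most recent training loss (random here).
losses = torch.rand(N)
alpha, beta = 0.6, 0.4                      # assumed PER-style hyperparameters
probs = losses ** alpha
probs = probs / probs.sum()

# Draw indices WITH replacement, proportional to priority.
# (Uniform sampling with replacement is the special case probs = 1/N.)
sampler = WeightedRandomSampler(probs, num_samples=N, replacement=True)
loader = DataLoader(data, batch_size=64, sampler=sampler)

# Importance-sampling weights correct the bias of non-uniform sampling:
# multiply each example's loss by w[i] before averaging the batch loss.
w = (N * probs) ** (-beta)
w = w / w.max()                             # normalize so max weight is 1
```

Standard random sampling without replacement is just the default `DataLoader(data, batch_size=64, shuffle=True)`.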
On this small and relatively simple MNIST task, no speedup is apparent (if anything, the opposite), but generalization might be better. This is consistent with results on other small problems where I have tried PER.