Explainable Reinforcement Learning: Visual Policy Rationalizations Using Grad-CAM

This reposity is holds the code needed to reproduce the results for my thesis. The Grad-CAM implementation used is based on https://github.com/kazuto1011/grad-cam-pytorch, the A3C implementation is based on https://github.com/ikostrikov/pytorch-a3c.

Examples

Some example analysis can be found here.

How to use the code

Note that it is still a work in progress, but eventually it should look like this:

python3 main.py --env-name ENV_NAME --gradcam_layer GCAM_LAYER

This command should run an episode of ENV_NAME using the pretrained models and collect Grad-CAM outputs for each state/action at layer GCAM_LAYER (which would default to features.elu4).

trained models

The models can be found in the pretrained folder. Each model has both a 'full' and a 'half' version (see paper).

	Full Agent Mean	Full Agent Variance	Half Agent Mean	Half Agent Variance	DeepMind
Pong	21.00	0.00	14.99	0.09	10.7
BeamRider	4659.04	1932.58	1597.40	1202.00	24622.2
Seaquest	1749.00	11.44	N/A	N/A	1326.1

Note that these models are based on amount of frames whereas DeepMind is based on 4 day training on 16 CPU cores, which makes comparing them hard.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
pretrained		pretrained
Grad_CAM.py		Grad_CAM.py
README.md		README.md
envs.py		envs.py
main.py		main.py
model.py		model.py
play.py		play.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

pretrained

pretrained

Grad_CAM.py

Grad_CAM.py

README.md

README.md

envs.py

envs.py

main.py

main.py

model.py

model.py

play.py

play.py

Repository files navigation

Explainable Reinforcement Learning: Visual Policy Rationalizations Using Grad-CAM

Examples

How to use the code

trained models

About

Releases

Packages

Languages

lweitkamp/rl_rationalizations

Folders and files

Latest commit

History

Repository files navigation

Explainable Reinforcement Learning: Visual Policy Rationalizations Using Grad-CAM

Examples

How to use the code

trained models

About

Resources

Stars

Watchers

Forks

Languages