Explainable Reinforcement Learning (XRL) Resources

This repository aims to keep an up-to-date list of research on explainable reinforcement learning (XRL). The repository supplements the survey paper found here. If you find this helpful, please give this repository a star, share it, and cite the survey paper.

Missing resources, issues, or questions? Please open an issue here.

Resources

Awesome Explainable Reinforcement Learning. Link

Survey Papers

#/Link	Title	Venue/Journal	Year
1	Redefining Counterfactual Explanations for Reinforcement Learning: Overview, Challenges and Opportunities	ACM Comput. Surv.	2024
2	A Survey of Global Explanations in Reinforcement Learning	Explainable Agency in Artificial Intelligence	2024
3	Explainable reinforcement learning (XRL): a systematic literature review and taxonomy	Mach. Learn.	2024
4	Explainability in Deep Reinforcement Learning, a Review into Current Methods and Applications	ACM Comput. Surv.	2023
5	Explainable Reinforcement Learning: A Survey and Comparative Review	ACM Comput. Surv.	2023
6	A Survey on Explainable Reinforcement Learning: Concepts, Algorithms, Challenges	CoRR	2023
7	Explainable reinforcement learning for broad-XAI: a conceptual framework and survey	Neural Comput. Appl.	2023
8	Explainable Deep Reinforcement Learning: State of the Art and Challenges	ACM Comput. Surv.	2022
9	A Survey on Interpretable Reinforcement Learning	CoRR	2022
10	Explainability in reinforcement learning: perspective and position	CoRR	2022
11	Explainable AI and Reinforcement Learning - A Systematic Review of Current Approaches and Trends	Frontiers Artif. Intell.	2021
12	Explainability in deep reinforcement learning	Knowl. Based Syst.	2021
13	Explainable Reinforcement Learning: A Survey	CD-MAKE	2020
14	Reinforcement Learning Interpretation Methods: A Survey	IEEE Access	2020

Papers

#/Link	Title	Venue/Journal	Year
1	Explaining Reinforcement Learning Agents Through Counterfactual Action Outcomes	AAAI	2024
2	ASAP: Attention-Based State Space Abstraction for Policy Summarization	ACML	2024
3	Interpretable Concept Bottlenecks to Align Reinforcement Learning Agents	CoRR	2024
4	Detection of Important States through an Iterative Q-value Algorithm for Explainable Reinforcement Learning	HICSS	2024
5	Local Explanations for Reinforcement Learning	AAAI	2023
6	Explainable Reinforcement Learning Based on Q-Value Decomposition by Expected State Transitions	AAAI-MAKE	2023
7	GANterfactual-RL: Understanding Reinforcement Learning Agents' Strategies through Visual Counterfactual Explanations	AAMAS	2023
8	Interpreting a deep reinforcement learning model with conceptual embedding and performance analysis	Appl. Intell.	2023
9	Deep Explainable Relational Reinforcement Learning: A Neuro-Symbolic Approach	ECML PKDD	2023
10	Inherently Interpretable Deep Reinforcement Learning Through Online Mimicking	EXTRAAMAS	2023
11	Towards Interpretable Deep Reinforcement Learning with Human-Friendly Prototypes	ICLR	2023
12	Explaining Reinforcement Learning with Shapley Values	ICML	2023
13	Counterfactual Explanation Policies in RL	ICML Workshop on Counterfactuals in Minds and Machines	2023
14	Explaining Black Box Reinforcement Learning Agents Through Counterfactual Policies	IDA	2023
15	Extracting Decision Tree From Trained Deep Reinforcement Learning in Traffic Signal Control	IEEE Trans. Comput. Soc. Syst.	2023
16	Explainable Multi-Agent Reinforcement Learning for Temporal Queries	IJCAI	2023
17	Explainable Reinforcement Learning via a Causal World Model	IJCAI	2023
18	Unveiling Concepts Learned by a World-Class Chess-Playing Agent	IJCAI	2023
19	Extracting tactics learned from self-play in general games	Inf. Sci.	2023
20	Learning state importance for preference-based reinforcement learning	Mach. Learn.	2023
21	Interpretable and Explainable Logical Policies via Neurally Guided Symbolic Abstraction	NeurIPS	2023
22	State2Explanation: Concept-Based Explanations to Benefit Agent Learning and User Understanding	NeurIPS	2023
23	StateMask: Explaining Deep Reinforcement Learning through State Mask	NeurIPS	2023
24	Comparing explanations in RL	Neural Comput. Appl.	2023
25	Explainable robotic systems: understanding goal-driven actions in a reinforcement learning scenario	Neural Comput. Appl.	2023
26	Hierarchical goals contextualize local reward decomposition explanations	Neural Comput. Appl.	2023
27	Achieving efficient interpretability of reinforcement learning via policy distillation and selective input gradient regularization	Neural Networks	2023
28	Integrating Policy Summaries with Reward Decomposition for Explaining Reinforcement Learning Agents	PAAMS	2023
29	IxDRL: A Novel Explainable Deep Reinforcement Learning Toolkit Based on Analyses of Interestingness	xAI	2023
30	"I Don't Think So": Summarizing Policy Disagreements for Agent Comparison	AAAI	2022
31	CAPS: Comprehensible Abstract Policy Summaries for Explaining Reinforcement Learning Agents	AAMAS	2022
32	Interpretable Preference-based Reinforcement Learning with Tree-Structured Reward Functions	AAMAS	2022
33	Lazy-MDPs: Towards Interpretable RL by Learning When to Act	AAMAS	2022
34	Explaining Online Reinforcement Learning Decisions of Self-Adaptive Systems	ACSOS	2022
35	Analysis of Explainable Goal-Driven Reinforcement Learning in a Continuous Simulated Environment	Algorithms	2022
36	BEERL: Both Ends Explanations for Reinforcement Learning	Applied Sciences	2022
37	Energy-Efficient Driving for Adaptive Traffic Signal Control Environment via Explainable Reinforcement Learning	Applied Sciences	2022
38	Concept Learning for Interpretable Multi-Agent Reinforcement Learning	CoRL	2022
39	Comparing Strategies for Visualizing the High-Dimensional Exploration Behavior of CPS Design Agents	DESTION	2022
40	InAction: Interpretable Action Decision Making for Autonomous Driving	ECCV	2022
41	Enhanced Oblique Decision Tree Enabled Policy Extraction for Deep Reinforcement Learning in Power System Emergency Control	Electric Power Systems Research	2022
42	Attributation Analysis of Reinforcement Learning-Based Highway Driver	Electronics	2022
43	Multi-objective Genetic Programming for Explainable Reinforcement Learning	EuroGP	2022
44	Deep-Learning-based Fuzzy Symbolic Processing with Agents Capable of Knowledge Communication	ICAART	2022
45	Bridging the Gap: Providing Post-Hoc Symbolic Explanations for Sequential Decision-Making Problems with Inscrutable Representations	ICLR	2022
46	POETREE: Interpretable Policy Learning with Adaptive Decision Trees	ICLR	2022
47	Programmatic Reinforcement Learning without Oracles	ICLR	2022
48	Explaining Reinforcement Learning Policies through Counterfactual Trajectories	ICML Workshop on HILL	2022
49	Mean-variance Based Risk-sensitive Reinforcement Learning with Interpretable Attention	ICMVA	2022
50	Towards Interpretable Deep Reinforcement Learning Models via Inverse Reinforcement Learning	ICPR	2022
51	Explaining Intelligent Agent's Future Motion on Basis of Vocabulary Learning With Human Goal Inference	IEEE Access	2022
52	Interpretable Autonomous Flight Via Compact Visualizable Neural Circuit Policies	IEEE Robotics Autom. Lett.	2022
53	Explainable AI in Deep Reinforcement Learning Models for Power System Emergency Control	IEEE Trans. Comput. Soc. Syst.	2022
54	Hierarchical Program-Triggered Reinforcement Learning Agents for Automated Driving	IEEE Trans. Intell. Transp. Syst.	2022
55	Interpretable End-to-End Urban Autonomous Driving With Latent Deep Reinforcement Learning	IEEE Trans. Intell. Transp. Syst.	2022
56	Continuous Action Reinforcement Learning From a Mixture of Interpretable Experts	IEEE Trans. Pattern Anal. Mach. Intell.	2022
57	Self-Supervised Discovering of Interpretable Features for Reinforcement Learning	IEEE Trans. Pattern Anal. Mach. Intell.	2022
58	Temporal-Spatial Causal Interpretations for Vision-Based Reinforcement Learning	IEEE Trans. Pattern Anal. Mach. Intell.	2022
59	Visual Analytics for RNN-Based Deep Reinforcement Learning	IEEE Trans. Vis. Comput. Graph.	2022
60	Toward Interpretable-AI Policies Using Evolutionary Nonlinear Decision Trees for Discrete-Action Systems	IEEE Transactions on Cybernetics	2022
61	Understanding via Exploration: Discovery of Interpretable Features With Deep Reinforcement Learning	IEEE Transactions on Neural Networks and Learning Systems	2022
62	Summarising and Comparing Agent Dynamics with Contrastive Spatiotemporal Abstraction	IJCAI Workshop on XAI	2022
63	ACMViz: a visual analytics approach to understand DRL-based autonomous control model	J. Vis.	2022
64	Incorporating Explanations to Balance the Exploration and Exploitation of Deep Reinforcement Learning	KSEM	2022
65	Towards Explainable Reinforcement Learning Using Scoring Mechanism Augmented Agents	KSEM	2022
66	Explainable Reinforcement Learning via Model Transforms	NeurIPS	2022
67	GALOIS: Boosting Deep Reinforcement Learning via Generalizable Logic Synthesis	NeurIPS	2022
68	Inherently Explainable Reinforcement Learning in Natural Language	NeurIPS	2022
69	Non-Markovian Reward Modelling from Trajectory Labels via Interpretable Multiple Instance Learning	NeurIPS	2022
70	ProtoX: Explaining a Reinforcement Learning Agent via Prototyping	NeurIPS	2022
71	(When) Are Contrastive Explanations of Reinforcement Learning Helpful?	NeurIPS workshop on HiLL	2022
72	Mo"ET: Mixture of Expert Trees and its application to verifiable reinforcement learning	Neural Networks	2022
73	Analysing deep reinforcement learning agents trained with domain randomisation	Neurocomputing	2022
74	Why? Why not? When? Visual Explanations of Agent Behaviour in Reinforcement Learning	PacificVis	2022
75	Driving behavior explanation with multi-level fusion	Pattern Recognit.	2022
76	Acquisition of chess knowledge in AlphaZero	Proc. Natl. Acad. Sci. U.S.A.	2022
77	Learning Interpretable, High-Performing Policies for Autonomous Driving	Robotics: Science and Systems	2022
78	Event-driven temporal models for explanations - ETeMoX: explaining reinforcement learning	Softw. Syst. Model.	2022
79	Toward a Psychology of Deep Reinforcement Learning Agents Using a Cognitive Architecture	Top. Cogn. Sci.	2022
80	DeepSynth: Automata Synthesis for Automatic Task Segmentation in Deep Reinforcement Learning	AAAI	2021
81	Iterative Bounding MDPs: Learning Interpretable Policies via Non-Interpretable Methods	AAAI	2021
82	TripleTree: A Versatile Interpretable Representation of Black Box Agents and their Environments	AAAI	2021
83	Explaining Deep Reinforcement Learning Agents in the Atari Domain through a Surrogate Model	AIIDE	2021
84	A framework of explanation generation toward reliable autonomous robots	Adv. Robotics	2021
85	Explainable Deep Reinforcement Learning for UAV autonomous path planning	Aerospace Science and Technology	2021
86	Explaining robot policies	Applied AI Letters	2021
87	Counterfactual state explanations for reinforcement learning agents via generative deep learning	Artif. Intell.	2021
88	Local and global explanations of agent behavior: Integrating strategy summaries with saliency maps	Artif. Intell.	2021
89	XPM: An Explainable Deep Reinforcement Learning Framework for Portfolio Management	CIKM	2021
90	Interactive Explanations: Diagnosis and Repair of Reinforcement Learning Based Agent Behaviors	CoG	2021
91	CDT: Cascading Decision Trees for Explainable Reinforcement Learning	CoRR	2021
92	Contrastive Explanations for Comparing Preferences of Reinforcement Learning Agents	CoRR	2021
93	Approximating a deep reinforcement learning docking agent using linear model trees	ECC	2021
94	Robotic Lever Manipulation using Hindsight Experience Replay and Shapley Additive Explanations	ECC	2021
95	Off-Policy Differentiable Logic Reinforcement Learning	ECML PKDD	2021
96	Neuro-Symbolic Reinforcement Learning with First-Order Logic	EMNLP	2021
97	Explainable Reinforcement Learning for Longitudinal Control	ICAART	2021
98	Explainable deep reinforcement learning for portfolio management: an empirical approach	ICAIF	2021
99	Explainable Reinforcement Learning for Human-Robot Collaboration	ICAR	2021
100	DRIVE: Deep Reinforced Accident Anticipation with Visual Explanation	ICCV	2021
101	Contrastive Explanations for Reinforcement Learning via Embedded Self Predictions	ICLR	2021
102	Explaining by Imitating: Understanding Decisions by Interpretable Policy Learning	ICLR	2021
103	Learning "What-if" Explanations for Sequential Decision-Making	ICLR	2021
104	Discovering symbolic policies with deep reinforcement learning	ICML	2021
105	Re-understanding Finite-State Representations of Recurrent Policy Networks	ICML	2021
106	Explainable Reinforcement Learning with the Tsetlin Machine	IEA/AIE	2021
107	A Blood Glucose Control Framework Based on Reinforcement Learning With Safety and Interpretability: In Silico Validation	IEEE Access	2021
108	Symbolic Regression Methods for Reinforcement Learning	IEEE Access	2021
109	Efficient Robotic Object Search Via HIEM: Hierarchical Policy Learning With Intrinsic-Extrinsic Modeling	IEEE Robotics Autom. Lett.	2021
110	Learning to Discover Task-Relevant Features for Interpretable Reinforcement Learning	IEEE Robotics Autom. Lett.	2021
111	Explaining Deep Learning Models Through Rule-Based Approximation and Visualization	IEEE Trans. Fuzzy Syst.	2021
112	Interpretable Decision-Making for Autonomous Vehicles at Highway On-Ramps With Latent Space Reinforcement Learning	IEEE Trans. Veh. Technol.	2021
113	Explainable AI methods on a deep reinforcement learning agent for automatic docking	IFAC-PapersOnLine	2021
114	Visual Explanation using Attention Mechanism in Actor-Critic-based Deep Reinforcement Learning	IJCNN	2021
115	Programmatic Policy Extraction by Iterative Local Search	ILP	2021
116	Explaining the Decisions of Deep Policy Networks for Robotic Manipulations	IROS	2021
117	XAI-N: Sensor-based Robot Navigation using Expert Policies and Decision Trees	IROS	2021
118	Mixed Autonomous Supervision in Traffic Signal Control	ITSC	2021
119	Can You Trust Your Autonomous Car? Interpretable and Verifiably Safe Reinforcement Learning	IV	2021
120	Explaining a Deep Reinforcement Learning Docking Agent Using Linear Model Trees with User Adapted Visualization	Journal of Marine Science and Engineering	2021
121	Visual Analysis of Deep Q-network	KSII Trans. Internet Inf. Syst.	2021
122	Automatic discovery of interpretable planning strategies	Mach. Learn.	2021
123	EDGE: Explaining Deep Reinforcement Learning Policies	NeurIPS	2021
124	Learning Tree Interpretation from Object Representation for Deep Reinforcement Learning	NeurIPS	2021
125	Learning to Synthesize Programs as Interpretable and Generalizable Policies	NeurIPS	2021
126	Machine versus Human Attention in Deep Reinforcement Learning Tasks	NeurIPS	2021
127	Explainable Artificial Intelligence (XAI) for Increasing User Trust in Deep Reinforcement Learning Driven Autonomous Systems	NeurIPS Workshop on Deep RL	2021
128	Identifying Decision Points for Safe and Interpretable Reinforcement Learning in Hypotension Treatment	NeurIPS Workshop on Machine Learning for Health	2021
129	Feature-Based Interpretable Reinforcement Learning based on State-Transition Models	SMC	2021
130	A co-evolutionary approach to interpretable reinforcement learning in environments with continuous action spaces	SSCI	2021
131	Interpretable AI Agent Through Nonlinear Decision Trees for Lane Change Problem	SSCI	2021
132	Learning Sparse Evidence- Driven Interpretation to Understand Deep Reinforcement Learning Agents	SSCI	2021
133	Explainable Reinforcement Learning through a Causal Lens	AAAI	2020
134	Attribution-based Salience Method towards Interpretable Reinforcement Learning	AAAI-MAKE	2020
135	Learning an Interpretable Traffic Signal Control Policy	AAMAS	2020
136	Optimization Methods for Interpretable Differentiable Decision Trees Applied to Reinforcement Learning	AISTATS	2020
137	Interestingness elements for explainable reinforcement learning: Understanding agents' capabilities and limitations	Artif. Intell.	2020
138	Model primitives for hierarchical lifelong reinforcement learning	Auton. Agents Multi Agent Syst.	2020
139	Understanding the Behavior of Reinforcement Learning Agents	BIOMA	2020
140	Methodology for Interpretable Reinforcement Learning Model for HVAC Energy Control	Big Data	2020
141	Explaining Autonomous Driving by Learning End-to-End Visual Attention	CVPRW	2020
142	Understanding Learned Reward Functions	CoRR	2020
143	Interpretable policy derivation for reinforcement learning based on evolutionary feature synthesis	Complex & Intelligent Systems	2020
144	DRLViz: Understanding Decisions and Memory in Deep Reinforcement Learning	Comput. Graph. Forum	2020
145	Understanding RL Vision	Distill	2020
146	Interpretable policies for reinforcement learning by empirical fuzzy sets	Eng. Appl. Artif. Intell.	2020
147	Neuroevolution of self-interpretable agents	GECCO	2020
148	Topological Visualization Method for Understanding the Landscape of Value Functions and Structure of the State Space in Reinforcement Learning	ICAART	2020
149	Identifying Critical States by the Action-Based Variance of Expected Return	ICANN	2020
150	TLdR: Policy Summarization for Factored SSP Problems Using Temporal Abstractions	ICAPS	2020
151	Explain Your Move: Understanding Agent Actions Using Specific and Relevant Feature Attribution	ICLR	2020
152	Exploratory Not Explanatory: Counterfactual Analysis of Saliency Maps for Deep Reinforcement Learning	ICLR	2020
153	Finding and Visualizing Weaknesses of Deep Reinforcement Learning Agents	ICLR	2020
154	Interpretable Off-Policy Evaluation in Reinforcement Learning by Highlighting Influential Transitions	ICML	2020
155	Deep Reinforcement Learning for Safe Local Planning of a Ground Vehicle in Unknown Rough Terrain	IEEE Robotics Autom. Lett.	2020
156	Towards Interpretable Reinforcement Learning with State Abstraction Driven by External Knowledge	IEICE Trans. Inf. Syst.	2020
157	Improved Policy Extraction via Online Q-Value Distillation	IJCNN	2020
158	Visualization of topographical internal representation of learning robots	IJCNN	2020
159	Explainable navigation system using fuzzy reinforcement learning	IJIDeM	2020
160	Explainability of Intelligent Transportation Systems using Knowledge Compilation: a Traffic Light Controller Case	ITSC	2020
161	xGAIL: Explainable Generative Adversarial Imitation Learning for Explainable Human Decision Analysis	KDD	2020
162	What Did You Think Would Happen? Explaining Agent Behaviour through Intended Outcomes	NeurIPS	2020
163	Explaining Conditions for Reinforcement Learning Behaviors from Real and Imagined Data	NeurIPS Workshop on Challenges of Real-World RL	2020
164	DynamicsExplorer: Visual Analytics for Robot Control Tasks involving Dynamics and LSTM-based Control Policies	PacificVis	2020
165	Combining reinforcement learning with rule-based controllers for transparent and general decision-making in autonomous driving	Robotics Auton. Syst.	2020
166	Modelling Agent Policies with Interpretable Imitation Learning	TAILOR	2020
167	Interpretable, Verifiable, and Robust Reinforcement Learning via Program Synthesis	xxAI - Beyond Explainable AI	2020
168	Generation of Policy-Level Explanations for Reinforcement Learning	AAAI	2019
169	SDRL: Interpretable and Data-Efficient Deep Reinforcement Learning Leveraging Symbolic Planning	AAAI	2019
170	Towards Better Interpretability in Deep Q-Networks	AAAI	2019
171	Toward Robust Policy Summarization	AAMAS	2019
172	Towards Governing Agent's Efficacy: Action-Conditional \textdollar(\beta)\textdollar-VAE for Deep Transparent Reinforcement Learning	ACML	2019
173	Memory-Based Explainable Reinforcement Learning	AI	2019
174	Summarizing agent strategies	Auton. Agents Multi Agent Syst.	2019
175	Enabling robots to communicate their objectives	Auton. Robots	2019
176	Visualization of Deep Reinforcement Learning using Grad-CAM: How AI Plays Atari Games?	CoG	2019
177	Explaining Reward Functions in Markov Decision Processes	FLAIRS	2019
178	Explanation-Based Reward Coaching to Improve Human Performance via Reinforcement Learning	HRI	2019
179	Free-Lunch Saliency via Attention in Atari Agents	ICCVW	2019
180	Deep reinforcement learning with relational inductive biases	ICLR	2019
181	Learning Finite State Representations of Recurrent Policy Networks	ICLR	2019
182	Neural Logic Reinforcement Learning	ICML	2019
183	Interpretable Approximation of a Deep Reinforcement Learning Agent as a Set of If-Then Rules	ICMLA	2019
184	Semantic Predictive Control for Explainable and Efficient Policy Learning	ICRA	2019
185	DQNViz: A Visual Analytics Approach to Understand Deep Q-Networks	IEEE Trans. Vis. Comput. Graph.	2019
186	Visualizing Deep Q-Learning to Understanding Behavior of Swarm Robotic System	IES	2019
187	Exploring Computational User Models for Agent Policy Summarization	IJCA	2019
188	Explaining Reinforcement Learning to Mere Mortals: An Empirical Study	IJCAI	2019
189	Counterfactual States for Atari Agents via Generative Deep Learning	IJCAI Workshop on XAI	2019
190	Distilling Deep Reinforcement Learning Policies in Soft Decision Trees	IJCAI Workshop on XAI	2019
191	Dot-to-Dot: Explainable Hierarchical Reinforcement Learning for Robotic Manipulation	IROS	2019
192	Reinforcement Learning with Explainability for Traffic Signal Control	ITSC	2019
193	Interestingness Elements for Explainable Reinforcement Learning through Introspection	IUI Workshops	2019
194	Explainable Reinforcement Learning via Reward Decomposition	JCAI Workshop on XAI	2019
195	Enhancing Explainability of Deep Reinforcement Learning Through Selective Layer-Wise Relevance Propagation	KI	2019
196	Imitation-Projected Programmatic Reinforcement Learning	NeurIPS	2019
197	Towards Interpretable Reinforcement Learning Using Attention Augmented Agents	NeurIPS	2019
198	Verbal Explanations for Deep Reinforcement Learning Neural Networks with Attention on Extracted Features	RO-MAN	2019
199	A formal methods approach to interpretable reinforcement learning for robotic planning	Sci. Robotics	2019
200	HIGHLIGHTS: Summarizing Agent Behavior to People	AAMAS	2018
201	Rationalization: A Neural Machine Translation Approach to Generating Natural Language Explanations	AIES	2018
202	Transparency and Explanation in Deep Reinforcement Learning Neural Networks	AIES	2018
203	Visual Rationalizations in Deep Reinforcement Learning for Atari Games	BNAIC	2018
204	Textual Explanations for Self-Driving Vehicles	ECCV	2018
205	Toward Interpretable Deep Reinforcement Learning with Linear Model U-Trees	ECML PKDD	2018
206	Interpretable policies for reinforcement learning by genetic programming	Eng. Appl. Artif. Intell.	2018
207	Generating interpretable fuzzy controllers using particle swarm optimization and genetic programming	GECCO	2018
208	Hierarchical and Interpretable Skill Acquisition in Multi-task Reinforcement Learning	ICLR	2018
209	Programmatically Interpretable Reinforcement Learning	ICML	2018
210	Visualizing and Understanding Atari Agents	ICML	2018
211	Deep Reinforcement Learning Monitor for Snapshot Recording	ICMLA	2018
212	Contrastive Explanations for Reinforcement Learning in terms of Expected Consequences	IJCAI Workshop on XAI	2018
213	Explaining Deep Adaptive Programs via Reward Decomposition	IJCAI/ECAI Workshop XAI	2018
214	Establishing Appropriate Trust via Critical States	IROS	2018
215	Unsupervised Video Object Segmentation for Deep Reinforcement Learning	NeurIPS	2018
216	Verifiable Reinforcement Learning via Policy Extraction	NeurIPS	2018
217	Visual Sparse Bayesian Reinforcement Learning: A Framework for Interpreting What an Agent Has Learned	SSCI	2018
218	Particle swarm optimization for generating interpretable fuzzy reinforcement learning policies	Eng. Appl. Artif. Intell.	2017
219	Autonomous Self-Explanation of Behavior for Interactive Reinforcement Learning Agents	HAI	2017
220	Improving Robot Controller Transparency Through Autonomous Policy Explanation	HRI	2017
221	Interpretable Learning for Self-Driving Cars by Visualizing Causal Attention	ICCV	2017
222	Application of Instruction-Based Behavior Explanation to a Reinforcement Learning Agent with Changing Policy	ICONIP	2017
223	Graying the black box: Understanding DQNs	ICML	2016

Citation

@article{Bekkemoen24,
author       = {Yanzhe Bekkemoen},
title        = {Explainable reinforcement learning {(XRL):} a systematic literature
review and taxonomy},
journal      = {Mach. Learn.},
volume       = {113},
number       = {1},
pages        = {355--441},
year         = {2024},
url          = {https://doi.org/10.1007/s10994-023-06479-7},
doi          = {10.1007/S10994-023-06479-7},
}

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

LICENSE

LICENSE

README.md

README.md

Repository files navigation

Explainable Reinforcement Learning (XRL) Resources

Resources

Survey Papers

Papers

Citation

About

License

yanzheb/explainable-reinforcement-learning

Folders and files

Latest commit

History

LICENSE

LICENSE

README.md

README.md

Repository files navigation

Explainable Reinforcement Learning (XRL) Resources

Resources

Survey Papers

Papers

Citation

About

Topics

Resources

License

Stars

Watchers

Forks