Robust constrained Markov decision processes (RCMDPs)

This repository implements algorithms for robust constrained Markov decision processes (RCMDPs; [1,2]):

Robust Constrained Policy Gradient (RCPG) and variants thereof
Adversarial RCPG

and related ablations:

CPG: ablation without robustness
PG: ablation without constraints, corresponds to REINFORCE

[1] R. H. Russel, M. Benosman, and J. Van Baar (2021). “Robust Constrained- MDPs: Soft-Constrained Robust Policy Optimization under Model Uncertainty.” Advances in Neural Information Processing Systems workshop (NeurIPS 2021). https://arxiv.org/abs/2010.04870

[2] D. M. Bossens (2024). "Robust Lagrangian and Adversarial Policy Gradient for Robust Constrained Markov Decision Processes." IEEE Conference on Artificial Intelligence (CAI 2024). https://arxiv.org/abs/2308.11267

Specifications

Tested on python 3.8

Dependencies: Keras and Tensorflow

Running the algorithm

You can run the algorithm on the experiments from [1] with the following commands.

Run the algorithm on SafeNavigation1:

python SafeNavigation1.py

Run the algorithm on SafeNavigation2:

python SafeNavigation2.py

Run the algorithm on InventoryManagement:

python InventoryManagement.py

Name		Name	Last commit message	Last commit date
Latest commit History 87 Commits
agent		agent
Analysis.py		Analysis.py
Analysis_IM.py		Analysis_IM.py
CMDP.py		CMDP.py
Choose_Method.py		Choose_Method.py
Critic.py		Critic.py
InventoryManagement.py		InventoryManagement.py
LR_Schedule.py		LR_Schedule.py
Policy.py		Policy.py
RCPG.py		RCPG.py
README.md		README.md
SafeNavigation1.py		SafeNavigation1.py
SafeNavigation2.py		SafeNavigation2.py
State.py		State.py
UncertaintySet.py		UncertaintySet.py
Utils.py		Utils.py
__init__.py		__init__.py
install		install

bossdm/RCMDP

Folders and files

Latest commit

History

Repository files navigation

Robust constrained Markov decision processes (RCMDPs)

Specifications

Running the algorithm

About

Resources

Stars

Watchers

Forks

Languages