Explainable MARL For Temporal Queries


Repository for supplementary material for "Explainable Multi-Agent Reinforcement Learning For Temporal Queries" published in IJCAI 2023.
Paper Link: https://arxiv.org/abs/2305.10378
Previous Paper Link: https://arxiv.org/abs/2204.12568
  *More information on abstract policy graph generation is available in the repository for that paper
OS tested on "CentOS Linux 7"
Set up:
1. Download Shared Experience Actor Critic for MARL and install as instructed
  *Used for training and evaluating policies
  Location: https://github.com/semitable/seac
  Rename the "seac-master" folder to "seac"
2. Download Level-based Foraging Environment and install as instructed
  *Used to register the search-and-rescue and level-based foraging environments with gym (see the import check after this list)
  Location: https://github.com/uoe-agents/lb-foraging
3. Download further domains if preferred and install as instructed
  Robotic warehouse domain location: https://github.com/uoe-agents/robotic-warehouse
  Pressure plate location: https://github.com/semitable/pressureplate
4. Download the "Improving Robot Controller Transparency Through Autonomous Policy Explanation" implementation and install as instructed
  *Used to set up the Quine-McCluskey algorithm
  Location: https://gitlab.tue.nl/ha800-hri/hayes-shah
5. Install any remaining requirements from "requirements.txt" as needed
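
After installation, a quick sanity check (a minimal sketch; the environment ID below is one of the IDs registered by lb-foraging and may differ across versions) confirms the dependencies import and an environment can be created:

  # Sanity check: importing lbforaging registers the Foraging-* environments
  # with gym. The environment ID is an example and may differ by version.
  import gym
  import lbforaging  # noqa: F401 -- the import alone performs the registration

  env = gym.make("Foraging-8x8-2p-2f-v2")
  obs = env.reset()
  print("observation space:", env.observation_space)
  env.close()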

Training a policy:
1. Policies should be trained as described by Shared Experience Actor Critic for MARL
  Location: https://github.com/semitable/seac
2. Any other MARL method can be used to train a policy if preferred
3. An example policy for each domain is given in "Code Pipeline/results/trained_models/###/u####" (a loading sketch follows this list)
4. An example environment for training is given in the "Code Pipeline/seac MARL replacement files" folder
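
As a rough illustration of inspecting one of the example policies (the "agentN/model.pt" layout is an assumption based on how seac saves models, so verify it against your checkout; the placeholder path must be filled in with an actual domain and run folder):

  # Hypothetical sketch of loading the example agent policies with torch.
  # The directory layout and file names here are assumptions.
  import torch

  model_dir = "Code Pipeline/results/trained_models/<domain>/<run>"  # placeholder
  n_agents = 3
  models = [torch.load(f"{model_dir}/agent{i}/model.pt") for i in range(n_agents)]
  print(f"loaded {len(models)} agent policies")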

Evaluating a policy:
1. Add the "evaluateForConRoll.py" file to the "seac/seac" folder
2. Set the policy path, environment name, time limit, number of evaluation steps, number of agents, number of tasks, and start state in the file (examples are given in the file; see also the sketch after this list)
3. Define any new rules for low-level to high-level state conversion in the file
4. Run the file -> "python evaluateForConRoll.py"
5. Example files produced from a small evaluation (100 steps) are given in "Model Examples/runningexample"
6. An example environment for evaluation is given in the "Code Pipeline/seac MARL replacement files" folder
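
For step 2, the settings look roughly like the following (a sketch only; the variable names and values are assumptions, so mirror the examples already present in the file):

  # Illustrative settings for evaluateForConRoll.py. All names and values
  # here are assumptions for illustration, not the script's actual identifiers.
  path = "Code Pipeline/results/trained_models/<domain>/<run>"  # trained policy
  env_name = "Foraging-8x8-3p-2f-v2"  # gym ID registered by lb-foraging
  time_limit = 25     # maximum steps per episode
  eval_steps = 100    # matches the small-evaluation example above
  n_agents = 3
  n_tasks = 2
  start_state = [(1, 1), (2, 5), (6, 3)]  # illustrative initial agent positions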

Generating an explanation with guided rollout:
1. Add the "abstractToPrism.py", "generateExplanations.py", "generateNotPossExp_3agTotalQ.py", and "policyConRoll3agABATotalQ.py" files to the "seac/seac" folder. If "convertStates.py" is available, add it to the "seac/seac" folder as well.
2. Make sure the files produced from evaluating a policy are in the "seac/seac" folder
3. Add the "hayes_shah" and "quine_mcclusky" folders to the "seac/seac" folder, replacing existing ones if needed
4. Replace the seac files of the same name with those found in the "seac MARL replacement files" folder
5. Set the user query in the main function of "policyConRoll3agABATotalQ.py" (examples in the file; see the sketch after this list)
6. Set the search parameters in the main function of "policyConRoll3agABATotalQ.py" (examples in the file)
7. Define any new rules for low-level to high-level state conversion (see the sketch after this list)
8. Run the file -> "python policyConRoll3agABATotalQ.py"
  *Generates an abstract MMDP through guided rollout and an explanation for any impossible tasks
9. Example log output produced from a run is given in the explanation output folder, along with the generated PRISM model and properties
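
For steps 5-7, the pieces look roughly like this (a sketch only; the identifiers, query encoding, and thresholds are assumptions for illustration, and the examples in the file are authoritative):

  # Illustrative user query, search parameters, and state-conversion rule
  # for policyConRoll3agABATotalQ.py. All identifiers here are assumptions.

  # A temporal query: can the team complete task A and then task B?
  user_query = ["task_A", "task_B"]  # tasks in the required temporal order

  # Guided-rollout search parameters (illustrative values).
  max_rollout_depth = 50
  num_rollouts = 100

  # A low-level -> high-level conversion rule: map raw grid coordinates to
  # the boolean predicates that make up an abstract MMDP state.
  def to_high_level(agent_pos, task_pos, carrying):
      ax, ay = agent_pos
      tx, ty = task_pos
      return {
          "at_task": (ax, ay) == (tx, ty),
          "near_task": abs(ax - tx) + abs(ay - ty) <= 2,  # Manhattan distance
          "carrying": carrying,
      }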
