IRLEED

This directory is supplementary material for our work presented at the Thirty-Ninth AAAI Conference on Artificial Intelligence (AAAI-25):

"Inverse Reinforcement Learning by Estimating Expertise of Demonstrators" Mark Beliaev and Ramtin Pedarsani.

All relevant citations for methods used are found in the paper's list of references. Note that this directory contains the implementation for the Gridworld experiments performed, as well as the code and data used to generate all figures in the report. For additional implementation details, please refer to the Appendix.

I. Requirements

II. Instructions

III. Contents

I. Requirements

We recommend using pacakge manager pip as well as conda to install the relative packages:

conda:

python-3.8.5 python
numpy-1.19.2 numpy

pip:

tqdm
matplotlib
pandas

II. Instructions

You can run the scripts directly using the bash script provided:

bash bash_scripts/run_mix.sh

You can also edit parameters accoridgly by using src/run_mix.py directly. All parameters have been set to the default used in the paper. Afterwards, use the notebook analyze.ipynb to generate the figures.

III. Contents

bash_scripts/ - The bash script used to run the Gridworld experiment. Uses 121 dataset settings, with 100 random initializations for each.

run_mix.py script used for running IRLEED and MaxEnt IRL.

results/ - Where results are stored. Note that both the Gridoworld and Hopper results are added for reference, where the gridworld results can be generated with the corresponding bash script.

analyze.ipynb - Main analysis file used to generate the figures used in the paper.

figures/ - Final storage place of figures.

src/ - Main implementation of MaxEnt IRL as well as IRLEED.

src/irl_maxent contains a basic implementation of maximum entropy IRL that is used for reference. Only the plotting, gridworld, and stochastic gradient ascent implementations are used directly from this folder. The original code was forked from this repository, and the corresponding License has been attached.
src/mix_irl contains the implementation of IRLEED and maximum entroy IRL. Supporting files are included here.

Note that the main IRLEED implementation can be found in the 'irleed' class within the file src/mix_irl/irleed.py. This class is then used within the script src/run_mix.py to perform the experiment. The remaining files support this implementation.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

IRLEED

I. Requirements

II. Instructions

III. Contents

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
bash_scripts		bash_scripts
figures		figures
results		results
src		src
CITATION.cff		CITATION.cff
LICENSE		LICENSE
README.md		README.md
analyze.ipynb		analyze.ipynb
run_mix.py		run_mix.py

Folders and files

Latest commit

History

Repository files navigation

IRLEED

I. Requirements

II. Instructions

III. Contents

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages