OPIRL: Sample Efficient Off-Policy Inverse Reinforcement Learning via Distribution Matching

Official implementation for OPIRL: Sample Efficient Off-Policy Inverse Reinforcement Learning via Distribution Matching.
Accepted to IEEE International Conference on Robotics and Automation (ICRA) 2022.

Installation

Run the following command to install all Python dependencies:

$ pip install -e .
$ pip install -r requirements.txt

Other dependencies:

Python 3.8+
TensorFlow 2.4+
CUDA=11.0
cuDNN=8.0
Experts/reward functions are provided on Google Drive

Run Experiments

First, unzip the expert/reward files from Google Drive.
Then, to simply run experiments on MuJoCo tasks, run the bash scripts in /scripts directory.
E.g.

$ sh ./scripts/run_halfcheetah.sh

Others

OPOLO: code
f-IRL: code

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
scripts		scripts
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
run_opirl.py		run_opirl.py
run_transfer_opirl.py		run_transfer_opirl.py
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

scripts

scripts

src

src

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

requirements.txt

requirements.txt

run_opirl.py

run_opirl.py

run_transfer_opirl.py

run_transfer_opirl.py

setup.py

setup.py

Repository files navigation

OPIRL: Sample Efficient Off-Policy Inverse Reinforcement Learning via Distribution Matching

Installation

Run Experiments

Others

About

Releases

Packages

Languages

License

sff1019/opirl

Folders and files

Latest commit

History

Repository files navigation

OPIRL: Sample Efficient Off-Policy Inverse Reinforcement Learning via Distribution Matching

Installation

Run Experiments

Others

About

Resources

License

Stars

Watchers

Forks

Languages