PBICRL

Preference-Based Bayesian Inverse Constraint Reinforcement Learning (PBICRL) is a Bayesian approach that infers constraints from demonstrations. The likelihood function is based on a modification of the Bradley-Terry model that allows it to compensate for different margins among the preferences.

Requirements

The code was written in Python 3.8.13
To install the requirements:
pip install -r requirements.txt

Each folder contains the code for the four simulation environments used in the paper. You can run the code by simply running run_experiments.sh. The data files containting the demonstrations can be downloaded using the following link. The data files should be saved in the corresponding data folder for each environment. https://drive.google.com/drive/folders/1YKynJct0_ZeBkZCNKA7L2OGFtMs6v1VW?usp=sharing

If you find this code and paper useful and relevant to your work, please cite the paper as follows:

@article{papadimitriou2024bayesian,
title={Bayesian Constraint Inference from User Demonstrations Based on Margin-Respecting Preference Models},
author={Papadimitriou, Dimitris and Brown, Daniel S},
journal={arXiv preprint arXiv:2403.02431},
year={2024}
}

Name		Name	Last commit message	Last commit date
Latest commit History 65 Commits
Ant		Ant
HalfCheetah		HalfCheetah
fetch_reach		fetch_reach
point_mass		point_mass
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PBICRL

Requirements

Contents

About

Releases

Packages

Languages

gitcal/PBICRL

Folders and files

Latest commit

History

Repository files navigation

PBICRL

Requirements

Contents

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages