Official Implementation of the paper "Equal Long-term Benefit Rate: Adapting Static Fairness Notions to Sequential Decision Making" by Yuancheng Xu, Chenghao Deng, Yanchao Sun, Ruijie Zheng, Xiyao Wang, Jieyu Zhao and Furong Huang.

umd-huang-lab/ELBERT

Equal Long-term Benefit Rate (ELBERT)

The codebase for the paper "Equal Long-term Benefit Rate: Adapting Static Fairness Notions to Sequential Decision Making" (https://arxiv.org/abs/2309.03426) by Yuancheng Xu*, Chenghao Deng*, Yanchao Sun, Ruijie Zheng, Xiyao Wang, Jieyu Zhao, Furong Huang.

This repository provides the implementation of ELBERT Policy Optimization (ELBERT-PO).

Environment set-up

First, install Anaconda to set up a virtual environment. Then run:

conda env create -f elbert.yaml
conda activate elbert
pip install -r requirements.txt
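
Before launching any experiment, it can help to confirm that the environment is actually active. The snippet below is a minimal sketch, not part of the repository: the helper name env_is_active is hypothetical, and it relies only on the CONDA_DEFAULT_ENV variable that conda sets when an environment is activated.

```shell
# Hypothetical helper (not part of the repository).
# Returns 0 when the named conda environment is the active one.
env_is_active() {
  [ "${CONDA_DEFAULT_ENV:-}" = "$1" ]
}

if env_is_active elbert; then
  echo "elbert environment is active"
else
  echo "elbert environment is not active; run: conda activate elbert"
fi
```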

Running ELBERT and other baselines

The scripts/ folder includes bash scripts for ELBERT and the baselines (G-PPO, R-PPO, A-PPO) in five environments:

Lending

  • ELBERT
bash scripts/lending_elbert.sh
  • Baseline (G-PPO, R-PPO, A-PPO)
bash scripts/lending_original.sh

Infectious control, original version

  • ELBERT
bash scripts/infectious_original_env_elbert.sh
  • Baseline (G-PPO, R-PPO, A-PPO)
bash scripts/infectious_original_env_original.sh

Infectious control, harder version

  • ELBERT
bash scripts/infectious_harder_env_elbert.sh
  • Baseline (G-PPO, R-PPO, A-PPO)
bash scripts/infectious_harder_env_original.sh

Attention allocation, original version

  • ELBERT
bash scripts/attention_original_env_elbert.sh
  • Baseline (G-PPO, R-PPO, A-PPO)
bash scripts/attention_original_env_original.sh

Attention allocation, harder version

  • ELBERT
bash scripts/attention_harder_env_elbert.sh
  • Baseline (G-PPO, R-PPO, A-PPO)
bash scripts/attention_harder_env_original.sh
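
The script names above follow a regular pattern: an environment prefix followed by elbert (for ELBERT) or original (for the baselines). As a convenience, the following sketch builds those paths and runs all five scripts for one method in sequence. The helper names (script_path, list_scripts, run_all) are hypothetical and not part of the repository, and running the scripts one after another is an assumption about how you want to schedule them.

```shell
# Script-name prefixes taken from the commands listed above.
ENV_PREFIXES="lending infectious_original_env infectious_harder_env attention_original_env attention_harder_env"

# Build the script path for an environment prefix and a method suffix
# ("elbert" for ELBERT, "original" for the baselines).
script_path() {
  echo "scripts/${1}_${2}.sh"
}

# Print the five script paths for one method, one per line.
list_scripts() {
  for prefix in $ENV_PREFIXES; do
    script_path "$prefix" "$1"
  done
}

# Run every script for one method in sequence, skipping any that are missing.
run_all() {
  list_scripts "$1" | while read -r script; do
    if [ -f "$script" ]; then
      echo "Running $script"
      # Redirect stdin so the script cannot consume the list being read.
      bash "$script" </dev/null || echo "Failed: $script"
    else
      echo "Skipping missing script: $script"
    fi
  done
}
```

From the repository root, `run_all elbert` would run the five ELBERT scripts and `run_all original` the five baseline scripts.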

Comments

Our codebase is based on the following repo:

Thanks for their efforts to make the code public!
