Skip to content

ZexuSun/OILCA-NeurIPS23

Repository files navigation

Offline Imitation Learning with Variational Counterfactual Reasoning

This is the code for reproducing the results of the paper Offline Imitation Learning with Variational Counterfactual Reasoning accepted at NeurIPS'2023. The paper can be found here.

Usage

Paper results were collected with Deep Mind Control (and Causal World). Networks are trained using PyTorch 1.4.0 and Python 3.6.

You can also run OILCA on the setting used in DWBC by running main_ivae_model.py and main_ivae_model.py:

python main.py \
    --aux_dim="5" \  
    --epochs="30" \
    --task="cheetah_run" 

After the counterfactual model training, you can run the main.py to train the offline policy with the augmented data with the pretrained policy path:

python main_setting_demodice.py \
    --data_path="../dataset/dm_control_suite/" \  
    --env="cheetah_run" \
    --expert-policy-path="../learned_models/BC_all/bc_model_cheetah_run.pkl" 

Bibtex

@inproceedings{sun2023offline,
  title     = {Offline Imitation Learning with Variational Counterfactual Reasoning},
  author    = {Sun, Zexu and He, Bowei and Liu, Jinxin and Chen, Xu and Ma, Chen and Zhang, Shuai},
  booktitle = {Proceedings of the 37th Conference on Neural Information Processing Systems},
  year      = {2023}
}

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published