rui-miao/ProxOPE

Off-Policy Evaluation for Episodic Partially Observable Markov Decision Processes under Non-Parametric Models

Requirements

python >= 3.7.10
numpy
scipy
pandas
scikit-learn >= 0.24.1
torch >= 1.10.2
matplotlib
seaborn
gym
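
Before running the experiments, it may help to confirm that the packages listed above can be imported. The following is a minimal sketch and not part of the repository: the helper name `find_missing` is hypothetical, and it only checks that each module can be located, not that its version satisfies the bounds listed above. Note that scikit-learn is imported under the name `sklearn`, and that `gym` should be found when the script is run from a directory containing the bundled copy.

```python
import importlib.util

# Import names for the packages listed above
# (scikit-learn is imported as "sklearn").
REQUIRED_MODULES = ["numpy", "scipy", "pandas", "sklearn", "torch",
                    "matplotlib", "seaborn", "gym"]


def find_missing(modules):
    """Return the module names that cannot be located in this environment."""
    return [name for name in modules
            if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = find_missing(REQUIRED_MODULES)
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All required modules are importable.")
```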

Instructions

  • A copy of the gym package, cloned from OpenAI, is included in this repository; only the classes defined in gym are used.
  • Run simulation.sh to produce ResultA.csv and ResultB.csv. We used a CUDA device (RTX 3090) in our experiments.
  • Run plots.py to generate Figures 5(a) and 5(b) of the simulation study.
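
The two run steps above could be chained in a small driver script. This is a sketch under stated assumptions, not something shipped with the repository: it supposes that simulation.sh and plots.py sit in the same directory, that simulation.sh can be launched with bash, and that the CSV files are written to that directory. The helper names `missing_outputs` and `run_pipeline` are hypothetical.

```python
import subprocess
import sys
from pathlib import Path

# Result files named in the instructions above.
EXPECTED_OUTPUTS = ("ResultA.csv", "ResultB.csv")


def missing_outputs(directory, names=EXPECTED_OUTPUTS):
    """Return the expected result files that are not present in `directory`."""
    return [n for n in names if not (Path(directory) / n).is_file()]


def run_pipeline(directory="."):
    """Run the simulation, verify its outputs, then generate the figures."""
    # Assumption: simulation.sh is a bash script located in `directory`.
    subprocess.run(["bash", "simulation.sh"], cwd=directory, check=True)

    missing = missing_outputs(directory)
    if missing:
        raise FileNotFoundError(
            "simulation.sh did not produce: " + ", ".join(missing))

    # Assumption: plots.py reads the CSV files from its working directory.
    subprocess.run([sys.executable, "plots.py"], cwd=directory, check=True)
```

Calling `run_pipeline()` from the repository root would then run both steps in order and stop early if either result file is absent.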

Citation

@inproceedings{miao2022off,
  title={Off-Policy Evaluation for Episodic Partially Observable Markov Decision Processes under Non-Parametric Models},
  author={Miao, Rui and Qi, Zhengling and Zhang, Xiaoke},
  booktitle={Advances in Neural Information Processing Systems},
  year={2022}
}
