Off-Policy Evaluation for Episodic Partially Observable Markov Decision Processes under Non-Parametric Models

Requirements

python >= 3.7.10
numpy
scipy
pandas
sklearn >=0.24.1
torch >= 1.10.2
matplotlib
seaborn
gym

Instructions

We attached the gym package cloned from openAI, we only use the classes defined in gym.
Run simulation.sh to get ResultA.csv and ResultB.csv, we use a cuda device (RTX3090) in the experiment
Run plots.py to generate figures 5 (a) and (b) in simulation study.

Citation

@inproceedings{miao2022off,
  title={Off-Policy Evaluation for Episodic Partially Observable Markov Decision Processes under Non-Parametric Models},
  author={Miao, Rui and Qi, Zhengling and Zhang, Xiaoke},
  booktitle={Advances in Neural Information Processing Systems}
}

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
gym @ dcd1858		gym @ dcd1858
.gitmodules		.gitmodules
ContSimuOffPolicy.py		ContSimuOffPolicy.py
README.md		README.md
agents.py		agents.py
envs.py		envs.py
license		license
plots.py		plots.py
prox_fqe.py		prox_fqe.py
rkhs_torch.py		rkhs_torch.py
simulation.sh		simulation.sh
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Off-Policy Evaluation for Episodic Partially Observable Markov Decision Processes under Non-Parametric Models

Requirements

Instructions

Citation

About

Releases

Packages

Languages

License

rui-miao/ProxOPE

Folders and files

Latest commit

History

Repository files navigation

Off-Policy Evaluation for Episodic Partially Observable Markov Decision Processes under Non-Parametric Models

Requirements

Instructions

Citation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages