Skip to content
No description, website, or topics provided.
Python Jupyter Notebook
Branch: master
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Type Name Latest commit message Commit time
Failed to load latest commit information.


This repository contains the code for replicating the experiments from the paper

"Double Reinforcement Learning for Efficient Off-Policy Evaluation in Markov Decision Processes"

Experiments in Section 5.1

The relevant code is in the subdirectory exp5_1.

  • runs the experiment with the in-sample variant of the estimators.
  • runs the experiment with the samples-splitting variant of the estimators.

For example, to run 10 parallel replications, one can run the command seq 10 | xargs -L 1 -P 10 ./

Experiments in Section 5.2

The relevant code is in the subdirectory exp5_2.

You can’t perform that action at this time.