A fork of the ValueDICE code that supports discrete action spaces and PyBullet environments, and is truly off-policy.
To install dependencies, run:
conda env create -f environment.yml
To download the expert data, clone it into the parent directory of this repository:
cd ..
git clone https://github.com/gkswamy98/pillbox.git
To train a learner, run the following, replacing "env_name" with the name of the environment to train on:
./value_dice/run_experiments.sh "env_name"
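To train on several environments in a row, the command above can be wrapped in a small shell function. This is only a sketch: the function name run_all and the DRY_RUN variable are hypothetical helpers that are not part of this repository, and the environment names in the usage note are placeholders.

```shell
# Sketch: call the training script once per environment name given as an argument.
# run_all and DRY_RUN are illustrative names (assumptions), not part of this repo.
run_all() {
  for env_name in "$@"; do
    if [ "${DRY_RUN:-1}" = "1" ]; then
      # Default: only print the command that would be run.
      echo "./value_dice/run_experiments.sh \"$env_name\""
    else
      # Run the training script; a failed run does not stop the loop.
      ./value_dice/run_experiments.sh "$env_name"
    fi
  done
}
```

For example, run_all EnvA-v0 EnvB-v0 prints the two commands without running them. Set DRY_RUN=0 to launch the runs, and replace the placeholder names with environments the script actually supports.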