Code for PulseRL: Enabling Offline Reinforcement Learning for Digital Marketing Systems via Conservative Q-Learning.
In this repository we provide the code for the PulseRL offline experiments described in the paper. Part of our code is built on top of the CQL repository.
First, install Python 3.x. The remaining dependencies can be installed by executing the following command:
sh dependencies.sh
We use a customized version of Ray that fixes some bugs in the features we rely on. Among other dependencies, the above script builds Ray from the source available here.
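Once the script finishes, you can sanity-check that the custom Ray build is importable (a generic check, not specific to this repository):

python -c "import ray; print(ray.__version__)"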
Download and extract the provided processed dataset into the folder data. The experiments featured in the paper can be executed by running the provided bash script, as follows:
sh train_model.sh -m <MODEL>
Available models are "BC", "DQN", and "CQL". The corresponding model configuration files can be found at models/<MODEL>_model.yaml.
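For example, to train the CQL model from the paper:

sh train_model.sh -m CQL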
If you want to evaluate an already trained model, you can run the following:
sh evaluation.sh -m <MODEL> -d <DATASET_NAME> -v <DATASET_VERSION> -i <TRAIN_ID> -w <NUM_WORKERS>
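For instance, an evaluation of a trained CQL model with four workers might look like the following. The dataset name, version, and train ID below are purely illustrative; use the values from your own training run:

sh evaluation.sh -m CQL -d my_dataset -v 1 -i 0 -w 4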
To train the reward predictor model, you can run:
sh train_reward_pred_model.sh
To view the training and evaluation graphs, just run:
tensorboard --logdir models/
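TensorBoard serves on port 6006 by default; if that port is already in use, you can choose another one with the standard --port flag and open the printed URL in your browser:

tensorboard --logdir models/ --port 6007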
In case of any questions, bugs, suggestions or improvements, please feel free to open an issue.