DeepACE

Introduction

This repository contains the code for our paper "Estimating average causal effects from patient trajectories".

Requirements

The project is built with Python 3.9.7 and uses the following packages:

[PyTorch 1.10.0, PyTorch Lightning 1.5.1] - deep learning models
[Optuna 2.10.0] - hyperparameter tuning
[EconML 0.12.0] - static baselines
Other: pandas 1.3.4, NumPy 1.21.5, scikit-learn 1.0.1

Some baseline models are implemented as R scripts. To run them, every package imported at the beginning of the script (via "library()") must be installed.
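The Python versions listed above can be compared against the local environment before running anything. The following is a minimal sketch, not part of the repository: the function name `find_mismatches` is hypothetical, and the PyPI distribution names (e.g. `torch`, `pytorch-lightning`) are assumptions about how the packages were installed.

```python
from importlib import metadata

# Versions as listed in this README; the distribution names are assumptions.
PINNED = {
    "torch": "1.10.0",
    "pytorch-lightning": "1.5.1",
    "optuna": "2.10.0",
    "econml": "0.12.0",
    "pandas": "1.3.4",
    "numpy": "1.21.5",
    "scikit-learn": "1.0.1",
}


def find_mismatches(pinned, get_version=metadata.version):
    """Return {package: (expected, installed_or_None)} for every mismatch."""
    mismatches = {}
    for name, expected in pinned.items():
        try:
            installed = get_version(name)
        except metadata.PackageNotFoundError:
            installed = None  # the package is not installed at all
        # Drop local build tags such as "+cu113" before comparing.
        if installed is None or installed.split("+")[0] != expected:
            mismatches[name] = (expected, installed)
    return mismatches


if __name__ == "__main__":
    for name, (expected, installed) in find_mismatches(PINNED).items():
        print(f"{name}: expected {expected}, found {installed}")
```

An empty result means the environment matches the listed versions. Other versions may still work, but have not been confirmed here.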

Datasets

In our paper, we used three datasets: synthetic, semi-synthetic, and real-world data.

Synthetic data

The script for synthetic data generation is datasets/sim.py. It simulates the data according to Sec. 5.1 of the paper.

Semi-synthetic data

We use MIMIC-III, for which access must be requested at https://physionet.org/content/mimiciii/1.4/. Once access is granted, the pre-processed data by Wang et al. (2020) can be obtained by following the instructions in the respective paper. The pre-processed file needs to be placed in datasets/mimic and named all_hourly_data.h5. The script datasets/mimic/mimic.py extracts covariates and generates synthetic treatments and outcomes.
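Because the MIMIC-III file has to be placed manually, it can help to verify its location before running datasets/mimic/mimic.py. The following is a minimal sketch under that assumption. The helper `mimic_file` is hypothetical and not part of the repository; only the folder and file name are taken from the text above.

```python
from pathlib import Path

MIMIC_FILENAME = "all_hourly_data.h5"  # file name required by this README


def mimic_file(repo_root="."):
    """Return the expected path of the pre-processed MIMIC-III file.

    Raises FileNotFoundError if the file has not been placed there yet.
    """
    path = Path(repo_root) / "datasets" / "mimic" / MIMIC_FILENAME
    if not path.is_file():
        raise FileNotFoundError(
            f"Expected the pre-processed MIMIC-III file at {path}. "
            "Request access on PhysioNet and follow Wang et al. (2020) to obtain it."
        )
    return path
```

Calling `mimic_file()` from the repository root either returns the path or fails early with a message that points to the missing step.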

Real-world data

We use the pre-processed data from the clinical study on low back pain patients by Nielsen et al. (2017). The data is available in the folder datasets/backpain/data_preprocessed.

Reproducing the experiments

The scripts running the experiments are contained in the /experiments folder. There are three Python scripts, one for each dataset (synthetic = sim, semi-synthetic = mimic, and real-world = backpain). For the synthetic and semi-synthetic experiments, one needs to specify a configuration file in the main running procedure of the respective script before running it. The configuration file specifies the models used to obtain results. The following configurations are possible:

config_deepace: DeepACE without targeting
config_deepace_tar: DeepACE with targeting
config_ltmle_super: LTMLE with super learner
config_other: other longitudinal baselines
config_gnet: G-Net
config_static: Static baselines

The corresponding .yaml configuration files can be found in /experiments/config/. Here, the "treat" parameter denotes the treatment configuration (setting) and takes values in {1,2,3}.
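The configuration names and the allowed "treat" values listed above can be checked before an experiment script is started. The sketch below is illustrative only: the helpers `config_file` and `check_treat` are hypothetical, and both the directory `experiments/config` and the assumption that each file is named `<configuration name>.yaml` are guesses that should be compared with the actual folder contents.

```python
from pathlib import Path

# Configuration names listed in this README.
CONFIG_NAMES = {
    "config_deepace",
    "config_deepace_tar",
    "config_ltmle_super",
    "config_other",
    "config_gnet",
    "config_static",
}
TREAT_SETTINGS = {1, 2, 3}  # allowed values of the "treat" parameter


def config_file(name, config_dir="experiments/config"):
    """Return the assumed path of a configuration file, rejecting unknown names."""
    if name not in CONFIG_NAMES:
        raise ValueError(f"Unknown configuration {name!r}; choose from {sorted(CONFIG_NAMES)}")
    return Path(config_dir) / f"{name}.yaml"


def check_treat(treat):
    """Return treat unchanged if it is a valid treatment setting."""
    if treat not in TREAT_SETTINGS:
        raise ValueError(f"treat must be one of {sorted(TREAT_SETTINGS)}, got {treat!r}")
    return treat
```

For example, `config_file("config_deepace_tar")` yields the assumed path for DeepACE with targeting, and `check_treat(4)` raises a `ValueError`.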

Reproducing hyperparameter tuning

The hyperparameters for the models trained from the /experiments folder are stored under /hyperparam/parameters. To reproduce hyperparameter tuning, run hyperparam/hyperparameter.py (synthetic + semi-synthetic data) or hyperparam/hyperparameter_backpain.py (real-world data). Again, the correct configuration files, which indicate the models and settings, need to be specified.
