misoc-mml/hyperparam-sensitivity

Hyperparameter Tuning and Model Evaluation in Causal Effect Estimation

Investigating the interplay between causal estimators, ML base learners, hyperparameters, and model evaluation metrics.

Paper

This code accompanies the paper:

D. Machlanski, S. Samothrakis, and P. Clarke, ‘Hyperparameter Tuning and Model Evaluation in Causal Effect Estimation’. arXiv, Mar. 02, 2023. doi: 10.48550/arXiv.2303.01412.

Data and Results

All datasets and results (in progress) are available here.

Replicating the paper

Follow the steps below. A condensed shell sketch of the full sequence is given after the list.

  1. Download the datasets from here and put them in the 'datasets' folder.
  2. Prepare the Python environment.
    1. Install Miniconda.
    2. If you intend to run neural networks, run conda env create -f environment_tf.yml.
    3. Otherwise, use the default environment: conda env create -f environment.yml.
  3. Go to the 'scripts' folder and run bash paper.sh. This runs ALL the experiments.
  4. Go to the 'analysis' folder.
  5. If you want the results as LaTeX tables:
    1. Go to utils.py and set RESULTS = 'latex'.
    2. Run python compare_save_latex.py.
    3. You can then use metrics_meta_latex.ipynb, correlations_meta_latex.ipynb, and test_correlations.ipynb.
  6. If you want the results visualised as plots:
    1. Use plot_estimators.ipynb and plot_hyperparams.ipynb.
    2. To use plot_metrics.ipynb, a few extra steps are needed:
    3. Go to utils.py and set RESULTS = 'mean'.
    4. Run python compare_save_mean.py.
    5. You can then use plot_metrics.ipynb.
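
For convenience, the steps above condense into the shell session below. This is a minimal sketch, assuming a Unix shell with Miniconda on the PATH; the name of the conda environment created by the YAML is not stated in this README, so the activation step uses a placeholder (check the 'name:' field of environment.yml or environment_tf.yml for the actual value).

    # Assumes the datasets from step 1 are already in the 'datasets' folder.

    # Create and activate the environment (use environment_tf.yml for neural networks).
    conda env create -f environment.yml
    conda activate <env-name-from-yaml>   # placeholder; see the YAML's 'name:' field

    # Run ALL experiments (may take weeks, likely months).
    cd scripts
    bash paper.sh

    # Post-process results. For LaTeX tables, edit analysis/utils.py so that
    # RESULTS = 'latex', then:
    cd ../analysis
    python compare_save_latex.py

    # For plot_metrics.ipynb, set RESULTS = 'mean' in utils.py instead and run:
    python compare_save_mean.py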

Note that running all experiments (step 3) may take a LONG time (weeks, likely months). A highly parallelised computing environment is recommended.

You can skip step 3 by downloading our results from here.

You can also skip the compare_save_xxx.py scripts, as the most important CSV files produced for the paper are included in this repository.

Project organisation

The following overview covers only the most important files and directories needed to replicate the paper.

├── environment.yml                     <- Replicate the environment to run all the scripts.
├── environment_tf.yml                  <- As above but with Tensorflow (required to run neural networks).
│
├── analysis
│   ├── compare_save_xxx.py             <- Post-processes 'results' into CSV files.
│   ├── tables                          <- CSVs from above are stored here.
│   ├── utils.py                        <- Important functions used by the compare_save_xxx.py scripts.
│   ├── plot_estimators.ipynb           <- Visualise performance of CATE estimators.
│   ├── plot_hyperparams.ipynb          <- Visualise performance against types of hyperparameters.
│   ├── plot_metrics.ipynb              <- Visualise performance of metrics.
│   ├── test_correlations.ipynb         <- Compute correlations between test metrics (e.g., ATE and PEHE).
│   ├── correlations_meta_latex.ipynb   <- Compute correlations between validation and test metrics (e.g., MSE and PEHE).
│   └── metrics_meta_latex.ipynb        <- Compute all metrics (latex format).
│
├── datasets                            <- All four datasets go here (IHDP, Jobs, Twins and News).
│
├── helpers                             <- General helper functions.
│
├── models
│   ├── data                            <- Models for datasets.
│   ├── estimators                      <- Implementations of CATE estimators.
│   ├── estimators_tf                   <- Code for Neural Networks (Tensorflow).
│   └── scorers                         <- Implementations of learning-based metrics.
│
├── results
│   ├── metrics                         <- Conventional, non-learning metrics (MSE, R^2, PEHE, etc.).
│   ├── predictions                     <- Predicted outcomes and CATEs.
│   ├── scorers                         <- Predictions of scorers (plugin, matching and rscore).
│   └── scores                          <- Actual scores (combines 'predictions' and 'scorers').
│
└── scripts
    └── paper.sh                        <- Replicate all experiments from the paper.
