A critical look at the consistency of causal estimation using deep latent variable models

The project consists of notebooks that run the experiments and plot the results, and Python files that contain most of the actual code. Most of the experiments use the CEVAE definition in CEVAE.py, which is written to be flexible. The MNIST experiment uses a different PyTorch model, defined in imageCEVAE.py.

The seven experiments are in the following notebooks, listed in the order in which they appear in the paper:

- running_lineargaussian_data.ipynb
- running_binary_data.ipynb
- running_irrelevantnoise_data.ipynb
- running_copyproxy_data.ipynb
- running_IHDP_data.ipynb
- running_MNIST_data.ipynb
- running_twins_data.ipynb
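If you prefer to run the notebooks from a script rather than opening them one by one, the following is a minimal sketch, not part of the repository. It assumes Jupyter is installed and that the notebooks sit in the repository root under the file names used in the repository (note the lowercase `running_twins_data.ipynb`). The helper names `nbconvert_command` and `run_notebooks` are hypothetical. By default it only prints the commands it would run.

```python
import subprocess
from pathlib import Path

# Notebook file names, in the order of the experiments in the paper.
NOTEBOOKS = [
    "running_lineargaussian_data.ipynb",
    "running_binary_data.ipynb",
    "running_irrelevantnoise_data.ipynb",
    "running_copyproxy_data.ipynb",
    "running_IHDP_data.ipynb",
    "running_MNIST_data.ipynb",
    "running_twins_data.ipynb",
]


def nbconvert_command(notebook):
    """Build the command that executes one notebook and saves it in place."""
    return ["jupyter", "nbconvert", "--to", "notebook",
            "--execute", "--inplace", notebook]


def run_notebooks(repo_root=".", dry_run=True):
    """Collect the commands for notebooks that exist; run them unless dry_run."""
    commands = []
    for name in NOTEBOOKS:
        path = Path(repo_root) / name
        if not path.is_file():
            continue  # skip notebooks that are missing from this checkout
        cmd = nbconvert_command(str(path))
        commands.append(cmd)
        if not dry_run:
            subprocess.run(cmd, check=True)
    return commands


# Dry run: print the commands without executing anything.
for cmd in run_notebooks(dry_run=True):
    print(" ".join(cmd))
```

Call `run_notebooks(dry_run=False)` to actually execute them. Some experiments may take a long time, and the IHDP notebook needs an extra data file first.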

The Python file cevaetools.py contains most of the code relevant to the experiments, and imagedatatools.py plays the same role for the MNIST data. The other Python files are referenced in specific parts of the code. In particular, binarytoydata.py, lineartoydata.py, and imagedata.py contain code for generating the different data sets, while datagenVAE.py and GANmodel.py contain the models used to generate new data for the IHDP and MNIST experiments, respectively.

We didn't include the actual data or most of the trained models, as they take up a lot of space. However, the folders GANmodels and datageneratormodels contain pretrained models for generating more MNIST and IHDP data, respectively. The results of the experiments are saved in the data/ folder, which also contains the data-generating parameters used in the experiments where they are not written out in the notebooks.

Note that the IHDP experiment won't run as is. You will need to add the file ihdp_npci_1-100.train.npz from https://www.fredjo.com/ to the folder data/IHDP.
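Before starting the IHDP notebook, you can confirm that the file is where it should be. This is a minimal sketch, not part of the repository: the helper name `find_ihdp_file` is hypothetical, and it assumes you run it from the repository root. It only checks that the file exists and does not inspect its contents.

```python
from pathlib import Path

# File name and folder as given in the note above.
IHDP_FILE = "ihdp_npci_1-100.train.npz"


def find_ihdp_file(repo_root="."):
    """Return the path to the IHDP data file, or raise if it is missing."""
    path = Path(repo_root) / "data" / "IHDP" / IHDP_FILE
    if not path.is_file():
        raise FileNotFoundError(
            f"Missing {path}: download {IHDP_FILE} from "
            f"https://www.fredjo.com/ and place it in {path.parent}"
        )
    return path


try:
    print("Found IHDP data at", find_ihdp_file())
except FileNotFoundError as err:
    print(err)
```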


severi-rissanen/critical_look_causal_dlvms
