This repository provides the official PyTorch implementation of the paper:
**A Generative Framework for Causal Estimation via Importance-Weighted Diffusion Distillation**
Xinran Song, Tianyu Chen, Mingyuan Zhou
[arXiv:2505.11444](https://arxiv.org/abs/2505.11444)
IWDD (Importance-Weighted Diffusion Distillation) is a generative framework for causal estimation that combines diffusion model pretraining with importance-weighted score distillation. It enables accurate estimation of potential outcomes and treatment effects with reduced gradient variance and without explicit computation of inverse probability weights.
IWDD achieves state-of-the-art estimation accuracy on standard causal-inference benchmarks, including IHDP, ACIC 2016, and ACIC 2018.
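As a rough illustration of the importance-weighted distillation idea, the sketch below reweights a per-sample score-matching loss between a frozen teacher and a student. This is a minimal toy example, not the actual IWDD objective (which avoids explicit inverse probability weights); all tensor shapes and the weighting scheme are assumptions made for illustration.

```python
# Toy sketch only: NOT the IWDD objective from the paper.
# It shows the generic shape of an importance-weighted distillation loss.
import torch

def weighted_distillation_loss(teacher_score, student_score, weights):
    # Per-sample squared error between teacher and student score estimates,
    # reweighted by detached importance weights (weights carry no gradient).
    per_sample = ((teacher_score - student_score) ** 2).mean(dim=-1)
    return (weights.detach() * per_sample).mean()

# Hypothetical shapes: batch of 8 samples, 16-dimensional score vectors.
teacher = torch.randn(8, 16)                      # frozen teacher scores
student = torch.randn(8, 16, requires_grad=True)  # student scores
weights = torch.rand(8)                           # stand-in importance weights
loss = weighted_distillation_loss(teacher, student, weights)
loss.backward()
```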
Requirements:

- Python ≥ 3.9
- PyTorch ≥ 1.13
- CUDA-compatible GPU
Clone the repository and navigate to the project directory:
```bash
git clone https://github.com/XinranSong/IWDD.git
cd IWDD
```

Create a virtual environment and activate it:

```bash
conda create -n iwdd python=3.9
conda activate iwdd
```

Install the required dependencies:

```bash
pip install -r requirements.txt
```

The preprocessing procedure for the datasets follows the same pipeline as DiffPO. Once the original ACIC 2018, ACIC 2016, and IHDP datasets are downloaded, run the corresponding preprocessing notebook (e.g., load_ihdp.ipynb) to generate the causal masks and normalized data. The processed files will be saved under:
```
data_ihdp/
├── ihdp_norm_data/
└── ihdp_mask/
```
The preprocessing scripts for ACIC 2018 and ACIC 2016 follow the same structure.
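Once preprocessing has finished, the resulting arrays can be inspected directly. Below is a minimal sketch assuming NumPy `.npy` outputs; the exact file names inside `data_ihdp/` are hypothetical and may differ from what the notebook produces:

```python
# Sketch for inspecting preprocessed IHDP outputs.
# The file names below are hypothetical placeholders.
import numpy as np

data = np.load("data_ihdp/ihdp_norm_data/ihdp_norm_1.npy")  # normalized data
mask = np.load("data_ihdp/ihdp_mask/ihdp_mask_1.npy")       # causal mask
print(data.shape, mask.shape)
```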
You can reproduce IWDD results on a specific ACIC 2018 dataset using the provided configuration file:
```bash
CUDA_VISIBLE_DEVICES=1 python exe_acic.py \
    --config acic2018.yaml \
    --current_id "9333a461d3944d089ef60cdf3b88fd40" \
    --pretrain 1 \
    --train_sid 1
```

For large-scale experiments across multiple ACIC 2018 datasets, use the provided shell script `script_acic2018.sh`. Run the script with:

```bash
bash script_acic2018.sh
```
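If you prefer to drive the sweep from Python rather than the shell script, a minimal sketch is shown below; it simply calls `exe_acic.py` once per dataset id via `subprocess`. The list of ids is a placeholder to fill in yourself, and `script_acic2018.sh` remains the supported way to run the sweep.

```python
# Hypothetical Python equivalent of sweeping over ACIC 2018 dataset ids.
import subprocess

dataset_ids = ["9333a461d3944d089ef60cdf3b88fd40"]  # placeholder: add your own ids
for dataset_id in dataset_ids:
    subprocess.run(
        ["python", "exe_acic.py",
         "--config", "acic2018.yaml",
         "--current_id", dataset_id,
         "--pretrain", "1",
         "--train_sid", "1"],
        check=True,  # stop the sweep if any run fails
    )
```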
If you find this work useful, please cite:

```bibtex
@misc{song2025generativeframeworkcausalestimation,
      title={A Generative Framework for Causal Estimation via Importance-Weighted Diffusion Distillation},
      author={Xinran Song and Tianyu Chen and Mingyuan Zhou},
      year={2025},
      eprint={2505.11444},
      archivePrefix={arXiv},
      primaryClass={cs.LG},
      url={https://arxiv.org/abs/2505.11444},
}
```

This implementation builds upon SiD (Score identity Distillation) for diffusion distillation and the DiffPO pipeline for data preprocessing.