Self-supervised diffusion pretraining for thoracic pathology detection offers insights into confounders
Install the dependencies listed in requirements.txt:
pip install -r requirements.txt
We provide checkpoints for the DiffChest model trained on a joint collection of CXRs from MIMIC-CXR (USA), CheXpert (USA), and PadChest (Spain). In addition, we offer latent statistics and the classifier model finetuned on a high-quality PadChest subset annotated by physicians.
Download the checkpoints and place them in a directory named checkpoints. It should look like this:
checkpoints/
- padchest_autoenc
  - last.ckpt # DiffChest checkpoint
- padchest_autoenc_cls
  - last.ckpt # Finetuned logistic regression classifier
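As a quick sanity check after downloading, the .ckpt files can be inspected with PyTorch. This is a hedged sketch: it assumes the files are standard PyTorch Lightning checkpoints, which store the model weights under the "state_dict" key; the exact model class and key names depend on this codebase. A dummy checkpoint is saved first so the example is self-contained.

```python
# Sketch: inspecting a Lightning-style checkpoint with PyTorch.
# Assumption: last.ckpt keeps weights under the "state_dict" key.
import torch

# Save a dummy checkpoint so the example runs without the real download.
torch.save({"state_dict": {"encoder.weight": torch.zeros(4, 4)}}, "last.ckpt")

ckpt = torch.load("last.ckpt", map_location="cpu")
state_dict = ckpt["state_dict"]
print(sorted(state_dict.keys()))  # list the stored parameter names
```

With the real checkpoint, point torch.load at checkpoints/padchest_autoenc/last.ckpt instead.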
Please register for and download the following publicly available CXR datasets: MIMIC-CXR, CheXpert, and PadChest.
After downloading and preprocessing the images, you need to create an LMDB dataset for model training.
We provide scripts for training and evaluating DiffChest on the following datasets: MIMIC-CXR, CheXpert, and PadChest.
Note: Most experiments require at least 3x RTX 3090 GPUs for training the DPM models, and 1x RTX 2080 Ti for training the accompanying classification head.
PadChest: 256
We only trained DiffChest due to the high computation cost; training requires 3x 3090s.
python run_padchest.py
After the previous stage, a classifier (for manipulation) can be trained using:
python run_padchest_cls.py
We provide a testing script for evaluating the classification performance of DiffChest:
python test_padchest_cls.py
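Multi-label thoracic pathology classification is typically evaluated with per-class AUROC. The following is an illustrative sketch with toy labels and scores (the class names and data are made up); test_padchest_cls.py may report additional or different metrics.

```python
# Hypothetical sketch: per-pathology AUROC for a multi-label classifier.
# Labels, scores, and class names are toy data for illustration only.
import numpy as np
from sklearn.metrics import roc_auc_score

labels = np.array([[1, 0], [0, 1], [1, 1], [0, 0]])               # ground truth
scores = np.array([[0.9, 0.2], [0.5, 0.8], [0.4, 0.6], [0.1, 0.7]])  # predictions

for c, name in enumerate(["cardiomegaly", "effusion"]):  # example classes
    auc = roc_auc_score(labels[:, c], scores[:, c])
    print(f"{name}: AUROC = {auc:.3f}")
```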
To generate visual explanations described in our paper, run:
python manipulate.py
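In diffae-style models, manipulation works by shifting an image's semantic latent along the weight direction of the finetuned linear classifier and decoding the result. The sketch below illustrates only the latent-shift step with a toy linear classifier (all values random); the real manipulate.py additionally decodes the shifted latent back into an image.

```python
# Hypothetical sketch of diffae-style latent manipulation: push the
# semantic latent z along the normalized weight vector w of a linear
# classifier to increase the predicted pathology probability.
import numpy as np

rng = np.random.default_rng(0)
z = rng.normal(size=512)   # toy semantic latent of an encoded CXR
w = rng.normal(size=512)   # toy weights of the finetuned classifier
b = 0.0

def prob(latent):
    # Sigmoid output of the linear classifier.
    return 1.0 / (1.0 + np.exp(-(w @ latent + b)))

direction = w / np.linalg.norm(w)
z_shifted = z + 0.3 * np.linalg.norm(z) * direction  # step toward the class

print(prob(z), "->", prob(z_shifted))
```

Because the step is taken along +w, the classifier's logit (and hence the predicted probability) strictly increases.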
If you use DiffChest, please cite our paper.
Please open a new issue thread describing the problem with the codebase, or report issues directly to than@ukaachen.de.
The source code is licensed under the MIT license, which you can find in the LICENSE file.
This repository builds on the official implementation of the diffae model: https://github.com/phizaz/diffae