Diffusion Models for Inpainting

This repository contains the code for all explored and extended methods used in the paper "Denoising Diffusion Models for 3D Healthy Brain Tissue Inpainting", available at https://arxiv.org/abs/2403.14499.

General overview of the training and sampling procedure:

Evaluated methods:

The following list provides details of the different modified methods we evaluate:

  • DDPM 2D slice-wise: A baseline method as presented in [1], with the input defined as X_t = (x_t, b, m), where x_t, b, and m are 2D slices. During training and sampling, only slices with a non-zero mask are considered. Finally, the sampled slices are stacked into a 3D volume. A minimal sketch of this input construction follows the list.
  • DDPM 2D seq-pos: The above baseline method, extended by conditioning on the previous slice and a positional embedding. The input is defined as X_t = (x_t, b, m, x_prev), with x_t, b, and m being 2D slices and x_prev being the previous ground-truth 2D slice (without noise) during training, or the previously sampled slice during sampling. In addition, we use a positional embedding of the slice index. We sample the non-zero mask slices slice by slice, conditioning each slice on the previous one by concatenation, and stack the samples into a 3D volume in the end.
  • DDPM Pseudo3D: A pseudo-3D method as described in [2], modified for inpainting. Pseudo-3D convolutions consist of 2D convolutional layers followed by 1D convolutions along the z-axis (see the pseudo-3D convolution sketch after this list). The input is defined as X_t = (x_t, b, m), with x_t, b, and m being stacks of 2D slices. In contrast to [2], we apply the model in the image space and use the pseudo-3D convolutions directly, without the fine-tuning strategy proposed in [2].
  • DDPM 3D mem-eff: A memory-efficient 3D diffusion model as presented in [3]. The input is defined as X_t = (x_t, b, m). We chose the memory-efficient architecture of [3] for the 3D model in the image space because it allowed using two residual blocks per scale, which was not possible when we simply replaced the 2D convolutions in the baseline model with 3D convolutions.
  • LDM 3D: A 3D latent diffusion model as presented in [4]. The input is defined as X_t = (x_lat,t, b_lat, m_lat), with x_lat,t, b_lat, and m_lat being the latent representations of x_GT, b, and m. These latent representations are obtained through an autoencoder (AE) following a VQ-GAN implementation. The diffusion model in the latent space is less memory-intensive than in the image space. However, at the initial image resolution, the AE required to obtain the latent representations exceeded the GPU memory available to us for this experiment (40 GB). Therefore, the input volume x_GT had to be downsampled.
  • WDM 3D: A 3D wavelet diffusion model as presented in [5]. The input is defined as X_t = (x_wav,t, b_wav, m_wav), with x_wav,t, b_wav, and m_wav being the concatenated wavelet coefficients of x_GT, b, and m. An inverse wavelet transform is applied to reconstruct the images from the predicted x_wav,0 (see the wavelet sketch after this list).
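To make the shared input convention concrete, here is a minimal sketch of how the concatenated input X_t = (x_t, b, m) could be built during training, assuming a standard DDPM forward process. The names make_model_input and alphas_cumprod are illustrative and not taken from this repository.

```python
import torch

def make_model_input(x0, b, m, t, alphas_cumprod):
    """Build X_t = (x_t, b, m) for one training step (hypothetical helper).

    x0: ground-truth image, b: voided image, m: binary mask (same shape as x0).
    t: integer timesteps of shape (B,), alphas_cumprod: precomputed DDPM schedule.
    """
    noise = torch.randn_like(x0)
    # broadcast the per-sample schedule value over all spatial dimensions
    a_bar = alphas_cumprod[t].view(-1, *([1] * (x0.dim() - 1)))
    # standard DDPM forward process: noise the ground truth up to step t
    x_t = a_bar.sqrt() * x0 + (1.0 - a_bar).sqrt() * noise
    # channel-wise concatenation yields the conditioned model input X_t
    return torch.cat([x_t, b, m], dim=1), noise
```

The model then predicts the added noise from X_t; for the seq-pos variant, x_prev would simply be concatenated as an additional channel.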
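The pseudo-3D building block described above can be sketched as follows. PseudoConv3d is a hypothetical module illustrating the idea (a 2D convolution per slice, then a 1D convolution along z), not the exact layer used in this repository.

```python
import torch
import torch.nn as nn

class PseudoConv3d(nn.Module):
    """2D convolution over (H, W) followed by a 1D convolution along the z-axis."""

    def __init__(self, in_ch, out_ch, kernel_size=3):
        super().__init__()
        self.spatial = nn.Conv2d(in_ch, out_ch, kernel_size, padding=kernel_size // 2)
        self.depth = nn.Conv1d(out_ch, out_ch, kernel_size, padding=kernel_size // 2)

    def forward(self, x):
        # x: (B, C, D, H, W), a stack of 2D slices
        b, c, d, h, w = x.shape
        x = x.permute(0, 2, 1, 3, 4).reshape(b * d, c, h, w)    # fold depth into batch
        x = self.spatial(x)                                      # 2D conv on each slice
        _, c2, h2, w2 = x.shape
        x = x.reshape(b, d, c2, h2, w2).permute(0, 3, 4, 2, 1)   # (B, H, W, C, D)
        x = x.reshape(b * h2 * w2, c2, d)
        x = self.depth(x)                                        # 1D conv along z
        return x.reshape(b, h2, w2, c2, d).permute(0, 3, 4, 1, 2)

# quick shape check: (1, 2, 16, 64, 64) -> (1, 8, 16, 64, 64)
out = PseudoConv3d(2, 8)(torch.randn(1, 2, 16, 64, 64))
```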
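For WDM 3D, the wavelet input and the final reconstruction can be illustrated with PyWavelets. The helper names are ours, and the repository may implement the transform differently; this only shows the subband bookkeeping.

```python
import numpy as np
import pywt

# the 8 subbands of a single-level 3D DWT, in a fixed order
SUBBAND_KEYS = sorted(["aaa", "aad", "ada", "add", "daa", "dad", "dda", "ddd"])

def to_subbands(vol):
    """Single-level 3D Haar DWT; stacks the 8 subbands as channels.
    Each subband has roughly half the spatial resolution of the input."""
    coeffs = pywt.dwtn(vol, "haar")
    return np.stack([coeffs[k] for k in SUBBAND_KEYS], axis=0)

def from_subbands(pred):
    """Inverse 3D Haar DWT reconstructing a volume from predicted subbands."""
    coeffs = {k: pred[i] for i, k in enumerate(SUBBAND_KEYS)}
    return pywt.idwtn(coeffs, "haar")
```

Here, x_wav would be to_subbands(x_GT), and analogously for b_wav and m_wav; concatenating them along the channel axis yields the model input at half the spatial resolution, and from_subbands recovers the image from the predicted x_wav,0.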

All models were trained on the publicly available dataset from the "BraTS 2023 Local Synthesis of Healthy Brain Tissue via Inpainting Challenge" [6-11].

The requirements.txt file in the main folder (Diffusion_Models_Inpainting) applies to all subfolders except LDM_3D, which has its own requirements.txt file.

Exemplary inpainting results generated by the different methods:

References

[1] Durrer, A., et al.: Denoising diffusion models for inpainting of healthy brain tissue. arXiv preprint arXiv:2402.17307 (2024)

[2] Zhu, L., et al.: Make-A-Volume: Leveraging latent diffusion models for cross-modality 3D brain MRI synthesis. In: International Conference on Medical Image Computing and Computer-Assisted Intervention. pp. 592–601. Springer (2023)

[3] Bieder, F., et al.: Memory-efficient 3D denoising diffusion models for medical image processing. In: Medical Imaging with Deep Learning (2023)

[4] Khader, F., et al.: Medical diffusion: Denoising diffusion probabilistic models for 3D medical image generation. arXiv preprint arXiv:2211.03364 (2022)

[5] Friedrich, P., et al.: WDM: 3D wavelet diffusion models for high-resolution medical image synthesis. arXiv preprint arXiv:2402.19043 (2024)

[6] Baid, U., et al.: The RSNA-ASNR-MICCAI BraTS 2021 benchmark on brain tumor segmentation and radiogenomic classification. arXiv preprint arXiv:2107.02314 (2021)

[7] Bakas, S., et al.: Advancing The Cancer Genome Atlas glioma MRI collections with expert segmentation labels and radiomic features. Scientific Data 4(1), 1–13 (2017)

[8] Bakas, S., et al.: Identifying the best machine learning algorithms for brain tumor segmentation, progression assessment, and overall survival prediction in the BraTS challenge. arXiv preprint arXiv:1811.02629 (2018)

[9] Karargyris, A., et al.: Federated benchmarking of medical artificial intelligence with MedPerf. Nature Machine Intelligence 5(7), 799–810 (2023)

[10] Kofler, F., et al.: The Brain Tumor Segmentation (BraTS) Challenge 2023: Local synthesis of healthy brain tissue via inpainting. arXiv preprint arXiv:2305.08992 (2023)

[11] Menze, B.H., et al.: The multimodal brain tumor image segmentation benchmark (BraTS). IEEE Transactions on Medical Imaging 34(10), 1993–2024 (2014)
