A Data-Efficient Deep Learning Framework for Segmentation and Classification of Histopathology Images

Official PyTorch implementation of the following paper: A Data-Efficient Deep Learning Framework for Segmentation and Classification of Histopathology Images. arXiv 2022.

Accepted at ECCV-2022: Medical Computer Vision Workshop (ECCV-MCV)

Updates

August 16, 2022

Accpeted at ECCV-MCV 2022

July 26, 2022

Added example and support for CoAtNet. This is a hybrid architecture with the abilities of CNN and Transformers, developed to overcome the shortcomings of transformers in low data regimes. Results have been updated in RESULTS.md

July 13, 2022

Initial Code release for DEDL

Description of the mathematical models

We provide code for classification and segmentation on the Whole slide Histopathology Images of biopsies of dermatomyositis. For Segmentation, we tested algorithms using U-Net and U-Net++ architectures and our Novel Autoencoder Post Processing (APP) architecture. APP Improves performance upon traditional segmentation techniques by an average of 3% for UNet and Unet++ with Resnet Family (Refer Section 5.1 of Paper) of Encoder and 5% for Efficient net family of encoders (Refer Section 1.2 of Supplementary material of Paper).

For Classification, we have tested our proposed approach on CNNs and Transformers; a complete list of all results can be found in section 5.2.

Assumptions

We don't account for any meta-data.

Dataset

We developed and tested our approach on a private dataset consisting of biopsies of dermatomyositis. This dataset has also been used by [Van Buren et al.] [1]

Segmentation

We use the same splits as mentioned in [Van Buren et al.] [1]

Classification

We split the dataset in 80/20 for training and testing. Within the training set, we perform a 6-fold cross-validation.

Dataset description

The dataset we used contained 198 tiff image sets containing eight slide images per tiff image. These eight slide images had different protein staining DAPI, CXCR3, CD19, CXCR5, PD1, CD4, CD27 and autofluorescence. The DAPI stained slides were used for image segmentation. These are black and white images with sizes 1408 and 1876. For classification, we use the autofluorescence slides, which contain all the phenotype labels from CXCR3, CD19, CXCR5, PD1, CD4, and CD27. Autofluorescence slides are 352 by 469 in RGB.

Specification of dependencies

Run the following commands to make sure all dependencies are installed. (Latest version recommended)

pip install pytorch_lightning
pip install torchcontrib
pip install torchmetrics
pip install -U git+https://github.com/qubvel/segmentation_models.pytorch
pip install timm
pip install tqdm
pip install einops

NOTE: segmentation_models.pytorch installs an older version timm package for itself, which might break some part of the code. So, it is better to uninstall that and install the latest version of timm (version 0.5 and beyond).

We assume that the practitioner has a working installation (latest recommended) of Pytorch, Numpy, matplotlib and pandas.

Explore the files

DEDL/ Segmentation with APP/Segmentation with APP.ipynb contains the training code for Training segmentation architecture with Autoencoder post-processing.

At the bottom it also has helper function for testing the results on Test set.

For changing the segmentation architecture:

import segmentation_models_pytorch as smp

model = smp.Unet(encoder_name=encoder_name,......

This model is currently using UNet architecture, to chnage to Unet++ simply change Unet to Unetplusplus. For more details check the complete list of supported architecture by segmentation_models_pytorch.

Note: we have only tested on Unet an Unet++ architectures.

DEDL/ Segmentation with APP/Segmentation.ipynb contains the segmentation architecture without the post-processing autoencoder architecture (APP). This notebook also contains the testing code at the bottom. Similarly, segmentation architecture can be changed as mentioned in the above step.
DEDL/Classification/ contains the code for supervised classification of the given dataset. This algorithm has been tested for CNNs and Transformers supported by timm and other architectures that haven't been implemented in timm library. To test the saved models, use the code provided in DEDL/Classification/Testing.ipynb

Results & Reproduction

See RESULTS.md for more details.

Time Complexity Comparison with and without APP

For segmentation, APP is only used during training. This increases the training time marginally (more details in supplementary material section 1.2). But the size of saved model weights and inference time remains constant.

References

[1]: https://www.sciencedirect.com/science/article/abs/pii/S0022175922000205
segmentation_models.pytorch: https://github.com/qubvel/segmentation_models.pytorch
Timm encoders/CNNs and Transformer Implementation for Pytorch: https://github.com/rwightman/pytorch-image-models

Citation


@inproceedings{singh2023data,
  title={A data-efficient deep learning framework for segmentation and classification of histopathology images},
  author={Singh, Pranav and Cirrone, Jacopo},
  booktitle={Computer Vision--ECCV 2022 Workshops: Tel Aviv, Israel, October 23--27, 2022, Proceedings, Part III},
  pages={385--405},
  year={2023},
  organization={Springer}
}

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
Classification		Classification
Segmentation with APP		Segmentation with APP
Segmentation		Segmentation
.gitignore		.gitignore
README.md		README.md
RESULTS.md		RESULTS.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

A Data-Efficient Deep Learning Framework for Segmentation and Classification of Histopathology Images

Updates

August 16, 2022

July 26, 2022

July 13, 2022

Description of the mathematical models

Assumptions

Dataset

Segmentation

Classification

Dataset description

Specification of dependencies

Explore the files

Results & Reproduction

Time Complexity Comparison with and without APP

References

Citation

About

Languages

pranavsinghps1/DEDL

Folders and files

Latest commit

History

Repository files navigation

A Data-Efficient Deep Learning Framework for Segmentation and Classification of Histopathology Images

Updates

August 16, 2022

July 26, 2022

July 13, 2022

Description of the mathematical models

Assumptions

Dataset

Segmentation

Classification

Dataset description

Specification of dependencies

Explore the files

Results & Reproduction

Time Complexity Comparison with and without APP

References

Citation

About

Resources

Stars

Watchers

Forks

Languages