Deep Gaussian Processes for classification with multiple noisy annotators. Application to breast cancer tissue classification

Code of Deep Gaussian Processes for crowdsourcing

Citation

@article{lopez2023deep,
  title={Deep Gaussian processes for classification with multiple noisy annotators. Application to breast cancer tissue classification},
  author={L{\'o}pez-P{\'e}rez, Miguel and Morales-{\'A}lvarez, Pablo and Cooper, Lee AD and Molina, Rafael and Katsaggelos, Aggelos K},
  journal={IEEE Access},
  volume={11},
  pages={6922--6934},
  year={2023},
  publisher={IEEE}
}

To run the code

Find the features, preprocessing and all the info about the data in the repo of our previous work.

To use Deep Gaussian Processes, you need this repo installed.

Abstract

Machine learning (ML) methods often require large volumes of labeled data to achieve meaningful performance. The expertise necessary for labeling data in medical applications like pathology presents a significant challenge in developing clinical-grade tools. Crowdsourcing approaches address this challenge by collecting labels from multiple annotators with varying degrees of expertise. In recent years, multiple methods have been adapted to learn from noisy crowdsourced labels. Among them, Gaussian Processes (GPs) have achieved excellent performance due to their ability to model uncertainty. Deep Gaussian Processes (DGPs) address the limitations of GPs using multiple layers to enable the learning of more complex representations. In this work, we develop Deep Gaussian Processes for Crowdsourcing (DGPCR) to model the crowdsourcing problem with DGPs for the first time. DGPCR models the (unknown) underlying true labels, and the behavior of each annotator is modeled with a confusion matrix among classes. We use end-to-end variational inference to estimate both DGPCR parameters and annotator biases. Using annotations from 25 pathologists and medical trainees, we show that DGPCR is competitive or superior to Scalable Gaussian Processes for Crowdsourcing (SVGPCR) and other state-of-the-art deep-learning crowdsourcing methods for breast cancer classification. Also, we observe that DGPCR with noisy labels obtains better results (F1 = 81.91%) than GPs (F1 = 81.57%) and deep learning methods (F1 = 80.88%) with true labels curated by experts. Finally, we show an improved estimation of annotators’ behavior.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
LICENSE		LICENSE
README.md		README.md
dgpcr.py		dgpcr.py
run_dgpcr.py		run_dgpcr.py
utils_svgp.py		utils_svgp.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Deep Gaussian Processes for classification with multiple noisy annotators. Application to breast cancer tissue classification

Code of Deep Gaussian Processes for crowdsourcing

Citation

To run the code

Abstract

About

Releases

Packages

Languages

License

wizmik12/DGPCR

Folders and files

Latest commit

History

Repository files navigation

Deep Gaussian Processes for classification with multiple noisy annotators. Application to breast cancer tissue classification

Code of Deep Gaussian Processes for crowdsourcing

Citation

To run the code

Abstract

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages