Skip to content

veb-101/Kaggle-Human-Protein-Classification

Repository files navigation


Cover image


Introduction

In this kaggle in-class competition, we will develop models capable of classifying mixed patterns of proteins in microscope images. Images visualizing proteins in cells are commonly used for biomedical research, and these cells could hold the key for the next breakthrough in medicine. However, thanks to advances in high-throughput microscopy, these images are generated at a far greater pace than what can be manually evaluated. Therefore, the need is greater than ever for automating biomedical image analysis to accelerate the understanding of human cells and disease.

This is a multilabel image classification problem, where each image can belong to several classes. The class labels are as follows:

  1. Mitochondria,
  2. Nuclear bodies',
  3. Nucleoli,
  4. Golgi apparatus,
  5. Nucleoplasm,
  6. Nucleoli fibrillar center,
  7. Cytosol,
  8. Plasma membrane,
  9. Centrosome,
  10. Nuclear speckles

Overview

  • This competition was solved using the PyTorch deep learning framework.
  • During the course of the competition, we performed various steps such as:
  1. Data preparation
  2. Building custom Dataset class
  3. Performing data augmentations
  4. Creating our own train and test dataloaders
  5. A wrapper class for dataloaders to move data and model to the GPU memory
  6. Building model Base class and our Protein Model class
  7. Some helper methods to bind together all our functions together and start the training proces
  8. Generating the prediction process.

Training Metrics

metrics

Releases

No releases published

Packages

No packages published