
MNIST Classification using various Deep Learning Techniques


Classification of the MNIST dataset in PyTorch using 3 different approaches:

  • Convolutional Neural Networks (CNN)
  • Contrastive Learning (CL) framework SimCLR
  • Multiple Instance Learning (MIL)

1. Dataset

MNIST is an acronym that stands for the Modified National Institute of Standards and Technology dataset. It consists of 70,000 small square 28×28-pixel grayscale images of handwritten single digits between 0 and 9, split into a training set of 60,000 examples and a test set of 10,000 examples.
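
For reference, a minimal sketch of loading the dataset with torchvision (the normalization constants are the commonly quoted MNIST mean and standard deviation; paths and batch sizes here are illustrative, not the repository's settings):

```python
import torch
from torchvision import datasets, transforms

transform = transforms.Compose([
    transforms.ToTensor(),                       # [0, 255] -> [0.0, 1.0], shape (1, 28, 28)
    transforms.Normalize((0.1307,), (0.3081,)),  # commonly used MNIST mean/std
])

train_set = datasets.MNIST("./data", train=True, download=True, transform=transform)
test_set = datasets.MNIST("./data", train=False, download=True, transform=transform)

train_loader = torch.utils.data.DataLoader(train_set, batch_size=64, shuffle=True)
test_loader = torch.utils.data.DataLoader(test_set, batch_size=1000)
```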

2. What is Contrastive Learning

In recent years, a resurgence of work in CL has led to major advances in self-supervised representation learning. The common idea in these works is the following: pull together an anchor and a “positive” sample in embedding space, and push apart the anchor from many “negative” samples. If no labels are available (unsupervised contrastive learning), a positive pair typically consists of two data augmentations of the same sample, and negative pairs are formed by the anchor and randomly chosen samples from the minibatch.
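
A minimal sketch of the NT-Xent (InfoNCE) loss that SimCLR uses to implement this idea; `z1` and `z2` are the projected embeddings of two augmentations of the same N images (illustrative code, not the repository's implementation):

```python
import torch
import torch.nn.functional as F

def nt_xent_loss(z1, z2, temperature=0.5):
    N = z1.size(0)
    z = F.normalize(torch.cat([z1, z2], dim=0), dim=1)  # (2N, D), unit norm
    sim = z @ z.t() / temperature                       # scaled cosine similarities
    sim.fill_diagonal_(float("-inf"))                   # exclude self-similarity
    # the positive for sample i is its other augmentation: i+N (or i-N)
    targets = torch.cat([torch.arange(N, 2 * N), torch.arange(0, N)])
    return F.cross_entropy(sim, targets)
```

Each row is a softmax over similarities to all other samples in the batch, with the matching augmentation as the target class: the anchor is pulled toward its positive and pushed away from everything else.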

[Figure: Contrastive Learning]


3. What is Multiple Instance Learning

In the classical (binary) supervised learning problem one aims at finding a model that predicts the value of a target variable, y ∈ {0, 1}, for a given instance, x. In the MIL problem, however, instead of a single instance there is a bag of instances, X = {x_1, ..., x_n}, that exhibit neither dependency nor ordering among each other. We assume that n may vary between bags. There is a single binary label Y associated with the whole bag. Furthermore, we assume that individual labels exist for the instances within a bag, i.e., y_1, ..., y_n with y_k ∈ {0, 1} for k = 1, ..., n; however, there is no access to those labels and they remain unknown during training.
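
A minimal sketch of a MIL model under these assumptions (a hypothetical module, not the repository's architecture): each instance in a bag is scored independently, and max pooling over the instance scores realizes the standard MIL assumption that a bag is positive iff at least one of its instances is positive.

```python
import torch
import torch.nn as nn

class MaxPoolMIL(nn.Module):
    def __init__(self, instance_dim=784, hidden_dim=128):
        super().__init__()
        self.instance_net = nn.Sequential(
            nn.Linear(instance_dim, hidden_dim), nn.ReLU(),
            nn.Linear(hidden_dim, 1),            # per-instance score
        )

    def forward(self, bag):                      # bag: (n_instances, instance_dim)
        scores = self.instance_net(bag)          # (n_instances, 1)
        return scores.max(dim=0).values          # bag score: max over instances
```

The model is trained with a bag-level loss (e.g. BCEWithLogitsLoss against Y); the instance labels y_k are never used.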

[Figure: Multiple Instance Learning]


4. Ensemble Methods

An ensemble is a collection of models whose predictions are combined with the aim of outperforming each individual model.
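
As a concrete illustration (a sketch, not the repository's code), the simplest combination rule averages the softmax outputs of several trained classifiers:

```python
import torch

@torch.no_grad()
def ensemble_predict(models, x):
    # average the class probabilities of all members, then take the argmax
    probs = torch.stack([model(x).softmax(dim=1) for model in models])
    return probs.mean(dim=0).argmax(dim=1)
```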

5. Requirements

torch
torchvision
matplotlib
numpy
seaborn
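
These can typically be installed with pip, e.g. `pip install torch torchvision matplotlib numpy seaborn`.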

6. Note

This is for demonstration purposes only. The results are not properly validated: no validation protocol (e.g. k-fold cross-validation) is applied. The parameters are not optimized but arbitrarily chosen. The network is chosen to demonstrate a wide range of CNN layer types. Early stopping and a learning-rate scheduler are implemented for demonstration as well.
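
A minimal sketch of how these two utilities fit into a training loop, assuming hypothetical `train_one_epoch` and `evaluate` helpers (the optimizer, scheduler settings, and patience value are illustrative):

```python
import torch

def fit(model, train_one_epoch, evaluate, max_epochs=100, patience=5):
    optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
    scheduler = torch.optim.lr_scheduler.StepLR(optimizer, step_size=10, gamma=0.1)
    best_val_loss, bad_epochs = float("inf"), 0
    for epoch in range(max_epochs):
        train_one_epoch(model, optimizer)        # one pass over the training set
        val_loss = evaluate(model)               # loss on a held-out set
        scheduler.step()                         # decay the learning rate
        if val_loss < best_val_loss:
            best_val_loss, bad_epochs = val_loss, 0
        else:
            bad_epochs += 1
            if bad_epochs >= patience:           # early stopping
                break
    return best_val_loss
```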

7. Results

7.1 CNN

7.2 SimCLR

7.3 MIL

7.4 CNN Ensemble Selector

Individual performance of 10 CNNs trained on the same training dataset

Results after ensemble selection, choosing members to minimize loss and to maximize accuracy
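
A hedged sketch of how such an ensemble selector can work, assuming greedy forward selection over held-out predictions (`model_probs`, `metric`, and `n_rounds` are illustrative names, not the repository's API):

```python
import torch

@torch.no_grad()
def greedy_select(model_probs, labels, metric, n_rounds=10):
    # model_probs: list of (N, C) validation probability tensors, one per model
    selected, current = [], torch.zeros_like(model_probs[0])
    for _ in range(n_rounds):
        def score(i):
            # metric of the ensemble average if model i were added
            avg = (current * len(selected) + model_probs[i]) / (len(selected) + 1)
            return metric(avg, labels)
        best = max(range(len(model_probs)), key=score)
        selected.append(best)
        current = (current * (len(selected) - 1) + model_probs[best]) / len(selected)
    return selected

def accuracy(avg_probs, labels):
    # example metric: accuracy of the averaged probabilities
    return (avg_probs.argmax(dim=1) == labels).float().mean().item()
```

Passing `accuracy` as the metric maximizes accuracy; passing a negated loss minimizes loss. Models may be selected more than once, which effectively weights them.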

7.5 CNN TorchEnsemble

7.5.1 Voting Classifier

TBD
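
Until this section is filled in, a minimal sketch of how torchensemble's VotingClassifier is typically driven (assuming the usual torchensemble API; `Net` stands in for the repository's CNN class, and exact signatures should be checked against the installed version's documentation):

```python
from torchensemble import VotingClassifier

ensemble = VotingClassifier(
    estimator=Net,          # the base CNN class (hypothetical name)
    n_estimators=10,        # number of voting members
    cuda=True,
)
ensemble.set_optimizer("Adam", lr=1e-3)
ensemble.fit(train_loader, epochs=10)
accuracy = ensemble.evaluate(test_loader)
```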

8. Support

Reach out to me: