MingSheng92/Image_Classification

Machine learning - Image classification

Today we want to look at different image classification methods on two different datasets, MNIST and Fashion-MNIST.

Datasets

As mentioned in the previous section, we will be performing our tests and evaluation on two MNIST datasets: the handwritten-digit MNIST and the Fashion-MNIST dataset. Both datasets have similar attributes: each contains sets of 28 x 28 greyscale images labelled under 10 classes.

MNIST sample images (by Josef Steppan, own work, CC BY-SA 4.0)

The Fashion-MNIST dataset was created because the handwritten-digit MNIST dataset is too simple: even a simple model setup can easily achieve 90% accuracy on it. The Fashion-MNIST image above is taken from the official Fashion-MNIST GitHub website.

For more information on the datasets used, you may refer to the links below:
MNIST handwritten digits dataset.
Fashion MNIST dataset.

Pre-processing

We rescale the data with min-max normalization, z = (x - min(x)) / (max(x) - min(x)), where x is each and every pixel value in our case, and z is our rescaled value, ranging between [0, 1].

Another way to perform normalization is to divide all pixel values by 255, since we already know a pixel has a minimum value of 0 and a maximum of 255.
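Both normalization approaches can be sketched in a few lines of NumPy; the toy pixel array below is illustrative, not from the actual dataset. When the data actually spans the full 0-255 range, the two give identical results:

```python
import numpy as np

# Hypothetical batch of greyscale pixel values in the range 0-255
images = np.array([[0, 128, 255], [64, 32, 16]], dtype=np.float64)

# Min-max normalization: rescale every pixel into [0, 1]
normalized = (images - images.min()) / (images.max() - images.min())

# Since pixels are already known to span 0-255, dividing by 255 is equivalent
normalized_simple = images / 255.0
```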

Methods

Bernoulli Naive Bayes

Naive Bayes is a probabilistic classifier that makes strong (naive) independence assumptions between the features. Out of all the Naive Bayes models, we chose to work with Bernoulli Naive Bayes, as we believe we can treat the image data as a binary representation when calculating the posterior; hence it should work relatively better here than the Gaussian or Multinomial Naive Bayes models.

The likelihood of a binary feature vector x under each class Ck is defined as p(x | Ck) = prod_i p_ki^xi * (1 - p_ki)^(1 - xi), where p_ki is the probability of class Ck generating a 1 for feature i.

Additional reading material:
Wikipedia
NLP Stanford
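The per-class likelihood above can be sketched directly in NumPy (in log space, to avoid underflow when multiplying many per-pixel probabilities). The feature vector and probabilities below are toy values, not estimates from MNIST:

```python
import numpy as np

def bernoulli_log_likelihood(x, p):
    """Log-likelihood of binary feature vector x under per-feature
    Bernoulli probabilities p: sum_i [x_i*log(p_i) + (1-x_i)*log(1-p_i)]."""
    return np.sum(x * np.log(p) + (1 - x) * np.log(1 - p))

# Toy example: 4 binarized "pixels" and hypothetical class probabilities
x = np.array([1, 0, 1, 1])
p = np.array([0.9, 0.2, 0.8, 0.7])
ll = bernoulli_log_likelihood(x, p)
```

At classification time one would compute this for each class, add the log prior, and pick the class with the largest score.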

Logistic regression

We will use multi-class logistic regression for this task with the softmax function, defined as softmax(z)_k = exp(z_k) / sum_j exp(z_j).

When there are K different classes C1, C2, ..., CK, each class Ck has a parameter vector wk, and the posterior probability of the model is p(Ck | x) = exp(wk^T x) / sum_{j=1}^{K} exp(wj^T x).

The multinomial logistic loss (cross-entropy) is then L(w) = - sum_n sum_k y_nk * log p(Ck | xn), where y_nk is 1 if sample n belongs to class Ck and 0 otherwise.

Additional reading material:
Wikipedia
Stack Overflow

Convolutional Neural Network

Anyone with some exposure to machine learning will have heard of convolutional neural networks: they are one of the most commonly used machine learning methods for analyzing visual imagery.

Additional reading material:
Wikipedia
Adit Deshpande
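To make the core operation concrete, here is a minimal NumPy sketch of a single "valid" 2-D convolution (strictly, cross-correlation, which is what most CNN libraries compute). The 4x4 image and edge-detecting kernel are toy values, not part of the actual model:

```python
import numpy as np

def conv2d(image, kernel):
    """Valid 2-D cross-correlation of a single-channel image with a kernel:
    slide the kernel over the image and sum the elementwise products."""
    kh, kw = kernel.shape
    ih, iw = image.shape
    out = np.zeros((ih - kh + 1, iw - kw + 1))
    for r in range(out.shape[0]):
        for c in range(out.shape[1]):
            out[r, c] = np.sum(image[r:r + kh, c:c + kw] * kernel)
    return out

# Toy 4x4 "image" and a 3x3 vertical-edge kernel
image = np.arange(16, dtype=np.float64).reshape(4, 4)
kernel = np.array([[1.0, 0.0, -1.0]] * 3)
feature_map = conv2d(image, kernel)
```

A real CNN stacks many such learned kernels with nonlinearities and pooling layers; libraries implement this far more efficiently than the explicit loops above.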

Results

Without any doubt we can assume that the neural network will achieve better results than the other machine learning methods, and the final results confirm this. However, the Bernoulli Naive Bayes classifier gets results comparable to logistic regression, which is a bit of a surprise, since Naive Bayes often does not predict well on real-life tasks even though it works well in theory. Our current implementation of logistic regression is also not optimal: if we implemented regularization, we could expect a clear improvement towards a more robust classifier. Although the CNN achieved the best prediction results, CNNs are extremely sensitive to the dataset, meaning that for a different task you will need to define a different architecture; there is no such thing as one model that fits all (the no-free-lunch theorem).

From the table below, you can find a summary of the accuracy/performance of all the different classifiers on the MNIST and Fashion-MNIST datasets.

Result

Future Work

  1. Add regularization to logistic regression to make the classifier more robust.
  2. Implement Multinomial Naive Bayes and Gaussian Naive Bayes and compare between the Naive Bayes models.
  3. Add further evaluation methods (ROC curves, confusion matrices, etc.) to analyze the listed models.