Preprocessing Techniques for Image Classification

Introduction

In this project we tried to solve a classification problem using only ML models and preprocessing techniques. The idea is not to foucuse on the models but on the preprocessing of the data to achive better results.

Models: SVM, KNN, Random Forest, Logistic Regression.

Preprocessing Techniques: Edge detection, MeanRGB, and feature-extraction - Bag of Visual Words (SIFT, k-means, histograms).

Dataset

Link: https://www.kaggle.com/tongpython/cat-and-dog

This dataset contains difficult data cats and dogs - different sizes and dirty imgs. The training contains 8k imgs and test 1k, with even numbers of images from each label. Because the dataset is of raw images first we need to read each one, resize, and then reffer to the pixels as featuers. The featuers for each sample is big meaning the demention is big, while also they have no "sence" behind them only the pixel value, this makes the classification problem all the more difficult.

Bssic Ground Truth

To have a basic result to compare with, we did simple preprocessing and tested the models.

Resize - set all the images to the same size.
Gray Scale - reduce the demention from 3 (RGB) to 1 (Gray)
Flatteren
Test Models - SVM, KNN, Random Forest, Logistic Regression

Data Preprocessing

Gray Scale

Reduce the demention from 3 (RGB) to 1 (Gray), each pixel value will be 0-255.

Mean RGB

Reduce the demention from 3 (RGB) to 1, but each pixel value will be the mean value from each on of the dementions - Red Green Blue.

Edge detection

Finding the eadges of the images and use the result. At first we wanted to use Canny but wanted to use a more modern model.

Using Structured Forests / Structured Edge detector - For more details: https://debuggercafe.com/edge-detection-using-structured-forests-with-opencv/

BoVW - Bag of visual words (Feature extraction)

How to:

Read each sample as RGB and resize 90x90
Extraction of 50 key points, that would be the new features, using SIFT. (each key point is represented as a vector of size 128)
Using K-Means clustering on the new features we extracted, this in order to cluster togther similar feature groups.
For each smaple create a histogram - the size of the hist is the amount of centers from the K-Means. And for each smaple we count the fetuers it contains.
Normalizing
Run the models on the hist that represent each smaple.

Parameters Optimization

Grid Search

Receving many parametes and try to find the best combination.

Random Grid Search

Receving many parametes, but is randomly picking parameters to find a good combination that give good results. The idea is to optimize the parametes search and not try all combinations.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
cat_and_dog		cat_and_dog
model.yml		model.yml
ML_models.ipynb		ML_models.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Preprocessing Techniques for Image Classification

Introduction

Dataset

Bssic Ground Truth

Data Preprocessing

Gray Scale

Mean RGB

Edge detection

BoVW - Bag of visual words (Feature extraction)

Parameters Optimization

Grid Search

Random Grid Search

Results

Models with Parameters Optimization

Models with Data Preprocessing

BoVW

About

Releases

Packages

Contributors 2

Languages

omerugi/Preprocessing_Techniques_Image_Classification

Folders and files

Latest commit

History

Repository files navigation

Preprocessing Techniques for Image Classification

Introduction

Dataset

Bssic Ground Truth

Data Preprocessing

Gray Scale

Mean RGB

Edge detection

BoVW - Bag of visual words (Feature extraction)

Parameters Optimization

Grid Search

Random Grid Search

Results

Models with Parameters Optimization

Models with Data Preprocessing

BoVW

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages