Vehicle Type Recognition from image data

This project aims to recognize vehicles through images. It is based on a Convolutional Neural Network (from now named CNN) following the essence of Machine Learning algorithms.

The intention is to classify a dataset composed of images that includes different types of vehicles including cars, bicycles, boats, trucks, etc. with a total of 15 categories.

The dataset has been collected from the Open Images dataset (over 9 million images) using a subset, selected to contain only vehicle categories among the total of 600 object classes.

The data contains a folder of training data with the class labels and a folder of test data without the labels. The model deals to predict the secret labels for the test data. So, this is an image classification task.

Details of dataset available

The dataset consists of two files listed below.

The training set (train.zip): a set of images with true labels in the folder names. The zip file contains altogether 27290 files organized in folders. The folder name is the true class; i.e., "Boat" folder has all boat images, "Car" folder has all the car images and so on.
The test set (test.zip): a set of images without labels. The zip file contains altogether 7958 files in a single folder. The file name is the id for the solution's first column; i.e., the predicted class for file "000000.jpg".

First step: Exploratory Data Analysis

In this first phase, I analyze the data that I have available for training the neural network. For this I ask ourselves the following questions that have been resolved in the corresponding notebooks:

How many images do we have to train?
How many images do we have to test?
How many images do we have per class?
Is there a very unbalanced class? (If a class has many more samples than another class, the data set is not balanced.)
In order to visualize the dataset, I use the matplotlib library to display a random subset of images for the different classes.

In addition, I cleaned the training dataset. It can be seen that image classes may contain peculiar or odd images. Therefore, to enhance the dataset quality, I filtered the data and removed these "outliers" from the training set.

Second step: Train a predictive model

I started the project and learning process by building my own CNN models. Testing how adding different layers affect on validation loss. Validation loss was my primary score during training. Own models were prone to overfitting, but batch normalization after each convolution helped. Also, I did not use yet image augmentation with my own models. Could not reach 80% accuracy and kept in mind that I can not create competitive CNN model without deep knowledge so I switched to pretrained models offered by Keras.

Model parameters

First trained a simple model sequential to achieve fast training and quickly test how model parameters like pooling affects. Testing also that how many Dense layers should be added on top of the model.

After that, I tested pre-trained models with keras and added the sequential model as base model.

Data augmentation and training

To load images, I used Keras Image Generators, which generate batches of tensor image data with real-time data augmentation. In particular, I used horizontal flip, rotation, width shift, and height shift. In this setup, zoom augmentation did not give any gain in the evaluation score.

If training accuracy tend to go higher than validation accuracy (or loss lower) I added more augmented image sets and decreased epochs for each set.

I used 85% of images for training and 15% for validation. Stratified and different random_state for each model.

After the val loss did not decrease anymore I finalized the model by training quickly with validation images. I left the original unprocessed images for validation and run the same sequential augmentation and training for the previous validation images.

Models trained

It can be noticed, that certain classes are dominant in the data set (big class imbalance). My solution was to use sklearn.utils.class_weight as class weights when training networks.

I tried the following architectures: Sequential, InceptionV3, MobileNet, ResNet50, NasNetLarge, EfficientNet. Different image sizes starting from 224 and up tu 331 as well as batch sizes were used.

MobileNet and ResNet50 seemed to be the best.

Here you have a tutorial MobileNet and ResNet50.

Streamlit as visualization result

It is a library that makes it easy to create web applications to display results of your data analysis.

💻 Technology stack

Python
numpy
pandas
matplotlib
seaborn
Image
cv2
keras
tensorflow
sklearn

All processes are built on a data pipeline through PyCharm IDE. Only need to run main_script.py

💥 Core technical concepts and inspiration

This project was born from the need to apply the knowledge learned during the data analytics bootcamp in a real application in my daily work.

💩 ToDo

Possible new steps into the project:

Road speed according to each predicted vehicle.
Vehicles, pedestrians and sign lights recognition

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
data/results		data/results
notebooks		notebooks
p_acquisition		p_acquisition
p_analysis		p_analysis
p_reporting		p_reporting
.env.txt		.env.txt
.gitignore		.gitignore
README.md		README.md
main_script.py		main_script.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Vehicle Type Recognition from image data

Details of dataset available

First step: Exploratory Data Analysis

Second step: Train a predictive model

Model parameters

Data augmentation and training

Models trained

Streamlit as visualization result

💻 Technology stack

💥 Core technical concepts and inspiration

💩 ToDo

About

Releases

Packages

Languages

ChristianJavierMelo/Vehicle-Type-Recognition

Folders and files

Latest commit

History

Repository files navigation

Vehicle Type Recognition from image data

Details of dataset available

First step: Exploratory Data Analysis

Second step: Train a predictive model

Model parameters

Data augmentation and training

Models trained

Streamlit as visualization result

💻 Technology stack

💥 Core technical concepts and inspiration

💩 ToDo

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages