Automatic concept recognition in images

This project aims to implement a strong C++ algorithm for concept recognition in images based on Neural Networks. Moreover, it implements a friendly user interface for evaluation and testing using Qt library. We also use the well-known OpenCv library for images processing and classification. Our method is a two phases approach. First, we detect and localize a concept (shape) in an image. Then, we recognize and classify the concept with our Neral Networks. However, before starting detection and classification, we need to do some work :

create a bag of visual words and extract image signatures from the training dataset of images
train the Neural Networks with the training dataset

Building of the bag of visual words and image signatures extraction

Image dataset and features extraction

First of all, we need to choose a right training dataset for feature extraction. Here are some features we have to take into account when choosing a training images dataset :

the size of the dataset
the level of detail
the variety of image viewpoints

Once the training dataset has been selected, we can start features extraction. A good feature should be invariant to geometry (rotation, scaling, affine transformation) and photometry. There are many methods of extraction implemented by OpenCv, we choose the SIFT method because it has a better performance for some features.

Method	Time	Scale	Rotation	Blur	Illumination	Affine
SIFT	Normal	Best	Best	Best	Normal	Good
PCA-SIFT	Good	Normal	Good	Normal	Good	Good
SURF	Best	Good	Normal	Good	Best	Good

Bag of visual words

This is done by clustering the extracted features with kMeans algorithm provided by OpenCv.

Image signatures

Once we built the bag of visual words, for each image we can now extract its signature. This signature is a vector V where |V| = the size of the clusters set and V[i] is the number of descriptors contained in the ith cluster.

Classification

The classification aims to associate automatically keywords to images or concepts in our context. In order to achieve this goal, we are going to use Artificial Neural Networks. But before using them for classification, we must train them with the training dataset. We use a supervised learning method which consists of a mapping between an input and an output. So, we map a concept signature to a class name or label. Also, our learning method is based on multilayer perceptron using a backpropagation algorithm to calculate the gradient. Once this step is done, we can present a concept to our Artificial Neural Network and it will return its class name.

Implementation

Our algorithm is implemented through 3 major classes :

BagOfWords : create and load image signatures
Classifier : classify concepts by the mean of Artificial Neural Networks
Processing : process images and return a set of concepts (shape)

Evaluation and testing

Evaluation

In order to evaluate our algorithm, we use Columbia Object Image Library (COIL) which is a dataset available in several different versions : COIL-100, COIL-20. We use cross validation as our evaluation method where we use the half dataset for training and the other for evaluation and testing. Here are the results we got according to data classification indices for COIL-100 dataset :

True Positive Rate (TPR) : 69%
False Positive Rate (TPR) : 0.4%
Recall : 69%
Precision : 61%
Accuracy : 99%
F-mesure : 65%

Testing

Our software present 3 different menus :

Bag of visual words : here we can create a new dictionary (image signatures) with precise configurations and load a saved dictionary.
Neural Networks : here we can configure, train or load a trained Neural Networks.
Tests and applications : here we can evaluate our classification, find different concepts in a given set of images (directory) and for every found concept, we can use it for searching in a given directory.

Authors

F. Ndongmo Silatsa

Licence

This project is licensed under the MIT License - see the LICENSE.md file for details

Acknowledgments

D. Frolova and D. Simakov. Matching with invariant features. page 35, mars 2004.
L. Juan and O. Gwun. A comparison of sift, pca-sift and surf. International Journal of Image Processing (IJIP), pages 143-152, 2009.
P. Borne, M. Benrejeb, and J. Haggege. Les réseaux de neurones : Présentation et applications. Editions TECHNIP, 2007.
B. Tomasik, P. Thiha, and D. Turnbull. Tagging products using image classification.
S. A. Nene, S. K. Nayar, and H. Murase. Columbia object image library (coil-100). Technical Report No. CUCS-006-96.

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
images		images
setup		setup
src		src
LICENSE.md		LICENSE.md
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Automatic concept recognition in images

Building of the bag of visual words and image signatures extraction

Image dataset and features extraction

Bag of visual words

Image signatures

Classification

Implementation

Evaluation and testing

Evaluation

Testing

Authors

Licence

Acknowledgments

About

Releases

Packages

Languages

License

ndongmo/Automatic-concept-recognition-in-images

Folders and files

Latest commit

History

Repository files navigation

Automatic concept recognition in images

Building of the bag of visual words and image signatures extraction

Image dataset and features extraction

Bag of visual words

Image signatures

Classification

Implementation

Evaluation and testing

Evaluation

Testing

Authors

Licence

Acknowledgments

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages