A demo program of a content-based image retrieval system using visual and textual features
- OpenCV (v3.0, built from the source code)
- Boost (the latest version, downloaded from Homebrew)
- CMake
- Download the repository: `git clone https://github.com/chunweiliu/cbir`
- Compile the source code: `cd cbir && mkdir build && cd build && cmake .. && make`
- Run the demo program: `./demo <dataset> <queryset> <kNumQuery> <kNumLexicon> <kAlpha>`
Here is an example of the expected result:
```
$ ./demo <dataset> <queryset> 9 23 0.5
CBIR DEMO
Dataset: <dataset>
Queryset: <queryset>
kNumQuery=9, kNumLexicon(unpruned)=23, kAlpha=0.5
Image retrieval accuracy: 0.925 (37/40)
Lexicon (8)
836 [-]
1.3e+03 [;]
488 [Leather]
494 [clutch]
432 [features]
429 [hobo]
1.21e+03 [leather]
413 [zip]
Text retrieval accuracy: 0.925 (37/40)
Hybrid retrieval accuracy: 0.95 (38/40)
```
The demo program works as follows:
- Pre-computing the image/text features for all images in a data set
- Computing the image/text feature of a query image from another set, and calculating the sum-of-squared-differences (SSD) distance between the query feature and every feature in the data set
- Picking the top `kNumQuery` images in terms of minimum SSD, and then taking the majority label of that ensemble as the final prediction (a sketch of this retrieval loop follows the list)
- Evaluating the accuracy of the predictions over the entire query image set
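Below is a minimal C++ sketch of that retrieval loop, assuming pre-computed features are stored as flat `std::vector<float>` and each data-set item carries a ground-truth label; `Item`, `Ssd`, and `RetrieveLabel` are illustrative names, not the actual demo API.

```cpp
#include <algorithm>
#include <map>
#include <string>
#include <utility>
#include <vector>

struct Item {
  std::vector<float> feature;  // pre-computed feature of a data-set image
  std::string label;           // ground-truth category of that image
};

// Sum of squared differences between two equal-length feature vectors.
float Ssd(const std::vector<float>& a, const std::vector<float>& b) {
  float sum = 0.f;
  for (size_t i = 0; i < a.size(); ++i) {
    const float d = a[i] - b[i];
    sum += d * d;
  }
  return sum;
}

// Rank the data set by SSD to the query and vote over the top kNumQuery items.
std::string RetrieveLabel(const std::vector<float>& query,
                          const std::vector<Item>& dataset, int kNumQuery) {
  std::vector<std::pair<float, const Item*>> ranked;
  for (const Item& item : dataset)
    ranked.push_back({Ssd(query, item.feature), &item});
  std::sort(ranked.begin(), ranked.end(),
            [](const auto& x, const auto& y) { return x.first < y.first; });

  std::map<std::string, int> votes;
  for (int i = 0; i < kNumQuery && i < static_cast<int>(ranked.size()); ++i)
    ++votes[ranked[i].second->label];
  return std::max_element(votes.begin(), votes.end(),
                          [](const auto& x, const auto& y) {
                            return x.second < y.second;
                          })->first;
}
```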
The image feature is a simple 32 by 32 black and white patch.
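A rough OpenCV sketch of this feature, assuming the 32x32 patch is flattened into a float vector so it can be compared with the SSD distance above (the exact scaling used by the demo is not shown):

```cpp
#include <opencv2/opencv.hpp>
#include <vector>

// Resize an image to a 32x32 grayscale patch and flatten it into a vector.
std::vector<float> ImageFeature(const cv::Mat& image) {
  cv::Mat gray, patch;
  cv::cvtColor(image, gray, cv::COLOR_BGR2GRAY);
  cv::resize(gray, patch, cv::Size(32, 32));
  patch.convertTo(patch, CV_32F);
  return std::vector<float>(patch.begin<float>(), patch.end<float>());
}
```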
The text feature is a histogram over a lexicon chosen from the top `kNumLexicon` most frequent words.
Here is an example of a lexicon built with the 20 most frequent words (i.e. `kNumLexicon=20`) in the data set:
```
3393 [a]
3220 [and]
2586 [with]
2382 [the]
1297 [;]
1212 [leather]
1185 [of]
1058 [in]
1013 [The]
1004 [for]
980 [to]
872 [your]
836 [-]
811 [from]
764 [is]
646 [This]
520 [A]
494 [clutch]
488 [Leather]
432 [features]
```
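A rough sketch of how such a lexicon and the per-image histogram feature could be built, assuming each image comes with a plain-text description; the tokenization and stop-word pruning are simplified, and `BuildLexicon`/`TextFeature` are illustrative names:

```cpp
#include <algorithm>
#include <map>
#include <sstream>
#include <string>
#include <utility>
#include <vector>

// Build a lexicon of the kNumLexicon most frequent words over all descriptions.
std::vector<std::string> BuildLexicon(const std::vector<std::string>& descriptions,
                                      int kNumLexicon) {
  std::map<std::string, int> counts;
  for (const std::string& text : descriptions) {
    std::istringstream stream(text);
    std::string word;
    while (stream >> word) ++counts[word];
  }
  std::vector<std::pair<int, std::string>> ranked;
  for (const auto& kv : counts) ranked.push_back({kv.second, kv.first});
  std::sort(ranked.rbegin(), ranked.rend());  // most frequent words first

  std::vector<std::string> lexicon;
  for (int i = 0; i < kNumLexicon && i < static_cast<int>(ranked.size()); ++i)
    lexicon.push_back(ranked[i].second);
  return lexicon;
}

// The text feature: a histogram of lexicon-word occurrences in one description.
std::vector<float> TextFeature(const std::string& description,
                               const std::vector<std::string>& lexicon) {
  std::vector<float> histogram(lexicon.size(), 0.f);
  std::istringstream stream(description);
  std::string word;
  while (stream >> word) {
    auto it = std::find(lexicon.begin(), lexicon.end(), word);
    if (it != lexicon.end()) ++histogram[it - lexicon.begin()];
  }
  return histogram;
}
```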
The hybrid feature is a linear combination of the SSDs computed from the image and text features, i.e. `kAlpha * SSD_image + (1 - kAlpha) * SSD_text`.
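In code the hybrid distance is just this weighted sum of the two SSDs; a minimal sketch:

```cpp
// Weighted combination of the image and text SSDs, following the formula above.
float HybridDistance(float ssd_image, float ssd_text, float kAlpha) {
  return kAlpha * ssd_image + (1.f - kAlpha) * ssd_text;
}
```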
## Evaluations
Using the image feature alone achieves a high accuracy of 0.925 (37/40).
I tried both the unpruned lexicon and a lexicon pruned by removing stop words such as "a", "for", and "the".
My observation is that the accuracy is roughly proportional to the number of words in the lexicon.
Even with the unpruned lexicon, the accuracy slowly converges to 0.95.
Here is a summary of the accuracy as a function of the number of words in the lexicon:
The pruned lexicon reaches the convergence accuracy of 0.95 using only 11 words (the 11th word is "bag"), yet the unpruned lexicon needs 71 words to reach 0.95. There are some powerful keywords (annotated in the figure) that boost the accuracy. In summary, the pruned lexicon has the following benefits:
- Gathering such powerful keywords faster than the unpruned one
- Having no distraction from the stop words
From the above results, the accuracy using the image feature is 0.925, and the best accuracy using the text feature is 1.0, achieved with a pruned lexicon of `kNumLexicon=39` chosen from the top 62 most frequent words.
The hybrid feature might not perform better than that lexicon.
So let's set up a scenario to see the benefit of the hybrid feature.
Say we only have a limited budget on the lexicon size, for instance `kNumLexicon=8`, which reaches 0.925 accuracy.
With the hybrid feature, the result can be slightly better than either individual image or text retrieval.
| kAlpha | Text | Hybrid | Image |
|---|---|---|---|
| 0 | 0.925 | 0.925 | 0.925 |
| 0.1 | 0.925 | 0.95 | 0.925 |
| 0.01 | 0.925 | 0.95 | 0.925 |
| 1 | 0.925 | 0.925 | 0.925 |
- 01/29/2015 System setup (installed the latest OpenCV and Boost, and wrote the CMakeLists.txt)
- 02/04/2015 Commit ImageRetrieval v1.0
- 02/12/2015 Commit TextRetrieval v1.0
- 02/14/2015 Commit HybridRetrieval v1.0
- 02/16/2015 Evaluation
- 02/17/2015 Fixed bugs in HybridRetrieval (zero comparisons, including other text files)