Skip to content

Commit

Permalink
Expand discussion
Browse files Browse the repository at this point in the history
  • Loading branch information
weltenwort committed Oct 10, 2012
1 parent e0ecf62 commit f5fc6e5
Show file tree
Hide file tree
Showing 4 changed files with 33 additions and 11 deletions.
2 changes: 1 addition & 1 deletion thesis/chapters/chapter04/benchmarking.tex
Expand Up @@ -18,7 +18,7 @@ \section{Benchmarking Method}\label{sec:results_benchmarking}
It consists of 20.000 hand-drawn sketches obtained via crowd-sourcing, that are It consists of 20.000 hand-drawn sketches obtained via crowd-sourcing, that are
evenly divided into 250 categories. To speed up computations, 50 of those evenly divided into 250 categories. To speed up computations, 50 of those
categories are chosen to derive precision-recall statistics. From each categories are chosen to derive precision-recall statistics. From each
category, an image is randomly chosen as the query and the rest is used as category, an image is randomly selected as the query and the rest is used as
positive results. In this case, both the query images and the database images positive results. In this case, both the query images and the database images
are from the sketch domain, so the effectiveness of the retrieval process are from the sketch domain, so the effectiveness of the retrieval process
without preprocessing biases can be examined. without preprocessing biases can be examined.
Expand Down
38 changes: 29 additions & 9 deletions thesis/chapters/chapter05.tex
Expand Up @@ -4,6 +4,21 @@ \chapter{Discussion}\label{ch:discussion}
that clearly favor one approach over the other. This might be explained by the that clearly favor one approach over the other. This might be explained by the
multitude of influencing factors, that can impact the performance. multitude of influencing factors, that can impact the performance.


Based on the results displayed previously, one can conclude that for
cross-domain retrieval, an approach based on Canny edge detection and local
features performs better than global approaches. Of the similarity measures
used to compare the images' signatures, histogram intersection provides the
most consistent performance.
The advantage over the global descriptors, however, is not large and poorly
chosen parameter values can lead to the global descriptors outperforming the
local variants.

When both the query image and the database images are sketches, the situation
seems to be reversed. The precision and recall statistics for the global
LUMA+MEAN pipeline show a slight advantage over the local LUMA+PMEAN variant.
This is probably strongly influenced by the images used in the retrieval
benchmark, as will be discussed below.

\section{Structural Choices} \section{Structural Choices}


The intention underlying this thesis was to perform an evaluation of the The intention underlying this thesis was to perform an evaluation of the
Expand All @@ -21,20 +36,25 @@ \section{Structural Choices}
established distance metrics and the TF-IDF weighting scheme, that has been established distance metrics and the TF-IDF weighting scheme, that has been
successfully applied to information retrieval for some time. successfully applied to information retrieval for some time.


Based on the results displayed previously, one can conclude that an approach
based on Canny edge detection and local features performs better than global
approaches. The distance however, is not large and poorly chosen parameters can
lead to the global descriptors exceeding the local variants. This is probably
strongly influenced by the images involved in the retrieval process, as will be
discussed below.

\section{Parameter Choices} \section{Parameter Choices}


Since many of the processing steps are based on commonly-used algorithms, Since many of the processing steps are based on commonly-used algorithms,
literature already presented reasonable starting values for the evaluation. As literature already presented reasonable starting values for the evaluation. As
the experiments in section \ref{sec:results_parameters} show, the initial the experiments in section \ref{sec:results_parameters} show, the initial
values already produce competitive results. values already produce competitive results. The best parameter values seem to
strike a balance between losing information due to small resolution and
becoming overly sensitive to noise or unrelated image background. For local
sampling methods, a neighborhood size of $\frac{1}{3}$ of the image dimensions
repeatedly performs best. A value of $\sigma=1.5$ for the Gaussian blur of the
Canny edge detector appears to be suitable to extract the edges that correspond
to a human sketch of the object or scene. The advantage of an angluar resultion
larger than $N_{\theta}=12$ for the curvelet transform is probably limited by
to the poor accuracy of hand-drawn sketches.


\section{Benchmark Dataset Choices} \section{Benchmark Dataset Choices}


TBD Assigning general validity to the results presented above would be unjustified,
because some properties of the benchmark datasets must be taken into account as
possible biases.

normalization in 2nd dataset
4 changes: 3 additions & 1 deletion thesis/chapters/chapter06.tex
Expand Up @@ -2,6 +2,8 @@ \chapter{Conclusion}\label{ch:conclusion}


Conclusion goes here\dots Conclusion goes here\dots


cross-domain distribution is effect of semantic gap

\section{Future Work} \section{Future Work}


TBD why are some categories so difficult?
Binary file modified thesis/thesis.pdf
Binary file not shown.

0 comments on commit f5fc6e5

Please sign in to comment.