IDE Ranknet

This is the repository for the paper [IDE RankNet: Estimating the difficulty of visual search in an image](/paper/IDE RankNet.pdf).

Abstract

Estimating the difficulty of visual search in images is an interesting problem that can be applied to weakly supervised object localization and semi-supervised object classification, and has potential applications in object detection. In this paper, we propose a simple loss function based on learning to rank and apply it in an end-to-end multi-task neural network for estimating image difficulty scores. Our model achieves better results for predicting the ground-truth visual search difficulty scores produced by human annotators on PASCAL VOC 2012.
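The ranking part of the objective follows the usual RankNet idea: for a pair of images, the probability that one is more difficult than the other is modeled with a sigmoid over the difference of the predicted scores and trained with binary cross-entropy against the order given by the human difficulty scores. The snippet below is a minimal sketch of such a pairwise loss in PyTorch; the exact loss used in the paper may differ, and the function and tensor names are illustrative only.

```python
import torch
import torch.nn.functional as F

def pairwise_rank_loss(pred_i, pred_j, target_i, target_j):
    """RankNet-style pairwise loss (illustrative sketch).

    pred_*:   predicted difficulty scores, shape (batch,)
    target_*: ground-truth difficulty scores from the annotators, shape (batch,)
    """
    # Label is 1 when image i is harder than image j, 0 otherwise.
    label = (target_i > target_j).float()
    # P(i harder than j) is modeled as a sigmoid of the score difference,
    # so the loss reduces to binary cross-entropy on the raw difference.
    diff = pred_i - pred_j
    return F.binary_cross_entropy_with_logits(diff, label)
```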

Requirements

  1. Python 3.0+

  2. PyTorch 0.3

  3. Visdom

  4. OpenCV (cv2)

Installation

git clone https://github.com/Vipermdl/ideranknet

Details

  1. Dataset: The images from PASCAL VOC 2012 serve as the source of the difficulty annotations. The benchmark contains a total of 11,540 images covering 20 object categories (including airplanes, boats, cats, dogs, etc.), each labeled with category, outline, and bounding box. The difficulty annotation was collected on the crowd-sourcing platform CrowdFlower: 736 trusted annotators inspected each image and answered a visual search question, and the response time was converted into an image difficulty score. The benchmark authors applied a series of post-processing steps, such as removing outliers, to keep the scores reliable, which yields a notion of image difficulty close to the human visual level. Since the images in PASCAL VOC 2012 differ in background and in the number, size, density, and appearance of the objects they contain, the dataset is well suited to our visual search task. Following previous work, we split the dataset into training, validation, and test sets in a 2:1:1 ratio: 5,770 images are used for training, and the remaining images are used for validation and testing with MSE and Kendall's τ correlation coefficient as the metrics.

  2. Train and test: run python train.py

  3. Metrics: MSE and Kendall's τ correlation coefficient (a minimal evaluation sketch is given after this list)

  4. Results: an investigation of IDE RankNet with different settings on the test dataset (reported in the paper)
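As a rough illustration of the evaluation protocol described above, the sketch below splits per-image difficulty scores 2:1:1 and computes MSE and Kendall's τ between predicted and ground-truth scores. The splitting scheme, function names, and use of SciPy are assumptions for illustration, not the released code.

```python
import numpy as np
from scipy.stats import kendalltau

def split_2_1_1(scores, seed=0):
    """Split image indices into train/val/test at a 2:1:1 ratio.

    `scores` is assumed to be a 1-D array of per-image difficulty scores;
    the random permutation here is illustrative, not the split used in the paper.
    """
    rng = np.random.RandomState(seed)
    idx = rng.permutation(len(scores))
    n_train = len(scores) // 2          # ~5,770 of 11,540 images
    n_val = len(scores) // 4
    return idx[:n_train], idx[n_train:n_train + n_val], idx[n_train + n_val:]

def evaluate(pred, target):
    """Report the two metrics used in the paper: MSE and Kendall's tau."""
    pred, target = np.asarray(pred), np.asarray(target)
    mse = float(np.mean((pred - target) ** 2))
    tau, _ = kendalltau(pred, target)
    return mse, tau
```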

About

Image difficulty estimation with a multi-task RankNet
