Executable binaries of Joint Recovery of Dense Correspondence and Cosegmentation in Two Images (CVPR 2016)
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
Readme.txt

Readme.txt

This application provides executable binaries for a demonstration of the following paper.
If you use our algorithm, please cite our CVPR 2016 paper.
We do not plan to release our original source code of the algorithm due to the lisence issue.
Our dataset and results are available at http://taniai.space/projects/cvpr16_dccs/
We also provide an evaluation and visualization kit at https://github.com/t-taniai/TSS_CVPR2016_EvaluationKit

@InProceedings{Taniai2016,
    author = {Tatsunori Taniai and Sudipta N. Sinha and Yoichi Sato},
    title = {{Joint Recovery of Dense Correspondence and Cosegmentation in Two Images}},
    booktitle = {IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
    month = {June},
    year = {2016}
}


--------
Download:
--------
You can download the binaries at https://github.com/t-taniai/TSS_CVPR2016_Demo/releases


-----
Usage:
-----
Run the demo.bat file.
It first runs feature extraction and then runs the main algorithm.
There are following input and output data.

/data (input)
    - image1.png : One of input images.
    - image2.png : One of input images.
    - flow1.flo : Ground truth floating-point 2D flow map (from image1 to image2). Optional.
    - flow2.flo : Ground truth floating-point 2D flow map (from image2 to image1). Optional.
    - mask1.png : Ground truth foreground mask for image1. Optional.
    - mask2.png : Ground truth foreground mask for image2. Optional.

/feature (output)
    - Output files of feature extraction.

/out (output)
    - flow1.flo : Estimated flow map (from image1 to image2).
    - flow2.flo : Estimated flow map (from image2 to image1).
    - mask1.png : Estimated foreground mask for image1.
    - mask2.png : Estimated foreground mask for image2.
    - options.txt : List of option settings. (Some of them are explained below).
    - summary_f1.png : Warped image results at each layer. (Flip between f1 and f2 to verify alignment).
    - summary_f2.png : Original images
    - summary_f3.png : Warped image results with segmentation at each layer.
    - summary1.png : Results of image1. See explanations below.
    - summary2.png : Results of image2. See explanations below.

When -doPerPixelRefinement is on (1), there will be four more files.
Files (flow and mask) with _01 _00 are results before and after the final refinement, respectively.


---------------
Visualization :
---------------
Summary images (summary1 and 2) show results of finest to coarsest layers from left to right columns.
1st row : Superpixel hierarchy.
2nd row : Estimated flow maps.
3rd row : Ground truth flow maps (if provided in the data directory).
4th row : End-point flow error maps (if ground truth is provided in the data directory).
5th row : Estimated warped images. 
6th row : Estimated segmentation results.
7th row : Ground truth segmentation (if provided in the data directory).

---------------------
Process of algorithm :
---------------------
[DenseFeatureExtract]
1. Extract dense feature pyramids and save them as binary files.
   This process is skipped if feature files already exist.
   (By preparing your own feature files, the later algorithm will be proecssed with your features).
2. Estimate ratio maps (foreground/background clues) and initial flow maps
   using region featurs computed by bag-of-visual-words codebook of the local features.
[CosegMatching]
3. Load feature files, initial flow maps and ratio maps.
4. Estimate initial segmentation using the ratio maps,
   and flow maps are also initialized by the loaded data.
5. Perform the two-pass optimizaton algorithm.
   If "-doPerPixelRefinement" is on, the final refinement is performed before saving results.

------------------
Option parameters :
------------------
Options of feature extraction (DenseFeatureExtract.exe):
    -maxImageDimension    [int]    : The maximum image dimension (mostly width) is seto to this value.
    -forceImageDimension  [1 or 0] : If 0, images are resized only if they are larger than the size.

Options of the main algorithm (CosegMatching.exe):
    -doPerPixelRefinement [1 or 0] : Use the finest pixel layer or not. This refinement takes time.
    -featureType          [string] : Extension of feature files. You can use your features. See below.
    -matchingMetric       [1 or 2] : Norm of Equation 4 in the paper.
    -matchingEngW         [float]  : lambda_flo in Equation 2.
    -matchingEngOcc       [float]  : lambda_occ in Equation 3.
    -matchingEngTrunc     [float]  : tau_D in Equation 4.

--------------------
Feature file format :
--------------------
The feature files (image1.hogc and image2.hogc in /feature) are structured as follows.
The core algorithm assumes 3 levels of pyramid,
but only the feature vectors of the finest level are used.
When you make your own feature files,
the image size of the finest level should be the same with the settings of DenseFeatureExtract.


[int32] : Width of the original image.
[int32] : Height of the original image.
[int32] : Number of image pyramid levels.
[int32] : Size of feature patches (not refered in the main algorithm).
[int32] : Dimension of feature vectors (dim).
for ( each pyramid level in coase to fine order )
{
    [int32] : Number of feature vectors in this level.
    [int32] : Width of this level (wk).
    [int32] : Height of this level (hk).
    [float] : Scale of this level.
    
    for ( each feature vector in this level )
    {
        [float] : Normalized x coordinate. (int)(x * wk) is x coodinate in this level.
        [float] : Normalized y coordinate. (int)(y * hk) is y coodinate in this level.
        [float] : Sum of absolute gradient of paches (feature reliability). Not refered by default.
        [float array] : Feature vector data (float array[dim]).
    }
}


--------
History:
--------

11/02/2016 v1.0 Released the demonstration binaries.