Demo of Fast Multi-frame Stereo Scene Flow with Motion Segmentation (CVPR 2017)

This application provides executable binaries for demonstrating our stereo scene flow method in the following paper.
If you use our algorithm, please cite our CVPR 2017 paper.
We do not plan to release the original source code of the algorithm due to license issues.

@InProceedings{Taniai2017,
  author    = {Tatsunori Taniai and
               Sudipta N. Sinha and
               Yoichi Sato},
  title     = {{Fast Multi-frame Stereo Scene Flow with Motion Segmentation}},
  booktitle = {IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
  pages     = {6891--6900},
  year      = {2017}
}

Note that the software will produce slightly different results than those reported in the paper
because it uses a different feature tracking method.
When comparing against results of our method on the KITTI and Sintel datasets, please use the original results
available from our project site (see Data section below).


---------
Download :
---------
You can download the binaries at https://github.com/t-taniai/FSF_CVPR2017_Demo/releases
Note that this software requires a CPU that supports SSE instructions.


---------
Usage    :
---------
Run the demo.bat file. The algorithm runs on three example sequences:
1) Sequence "alley_1" from Sintel training (in dataset/Sintel/training/...)
2) Sequence "000000" from KITTI training (in dataset/kitti_scene_flow_multiview/training/...)
3) A custom sequence without ground truth data (in dataset/Custom/...)

In the "results" directory, you will get the following data for each scene.
/data ---------- /disp_0/frame_****.png  : Encoded disparity map of first frame.
      ---------- /disp_1/frame_****.png  : Encoded disparity map of second frame.
      ---------- /flow/frame_****.png    : Encoded flow map (see below for decoding).
      ---------- /label/frame_****.png   : Binary motion segmentation

/debug --------- Visualization of results (use -vizSaveFlags to choose which steps we visualize).
                 This is disabled if -debug 0 is given.

log.txt -------- Log of command-line messages.

timestamp.txt -- Running times of individual steps of the algorithm in each frame.
                 Times for logging and saving data are excluded from time stamps.

---------
Options  :
---------
Some important commandline options are explained below.

General parameters
   -datasetType    [string] {sintel,kitti,custom}
   -datasetDir     [string] Path to the root directory of a dataset
   -targetName     [string] 000000-000199 (kitti) or {alley_1,alley_2,...} (sintel)
   -saveAsKittiFormat [int] 1: Save as KITTI format, 0: Save as common format
   -debug             [int] 1: Enable debug mode (output visualization), 0: Disable debug mode
   -vizSaveFlags      [int] Flags to output results of individual stages.
   -vizOutputSep      [int] 1: Output visualization as separate image files, 0: Output concatenated images.
See also demo.bat for examples.
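
As a rough illustration of how the general parameters above combine, the following Python sketch
assembles an argument string from option names and values. The option names are the ones listed above;
the helper build_args, the chosen values, and the dataset root path are assumptions for illustration only.
The executable name is deliberately left out, so please refer to demo.bat for the actual invocation.

```python
def build_args(options):
    """Turn {"datasetType": "kitti", ...} into ["-datasetType", "kitti", ...]."""
    args = []
    for name, value in options.items():
        args.append("-" + name)   # every option is written as "-name value"
        args.append(str(value))
    return args


# Hypothetical settings for the KITTI example sequence.
# The dataset root below is an assumption based on the example layout.
kitti_options = {
    "datasetType": "kitti",
    "datasetDir": "dataset/kitti_scene_flow_multiview",
    "targetName": "000000",
    "saveAsKittiFormat": 1,
    "debug": 0,
}

print(" ".join(build_args(kitti_options)))
```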

Stereo parameters
   -ndisp             [int] Defines disparity range as [0, ndisp-1] at full scale.
                            If ndisp < 0 for Sintel, the program tries to read the value from Sintel/training/info/.../disp.txt
   -sgmScale        [float] Image resolution rate at which SGM stereo is performed.

Optical flow parameters
   -sgmflowScale    [float] Image resolution rate at which SGM flow is performed.

Segmentation parameters
   -occThresh (tau_ncc)    [float] Threshold in (0, 1) for classifying TNCC values to FG and BG.
   -segColorW (lambda_col) [float] Weight of color likelihood terms.


---------
Format   :
---------
For KITTI scenes (when "-datasetType kitti" or "-saveAsKittiFormat 1"),
the data format is the same as KITTI's submission format, that is,

Disparity image: 16-bit 1-channel PNG where intensity I = 256 * disparity, and I = 0 means an "invalid" disparity.
Flow image     : 16-bit 3-channel PNG where the intensities of the RGB channels represent the following values.
                 R = 64 * u + 2^15
                 G = 64 * v + 2^15
                 B = 1 if a pixel has a valid flow and 0 if invalid

For other scenes (when "-saveAsKittiFormat 0"), the scaling factor of disparity is changed from 256 to 64.
Example MATLAB scripts are included in the results data mentioned below.
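
For readers who do not use MATLAB, the decoding can be sketched per pixel in Python as below.
This is a minimal sketch, not the scripts shipped with the results data. It assumes the 16-bit
channel values have already been read from the PNG as integers, and it follows the KITTI
development kit convention u = (R - 2^15) / 64, v = (G - 2^15) / 64. The function and constant
names are hypothetical.

```python
KITTI_DISP_SCALE = 256.0   # disparity scaling in KITTI format
COMMON_DISP_SCALE = 64.0   # disparity scaling in the common (non-KITTI) format
FLOW_SCALE = 64.0
FLOW_OFFSET = 2 ** 15


def decode_disparity(intensity, scale=KITTI_DISP_SCALE):
    """Return the disparity in pixels, or None if the pixel is invalid (I == 0)."""
    if intensity == 0:
        return None
    return intensity / scale


def decode_flow(r, g, b):
    """Return the flow (u, v) in pixels, or None if the pixel is invalid (B == 0)."""
    if b == 0:
        return None
    u = (r - FLOW_OFFSET) / FLOW_SCALE
    v = (g - FLOW_OFFSET) / FLOW_SCALE
    return (u, v)
```

For example, decode_disparity(512) gives 2.0 in KITTI format, and decode_flow(32768, 32768, 1)
gives (0.0, 0.0), i.e. a valid pixel with zero motion.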


--------
Data    :
--------
Please visit our project site (https://taniai.space/projects/cvpr17_fsf/) where you can find
1) Results of our method on the Sintel and KITTI datasets, as well as results of PRSM and OSF on Sintel.
   Visualization videos of Sintel results by our method, PRSM, and OSF are also included in the archive.
   The archive file also contains MATLAB scripts of our data format and notes on Sintel evaluations.
2) Ground truth motion segmentation masks of Sintel dataset (our creation).

Note that "cave_2" and "sleeping_1" in Sintel are excluded from evaluations
because camera parameters K are not constant (due to zooming) in those scenes.


--------
History :
--------
11/27/2018  v1.0 Released the demonstration software.