FLIR/Detect-Track

===============================================================================

FLIR fork

Tested with Matlab R2017b, Ubuntu 16.04, CUDA 8.0, and a GTX 1080 Ti GPU.

Matlab notes

  1. You must purchase the Parallel Computing Toolbox.
  2. Matlab mex warns that it expects gcc 4.9 while Ubuntu 16.04 ships gcc 5.4.0. Ignore these warnings. You will explicitly select libstdc++ using LD_PRELOAD, and you will launch matlab from a command shell where LD_PRELOAD is exported.
  3. Start a command shell. In the shell (or in .bashrc) run the following: export LD_PRELOAD=$LD_PRELOAD:/usr/lib/x86_64-linux-gnu/libstdc++.so.6:/usr/local/lib/libprotobuf.so.13
  4. Launch matlab from the command line. You might need to create a symbolic link: ln -s /usr/local/MATLAB/R2017b/bin/matlab /usr/bin/matlab
  5. Matlab bundles its own OpenCV binaries (version 3.1.0) into R2017b. This is awful, because caffe-rfcn builds with the system version of OpenCV while matlab mex builds with its own, so millions of link errors result when you try to run the matlab mex code. The workaround used here is to first build OpenCV 3.1.0 and link caffe-rfcn against it.
  6. Matlab needs a recent libprotobuf (3.3.1 works).
  7. When you run a test or training script a second time, matlab crashes with: [libprotobuf ERROR google/protobuf/descriptor_database.cc:57] File already exists in database: caffe.proto. Very annoying. Possible fix: compile protobuf with ./configure --enable-shared --with-pic (a build sketch follows this list).
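
If you need to build libprotobuf yourself (notes 6 and 7), a minimal sketch follows, assuming the standard autotools flow and the v3.3.1 tag (tag name is an assumption; 3.3.1 is the version noted above as working):

   git clone https://github.com/google/protobuf.git
   cd protobuf
   git checkout v3.3.1          # tag name assumed; 3.3.1 is known to work (note 6)
   ./autogen.sh
   ./configure --enable-shared --with-pic
   make -j8
   sudo make install
   sudo ldconfig                # refresh the linker cache so libprotobuf.so.13 is found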

Ubuntu notes

  1. These instructions assume your system already has all required caffe dependencies installed.

OpenCV 3.1.0 build

As mentioned above, we need to build OpenCV 3.1.0 to match the binaries bundled with Matlab.

  1. Two modules (cudalegacy and stitching) don't compile, so disable them.
  2. Don't install in /usr/local.
   cd ~/git
   mkdir opencv_3_1_0
   cd opencv_3_1_0
   git clone https://github.com/opencv/opencv.git
   cd opencv
   git checkout tags/3.1.0
   cd ..
   rm opencv/modules/cudalegacy/CMakeLists.txt
   rm opencv/modules/stitching/CMakeLists.txt
   mkdir install
   mkdir build
   cd build
   cmake -DCMAKE_INSTALL_PREFIX:PATH=$HOME/git/opencv_3_1_0/install ../opencv
   make -j8
   make install
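
A quick sanity check that the install landed where the caffe-rfcn build below expects it (path from the cmake line above):

   ls $HOME/git/opencv_3_1_0/install/lib/libopencv_core.so.3.1*   # should list the 3.1.0 core library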

Building caffe-rfcn

  1. To download the code and link the two repositories, run clone.sh.
  2. The repository assumes really old versions of cudnn, so build without cudnn.
  3. Build with Makefile.config and Makefile instead of cmake, because cmake doesn't know how to compile the caffe mex file (the mex include and link directories and flags are missing from CMakeLists.txt).
  4. Makefile.config needs to be updated to know where you built OpenCV (headers and libs); a sketch of the edits follows the build commands below.
cd ~/git/caffe-rfcn
make -j8
make pycaffe
make matcaffe
  5. Note that running make mattest will fail. It should at least load a prototxt file, without getting link errors, before failing (and crashing matlab).
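
The Makefile.config edits from items 2 and 4 might look roughly like this (a sketch only: the OpenCV path matches the build above, and the Matlab location assumes the default install):

cd ~/git/caffe-rfcn
# leave USE_CUDNN commented out: the repo targets an old cuDNN API (item 2)
# append the OpenCV and Matlab locations (paths assumed; adjust to your layout)
cat >> Makefile.config <<'EOF'
OPENCV_VERSION := 3
INCLUDE_DIRS += $(HOME)/git/opencv_3_1_0/install/include
LIBRARY_DIRS += $(HOME)/git/opencv_3_1_0/install/lib
MATLAB_DIR := /usr/local/MATLAB/R2017b
EOF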

Getting data

  1. You need the ILSVRC2015 data, both DET and VID. These are huge: 48 GB and 86 GB respectively. It takes many days to get them from ImageNet, so there are copies on the FLIR S3 bucket: flir-data/imagenet-det and flir-data/imagenet-vid.

    From the detect-track root directory:

tar -xvf /mnt/data/flir-data/imagenet-vid/ILSVRC2015_VID.tar.gz
tar -xvf /mnt/data/flir-data/imagenet-vid/ILSVRC2015_DET.tar.gz
ln -s /mnt/data/flir-data/imagenet-vid/ILSVRC2015/Annotations data/ILSVRC/Annotations
ln -s /mnt/data/flir-data/imagenet-vid/ILSVRC2015/Data data/ILSVRC/Data
ln -s /mnt/data/flir-data/imagenet-vid/ILSVRC2015/ImageSets data/ILSVRC/ImageSets
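
A quick way to confirm the links resolve (paths from the commands above; -L follows the symlinks):

ls -L data/ILSVRC/Annotations data/ILSVRC/Data data/ILSVRC/ImageSets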

Getting proposals and pretrained models

  1. You can get these from the author's website or the FLIR S3 bucket: flir-data/detect-track

    From the detect-track root directory:

     unzip /mnt/data/flir-data/detect-track/proposals/RPN_proposals_DET.zip -d data/ILSVRC/Data/DET/RPN_proposals
     unzip /mnt/data/flir-data/detect-track/proposals/RPN_proposals_VID_train.zip -d data/ILSVRC/Data/VID_train/RPN_proposals
     unzip /mnt/data/flir-data/detect-track/proposals/RPN_proposals_VID_val.zip -d data/ILSVRC/Data/VID_val/RPN_proposals
    
  2. Pretrained models are downloaded automatically if they don't already exist in the models directory, via check_dl_model(model.net_file);

Setting up detect-track

  1. Start matlab. Don't forget LD_PRELOAD...
  2. Compile mex code by running rfcn_build.m.
  3. Set matlab paths by running startup.m.
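
Put together, a minimal launch sequence might look like this (the checkout location is an assumption; matlab's -r flag runs the given commands at startup):

export LD_PRELOAD=$LD_PRELOAD:/usr/lib/x86_64-linux-gnu/libstdc++.so.6:/usr/local/lib/libprotobuf.so.13
cd ~/git/detect-track                # assumed checkout location
matlab -r "rfcn_build; startup"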

Once you get this far, proceed to the Training and Testing sections below.

===============================================================================

Detect to Track and Track to Detect (the original README)

This repository contains the code for our ICCV 2017 paper:

Christoph Feichtenhofer, Axel Pinz, Andrew Zisserman
"Detect to Track and Track to Detect"
in Proc. ICCV 2017
  • This repository also contains results for a ResNeXt-101 backbone network that performs slightly better (81.6% mAP on ImageNet VID val) than the ResNet-101 backbone (80.0% mAP) used in the conference version of the paper

  • This code builds on the original Matlab version of R-FCN

  • We are preparing a Python version of D&T that will support end-to-end training and inference of the RPN, Detector & Tracker.

If you find the code useful for your research, please cite our paper:

    @inproceedings{feichtenhofer2017detect,
      title={Detect to Track and Track to Detect},
      author={Feichtenhofer, Christoph and Pinz, Axel and Zisserman, Andrew},
      booktitle={International Conference on Computer Vision (ICCV)},
      year={2017}
    }

Requirements

The code was tested on Ubuntu 14.04, 16.04 and Windows 10 using NVIDIA Titan X or Z GPUs.

If you have questions regarding the implementation please contact:

Christoph Feichtenhofer <feichtenhofer AT tugraz.at>

================================================================================

Setup

  1. Download the code: git clone --recursive https://github.com/feichtenhofer/detect-track
     • This will also download a modified version of the Caffe deep learning framework. In case of any issues, please follow the installation instructions in the corresponding README as well as on the Caffe website.
  2. Compile the code by running rfcn_build.m.
  3. Edit the file get_root_path.m to adjust the models and data paths.
     • Download the ImageNet VID dataset from http://image-net.org/download-images
     • Download pretrained model files and the RPN proposals, linked below, and unpack them into your models/data directory.
     • In case the models are not present, the function check_dl_model will attempt to download the model to the respective directories.
     • In case the RPN files are not present, the function download_proposals will attempt to download and extract the proposal files to the respective directories.

Training

  • You can train your own models on ImageNet VID as follows:
    • script_Detect_ILSVRC_vid_ResNet_OHEM_rpn(); to train the image-based Detection network.
    • script_DetectTrack_ILSVRC_vid_ResNet_OHEM_rpn(); to train the video-based Detection & Tracking network.

Testing

  • The scripts above have subroutines that test the learned models after training. You can also test our trained, final models, available for download below. We provide three testing functions that work with different numbers of frames at a time (i.e. processed by one GPU during the forward pass):
    1. rfcn_test(); to test the image-based Detection network.
    2. rfcn_test_vid(); to test the video-based Detection & Tracking network with 2 frames at a time.
    3. rfcn_test_vid_multiframe(); to test the video-based Detection & Tracking network with 3 frames at a time.
  • Moreover, we provide multiple testing network definitions that can be used for interesting experiments, for example:
    • test_track.prototxt is the simplest form of D&T testing.
    • test_track_reg.prototxt is a D&T version that additionally regresses the tracking boxes before performing the ROI tracking. This procedure produces tracks that tightly encompass the underlying objects, whereas the function above tracks the proposal region (and therefore also the background area).
    • test_track_regcls.prototxt is a D&T version that additionally classifies the tracked region and computes the detection confidence as the mean of the detection score from the current frame and the detection score of the tracked region in the next frame. This method produces better results, especially when the temporal distance between the frames becomes larger and more complementary information can be integrated from the tracked region.

Results on ImageNet VID

  • The networks are trained as described in the paper, i.e. on an intersection of the ImageNet object detection from video (VID) dataset, which contains 30 classes in 3862 training videos, and the ImageNet object detection (DET) dataset (only using the data from the 30 VID classes). Validation results on the 555 videos of the ImageNet VID validation set are shown below.

Method          Backbone net  Test method                 mAP (%)
Detect          ResNet-50     test.prototxt               72.1
Detect & Track  ResNet-50     test_track.prototxt         76.5
Detect & Track  ResNet-50     test_track_regcls.prototxt  76.7
Detect          ResNet-101    test.prototxt               74.1
Detect & Track  ResNet-101    test_track.prototxt         79.8
Detect & Track  ResNet-101    test_track_regcls.prototxt  80.0
Detect          ResNeXt-101   test.prototxt               75.9
Detect & Track  ResNeXt-101   test_track.prototxt         81.4
Detect & Track  ResNeXt-101   test_track_regcls.prototxt  81.6

Trained models

Data

Our models were trained using region proposals extracted using a Region Proposal Network that is trained on the same data as D&T. We use the RPN from craftGBD and provide the extracted proposals for training and testing on ImageNet VID and the DET subsets below.

Pre-computed object proposals for ImageNet VID and the DET subsets are available from the author's website and the FLIR S3 bucket (see Getting proposals and pretrained models above).
