GitHub - rbgirshick/DeepPyramid: Deep feature pyramids for various computer vision algorithms (DPMs, pyramid R-CNN, etc.)

DeepPyramid

DeepPyramid is a simple toolkit for building feature pyramids from deep convolutional networks. The DeepPyramid data structure is nearly identical to the HOG feature pyramid created by the featpyramid.m function in the voc-dpm code.

References

This code was used in our tech report about the relationship between deformable part models and convolutional networks.

@article{girshick14dpdpm,
    author    = {Ross Girshick and Forrest Iandola and Trevor Darrell and Jitendra Malik},
    title     = {Deformable Part Models are Convolutional Neural Networks},
    journal   = {CoRR},
    year      = {2014},
    volume    = {abs/1409.5403},
    url       = {http://arxiv.org/abs/1409.5403},
    year      = {2014}
}

Installation

Prerequisites
MATLAB (tested with 2014a on 64-bit Linux)
Caffe's prerequisites
Install Caffe (this is the most complicated part)
Follow the Caffe installation instructions
Let's call the place where you installed caffe $CAFFE_ROOT (you can run export CAFFE_ROOT=$(pwd))
Important: Make sure to compile the Caffe MATLAB wrapper, which is not built by default: make matcaffe
Important: Make sure to run cd $CAFFE_ROOT/data/ilsvrc12 && ./get_ilsvrc_aux.sh to download the ImageNet image mean
DeepPyramid has been tested with master and dev at the time of this writing
Get DeepPyramid
git clone https://github.com/rbgirshick/DeepPyramid.git
If you haven't installed R-CNN, you'll need to download its models
Copy R-CNN's non-finetuned ImageNet network <rcnnpath>/data/caffe_nets/ilsvrc_2012_train_iter_310k to <deeppyramidpath>/data/caffe_nets/ilsvrc_2012_train_iter_310k (or just create a symlink).

Usage

Run matlab from inside the DeepPyramid code directory
Add the matcaffe mex function to your path (addpath /path/to/caffe/matlab/caffe)
Run the demo demo_deep_pyramid

Uses

DeepPyramid can be used for implementing DPMs on deep convolutional network features, rather than HOG features. It can also be used whenever you need a dense multiscale pyramid of image features.

Caveats

The implementation is designed to be simple and as a result is very inefficient. There are a variety of ways to speed it up, and they will be done in the future. For now, it takes about 0.5 to 0.6 seconds to compute a feature pyramid on an NVIDIA Titan GPU, which is acceptable.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
data/caffe_nets		data/caffe_nets
model-defs		model-defs
000084.jpg		000084.jpg
LICENSE		LICENSE
README.md		README.md
cached_pyra.mat		cached_pyra.mat
deep_pyramid.m		deep_pyramid.m
deep_pyramid_add_padding.m		deep_pyramid_add_padding.m
deep_pyramid_cache_wrapper.m		deep_pyramid_cache_wrapper.m
demo_deep_pyramid.m		demo_deep_pyramid.m
green_colormap.mat		green_colormap.mat
im_to_pyra_coords.m		im_to_pyra_coords.m
init_cnn_model.m		init_cnn_model.m
pyra_to_im_coords.m		pyra_to_im_coords.m

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DeepPyramid

References

Installation

Usage

Uses

Caveats

About

Releases

Packages

Languages

License

rbgirshick/DeepPyramid

Folders and files

Latest commit

History

Repository files navigation

DeepPyramid

References

Installation

Usage

Uses

Caveats

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages