GitHub - shrubb/integral-layer: Integral Image Layer for Deep Neural Networks

Integral Image Layer for Deep Neural Networks

In a classical paper [1] from 2001, Viola and Jones popularized the use of large rectangular image filters in order to obtain features for image recognition. The use of very large filters allowed Viola and Jones to compute features over very large receptive fields without blowing up the computation cost. For the next 10+ years, such features remained the staple of fast computer vision (e.g. [2]). The advent of deep learning made the use of integral-image features far less popular. Currently, state-of-the-art architectures invariably relying on very deep architectures. In these architectures sufficiently large receptive fields are obtained via the use of downsampling with subsequent upsampling [3] or via dilated convolutions [4]. All such tricks however have their downsides and usually necessitate the use of very deep networks.

The goal of this project is to implement an integral image-based filtering as a layer for deep architectures in Torch deep learning package, and to evaluate it for the task of learning very fast object detectors (as an alternative to e.g. [5]) and semantic segmentation systems (as an alternative to e.g. [3,4]). The hope is to obtain much shallower architectures, which at least for simple classes (e.g. roadsigns or upright pedestrians) will approach the performance of much deeper ones.

The project is supervised by Victor Lempitsky at Skoltech, Moscow, Russia.

References

[1] Viola, Paul, and Michael J. Jones. "Robust real-time face detection." International journal of computer vision 57.2 (2004): 137-154.

[2] Dollár, Piotr, Serge Belongie, and Pietro Perona. "The Fastest Pedestrian Detector in the West." BMVC. Vol. 2. No. 3. 2010.

[3] Ronneberger, Olaf, Philipp Fischer, and Thomas Brox. "U-net: Convolutional networks for biomedical image segmentation." International Conference on Medical Image Computing and Computer-Assisted Intervention. Springer International Publishing, 2015.

[4] Yu, Fisher, and Vladlen Koltun. "Multi-scale context aggregation by dilated convolutions." arXiv preprint arXiv:1511.07122 (2015).[5] W. Liu, D. Anguelov, D. Erhan, C. Szegedy, and S. Reed. SSD: Single shot multibox detector. ECCV 2016

[5] W. Liu, D. Anguelov, D. Erhan, C. Szegedy, and S. Reed. SSD: Single shot multibox detector. ECCV 2016

Name		Name	Last commit message	Last commit date
Latest commit History 170 Commits
C		C
Models		Models
ModelsClassification		ModelsClassification
Scripts		Scripts
Tests		Tests
extern		extern
.gitignore		.gitignore
.gitmodules		.gitmodules
Classification-template.ipynb		Classification-template.ipynb
Flip.lua		Flip.lua
IntegralSmartNorm.lua		IntegralSmartNorm.lua
IntegralToBorder.lua		IntegralToBorder.lua
IntegralVarScale.lua		IntegralVarScale.lua
IntegralZeroPadding.lua		IntegralZeroPadding.lua
Queue.lua		Queue.lua
README.md		README.md
Segmentation-CS-small-5layers-LR150.ipynb		Segmentation-CS-small-5layers-LR150.ipynb
Segmentation-CS.ipynb		Segmentation-CS.ipynb
Segmentation-GPU-CS-template-disparity-hpc3.ipynb		Segmentation-GPU-CS-template-disparity-hpc3.ipynb
Segmentation-GPU-CS-template-gsOK.ipynb		Segmentation-GPU-CS-template-gsOK.ipynb
Segmentation-GPU-CS-template.ipynb		Segmentation-GPU-CS-template.ipynb
Segmentation-GPU-NYUv2-template.ipynb		Segmentation-GPU-NYUv2-template.ipynb
Segmentation-GPU-SB-template.ipynb		Segmentation-GPU-SB-template.ipynb
Segmentation-kitti.ipynb		Segmentation-kitti.ipynb
Time.ipynb		Time.ipynb
WindowDebugger.lua		WindowDebugger.lua
cifar10.lua		cifar10.lua
cityscapes.lua		cityscapes.lua
coco-stuff-pascal.lua		coco-stuff-pascal.lua
coco-stuff2017.lua		coco-stuff2017.lua
correct-given-disp.py		correct-given-disp.py
count-operations.lua		count-operations.lua
get-new-disp.py		get-new-disp.py
nyuv2.lua		nyuv2.lua
stanfordbkgd.lua		stanfordbkgd.lua
sunrgbd.lua		sunrgbd.lua
voc2012-sbd.lua		voc2012-sbd.lua

shrubb/integral-layer

Folders and files

Latest commit

History

Repository files navigation

Integral Image Layer for Deep Neural Networks

References

About

Resources

Stars

Watchers

Forks

Languages