Ego-Object Discovery

Object Discovery Algorithm on Egocentric Images. This semi-supervised algorithm is inspired by the work in [1], and it offers the following contributions: 1) Use of a powerful CNN-based feature extraction. 2) Refill strategy by the use of a previously acquired knowledge base with image labels (bag of refill) for creating purer and more compact clusters. 3) SVM 'Object' VS 'No Object' Filtering strategy for trimming most of the FP instances produced by any object detector.

If you use this code, please cite the following papers:

Bolaños, M. & Radeva, P. (2015). “Ego-object discovery”. Pre-print: http://arxiv.org/abs/1504.01639

Bolaños, M., Garolera, M., & Radeva, P. (2015). “Object Discovery using CNN Features in Egocentric Videos”. 
In 7th Iberian Conference on Pattern Recognition and Image Analysis (IbPRIA) (in press).

The dataset published in the most recent work can be found here:

http://www.ub.edu/cvub/member/marc-bolanos/

Usage:

For a single object discovery execution: 
	- Modify the parameters in loadParameters.m
	- Run main.m

For a battery of tests: 
	- Comment the first line in main.m
	- Change the parameters in tests_battery.m and run it.

If you want to use the Multiscale Combinatorial Grouping (MCG) Objectness [4]:
	- Download from the authors page: http://www.eecs.berkeley.edu/Research/Projects/CS/vision/grouping/mcg/
	- Configure installation path in loadParameters.m

If you want to use the CNN features [7] extraction:
	- Install Caffe: http://caffe.berkeleyvision.org/
	- Configure installation path in loadParameters.m
	- Download the "bvlc_reference_caffenet.caffemodel" network model.
	- Copy network model into Object_Discovery/Caffe Src/bvlc_reference_caffenet

Explanation and additional features:

This function is inspired by the pipeline proposed by 
   	K. Grauman in [1] with the following modifications:
   		- Added Ferrari's Objectness [2], BING Objectness [3],
           		Multiscale Combinatorial Grouping (MCG) [4] and
           		Selective Search [6].
       	- Added iterative Scene-Object discovery by turns.
       	- Added Scene Awareness to the set of features.
       	- Added Scene Score to the set of easiness scores.
       	- Added Dimensionality Reduction based on LSH.
       	- Added Locality Sensitive Hashing, K-Means, LSK+K-Means and 
           		Ward instead of Complete-Link clustering for the grouping 
           		of the easiest instances.
       	- Added clustering by levels (using a different set of 
           		features at each of them.
       	- Adaptable for huge datasets (e.g. SenseCam or Narrative 
           		lifelogging data).
       	- Added CNN features [5,7] extraction for object candidates 
           		and scenes.
       	- Added PCA dimensionality reduction on easy objects.
       	- Added initial ObjVsNoObj RBF-SVM classifier for discarding
		the majority of No Objects.

References:

   [1] Lee, Yong Jae, and Kristen Grauman. "Learning the easy things 
       first: Self-paced visual category discovery." Computer Vision 
       and Pattern Recognition (CVPR), 2011 IEEE Conference on. IEEE, 
       2011.
   [2] Alexe, Bogdan, Thomas Deselaers, and Vittorio Ferrari. "What 
       is an object?." Computer Vision and Pattern Recognition (CVPR), 
       2010 IEEE Conference on. IEEE, 2010.
   [3] Cheng, Ming-Ming, et al. "BING: Binarized normed gradients for 
       objectness estimation at 300fps." IEEE CVPR. 2014.
   [4] Arbeláez, Pablo, et al. "Multiscale Combinatorial Grouping." 
       CVPR, 2014.
   [5] Krizhevsky, Alex, Ilya Sutskever, and Geoffrey E. Hinton. 
       "Imagenet classification with deep convolutional neural 
       networks." Advances in neural information processing systems. 
       2012.
   [6] Uijlings, Jasper RR, et al. "Selective search for object 
       recognition." International journal of computer vision 104.2 
       (2013): 154-171.
   [7] Jia, Yangqing, et al. "Caffe: Convolutional architecture for 
       fast feature embedding." Proceedings of the ACM International 
       Conference on Multimedia. ACM, 2014.

Name		Name	Last commit message	Last commit date
Latest commit History 162 Commits
Caffe Src		Caffe Src
Concepts_Discovery		Concepts_Discovery
DimensionalityReduction		DimensionalityReduction
K_Means		K_Means
Locality Sensitive Hashing		Locality Sensitive Hashing
MeanShift		MeanShift
Objectness BING		Objectness BING
Objectness Ferrari		Objectness Ferrari
Objectness SelectiveSearch		Objectness SelectiveSearch
Objects Recognition		Objects Recognition
OpticalFlow		OpticalFlow
PHOG		PHOG
SIFT		SIFT
SIFTflow		SIFTflow
Superpixels		Superpixels
Turbo-Pixels		Turbo-Pixels
Utils		Utils
libsvm-3.18		libsvm-3.18
.gitignore		.gitignore
LICENSE.txt		LICENSE.txt
README.md		README.md
loadParameters.m		loadParameters.m
loadParameters_ConceptDiscovery.m		loadParameters_ConceptDiscovery.m
loadParameters_MSRC_tests.m		loadParameters_MSRC_tests.m
loadParameters_PASCAL_tests.m		loadParameters_PASCAL_tests.m
main.m		main.m
tests_battery.m		tests_battery.m
tests_battery1.m		tests_battery1.m
tests_battery2.m		tests_battery2.m
tests_battery3.m		tests_battery3.m
tests_battery4.m		tests_battery4.m
tests_battery5.m		tests_battery5.m
tests_battery_MSRC.m		tests_battery_MSRC.m
tests_battery_PASCAL.m		tests_battery_PASCAL.m

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Ego-Object Discovery

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Ego-Object Discovery

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages