
CCCP pooling layer #498

Closed
wants to merge 3 commits into from

Conversation

mavenlin
Contributor

This pull request adds a cascadable cross channel parametric (CCCP) pooling layer.
The output feature maps of this layer are a parametric recombination of the input feature maps.
It is used with a ReLU layer on top of a convolutional layer. Each patch of the convolution input is mapped to its feature vector in the output feature map through a nonlinear function (a multilayer perceptron).
The function is equivalent to a 1x1 convolution. However, if the convolution layer is used for this purpose, it performs an unnecessary im2col operation.
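The equivalence to 1x1 convolution can be sketched in a few lines of NumPy (a minimal illustration; the shapes and names here are hypothetical, not taken from the PR):

```python
import numpy as np

# A CCCP layer recombines the C input channels at every spatial position
# with a shared weight matrix -- exactly what a 1x1 convolution does.
C_in, C_out, H, W = 96, 64, 8, 8
x = np.random.randn(C_in, H, W)
weight = np.random.randn(C_out, C_in)
bias = np.random.randn(C_out)

# 1x1 convolution via an explicit loop over spatial positions
out_conv = np.empty((C_out, H, W))
for i in range(H):
    for j in range(W):
        out_conv[:, i, j] = weight @ x[:, i, j] + bias

# The same computation as a single matrix multiply over flattened pixels.
# No im2col buffer is needed, because each "patch" is just one pixel.
out_mm = (weight @ x.reshape(C_in, H * W) + bias[:, None]).reshape(C_out, H, W)

assert np.allclose(out_conv, out_mm)
```

Since the patch is a single pixel, im2col would only copy the input into an identical buffer, which is the overhead the dedicated layer avoids.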

The CCCP layer is used to implement the idea in this paper: Network In Network, which achieved the best performance on the CIFAR-10 and CIFAR-100 datasets, as shown here.

I chose the long name cascadable cross channel parametric pooling because I wanted the abbreviation to match Союз Советских Социалистических Республик (the Union of Soviet Socialist Republics), which is just cool.

Please don't merge yet, tests and examples will be added soon.

@kloudkl
Contributor

kloudkl commented Jun 17, 2014

@mavenlin, it's cool that you finally open sourced the algorithm. Why didn't you link to this PR in your blog post on CCCP pooling?

I heard from a colleague who has been collaborating with your team that you were not using it in models trained for production environments. Why? Is it too slow compared to vanilla convolution?

@mavenlin
Contributor Author

@kloudkl I've open-sourced it for quite a while in my own forks of caffe and cuda-convnet.
CCCP is not especially computation-intensive compared to vanilla convolution, because you can view it as a 1x1 convolution.
I guess what you heard is that it was not used in last year's ImageNet competition.
I'll be releasing a model for ImageNet that is only 29MB (without fully connected layers the model can be very compact) but performs slightly better (60% accuracy) than AlexNet (it takes about 4~5 days to train on a GTX Titan).

@mavenlin
Contributor Author

@shelhamer CCCP is ready to be merged
My small imagenet model has been uploaded to gist.
https://gist.github.com/mavenlin/d802a5849de39225bcc6

@shelhamer
Member

Great! Please format your model gist for the model zoo as done in Sergey's example:

https://gist.github.com/sergeyk/034c6ac3865563b69e60

It should have a readme.md with commit + gist info, a solver prototxt, and the model prototxt. Include a URL to the model weights in the readme.md front matter if you choose to redistribute them.


@mavenlin
Contributor Author

@shelhamer It seems gist puts the files in alphabetical order; that's why my readme.md file was placed after deploy.prototxt. I removed the deploy file and now it works.

@shelhamer
Member

@mavenlin rather than introduce a whole new layer for this special case of convolution, I have included an optimization in the Caffe convolution layer in #1118.

Once it is merged, please update your NIN definition to use CONV layers instead of CCCP, although of course you can keep the layer names to make their purpose clear. You can include the commit ID in the front matter then too.

Thanks for the inaugural contribution to the model zoo!
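For reference, a CCCP-style layer can be expressed with the stock convolution layer roughly like this (a sketch in the prototxt syntax of the time; the layer names, bottom/top blobs, and num_output here are illustrative, not taken from the NIN model):

```
layers {
  name: "cccp1"        # keep the CCCP-style name to make its purpose clear
  type: CONVOLUTION    # stock convolution layer instead of a dedicated CCCP type
  bottom: "conv1"
  top: "cccp1"
  convolution_param {
    num_output: 96     # illustrative channel count
    kernel_size: 1     # 1x1 kernel: cross-channel recombination, no spatial extent
  }
}
```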

@mavenlin
Contributor Author

@shelhamer this is reasonable. BTW, I wonder if cuDNN can do better here if the num dimension is also parallelized.

@shelhamer
Member

@mavenlin the num dimension is parallelized in our cuDNN integration. Once you swap CONV layers into your model prototxt in place of the CCCP layers, you can time the Caffe and cuDNN implementations by setting the engine flag in the convolution_param.

Please close the PR once the model is updated to signal it's ready for a try. Thanks.
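The engine can be selected per layer; a hypothetical fragment (the num_output value is illustrative):

```
convolution_param {
  num_output: 96
  kernel_size: 1
  engine: CUDNN    # or CAFFE to time the native implementation
}
```

Running `caffe time` on the model with each setting should then give a direct comparison of the two engines.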


@shelhamer
Member

@mavenlin note that you can include the deploy.prototxt too if you just make the model zoo link include the anchor for the readme.md. That is, link to the gist file url instead of only the gist.

@shelhamer
Member

@mavenlin please update your model gist to switch the CCCP type layers to CONV for BVLC/caffe compatibility. I know there is interest in using your model. Thanks!

@mavenlin
Contributor Author

mavenlin commented Oct 1, 2014

@shelhamer Sorry for leaving this pending for so long. I've updated the prototxt. I'm currently overseas; I'll update the model once I get back.

@mavenlin mavenlin closed this Oct 1, 2014
@mavenlin
Contributor Author

mavenlin commented Oct 1, 2014

model updated.

@shelhamer
Member

Awesome! Thanks for contributing the Network-in-Network model.


@ducha-aiki
Contributor

Actually, it is updated, but it looks like it's not working...
I tried to fine-tune it on PASCAL and got an error on the first cccp layer:
I1003 13:05:46.623013 31678 caffe.cpp:115] Finetuning from nin_imagenet.caffemodel
...
F1003 13:05:46.656553 31678 net.cpp:713] Check failed: target_blobs[j]->channels() == source_layer.blobs(j).channels() (1 vs. 96)

@ronghanghu
Member

@ducha-aiki It takes some extra hacking to update the model weights. I have done this myself previously; you may want to try my version: https://drive.google.com/folderview?id=0B0IedYUunOQINEFtUi1QNWVhVVU&usp=drive_web

@ducha-aiki
Contributor

@ronghanghu, thank you very much, I will try it. The funniest thing is that I couldn't train it from scratch, not for lack of a GPU, but because I had no free disk space to download ImageNet :(

@emasa

emasa commented Dec 20, 2014

Hi @mavenlin, I'm wondering what top-5 accuracy you get on ImageNet with the NIN model, with and without test-time augmentation? I haven't read about it so far. Thanks.

@ducha-aiki
Contributor

@emasa
Top-1 acc 0.5674
Top-5 acc 0.7953.
Single central crop.
See https://github.com/BVLC/caffe/wiki/Models-accuracy-on-ImageNet-2012-val
