Add VGG16 #265

yuyu2172 · 2017-06-12T12:51:05Z

Merge after #271.

This PR adds a VGG16Layers, which will have APIs consistent with rest of the links in ChainerCV.

RGB images will work for pretraiend model. (TODO: A script to convert a caffe weight to npz file will be added).
Separate caffe loader from the model code. This is good for making the model code shorter.
predict returns an iterable of numpy.array.
Have an interface to select weight initializers at __init__ (users can manually select to stop using random initializer)
__call__ does not have layers option. Also, the return value is not a dictionary but a chainer.Variable.
__init__ takes feature option, which selects the feature that is going to be returned by __call__.
depending on feature option, layers that are unnecessary will automatically be deleted.

These changes are made for the following concrete scenarios in mind.

The chain will be used as a feature extractor for networks used in tasks other than classification. For example, this chain can be used as a feature extractor for FasterRCNNVGG16 with a proper initialization.
Usage together with other ChainerCV functions, which assume RGB image order.
I am assuming that the chain is used to extract only one type of feature. Also, I am assuming that the kind of extracted feature is fixed after initializing the chain. (EDIT: This assumption is no longer true in our final design)

t-abe · 2017-06-13T07:08:06Z

chainercv/links/model/classification/vgg.py

+            self.fc7 = L.Linear(4096, 4096, **kwargs)
+            self.fc8 = L.Linear(4096, 1000, **kwargs)
+
+        self.functions = collections.OrderedDict([


I do not recommend this style of network definition. It might cause problems when an instance of this class is copied.
(See chainer/chainer#2810 .)

I didn't know that! Thank you!

yuyu2172 · 2017-06-14T10:53:45Z

I am trying to reproduce the evaluation score reported here.
The website reports that VGG16 scores error 28.5 % for Top-1 Error with single center crop.

Currently, my implementation scored 32%, and I am going to investigate the reason why it scored below the reported score.

EDIT:
I forgot to set chainer.config.train = False. After fixing that, the score is 28.97%

EDIT2:
It seems that the reported performance is calculated on weights trained on Matconvnet. The caffeweight maybe different from the Matconvnet weight.

NOTE:
From the paper, following their evaluation technique, the model is expected to score 27% for Top-1 Error and 8.8% for Top-5 Error (Table 3, Row D, S=Q=256).

NOTE:
With ten-crop evaluation, the model scored 27.06% Top-1 Error, which is reasonably close to 27% reported in the paper.

yuyu2172 · 2017-08-20T00:24:22Z

Merged master

Hakuyume · 2017-08-20T02:32:21Z

chainercv/links/model/faster_rcnn/faster_rcnn_vgg.py

@@ -74,7 +73,8 @@ class FasterRCNNVGG16(FasterRCNN):
        'voc07': {
            'n_fg_class': 20,
            'url': 'https://github.com/yuyu2172/share-weights/releases/'
-            'download/0.0.3/faster_rcnn_vgg16_voc07_2017_06_06.npz'
+            'download/0.0.4/'
+            'faster_rcnn_vgg16_voc07_trained_2017_08_06_trial_4.npz'


Why do you use trial_4? Is this the best model?

It is the model that is converted from faster_rcnn_vgg16_voc07_2017_06_06.npz.
It performs the same with the previously distributed model.

Perhaps, faster_rcnn_vgg16_voc07_2017_08_06.npz is better. User will think "what is trial_4?" like me.

OK. Thanks for you feedback.

Hakuyume · 2017-08-20T02:34:38Z