Pvanet:Deep but Lightweight Neural Neural Networks for Real-time Object Detection #7786

qingzhouzhen · 2017-09-07T03:46:22Z

article adress : Pvanet:Deep but Lightweight Neural Neural Networks for Real-time Object Detection
Result of classification network:
INFO:root:Epoch[82] Batch [2000] Speed: 586.96 samples/sec accuracy=0.668164 top_k_accuracy_5=0.874805 INFO:root:Epoch[82] Batch [2050] Speed: 586.14 samples/sec accuracy=0.664766 top_k_accuracy_5=0.876250 INFO:root:Epoch[82] Batch [2100] Speed: 589.28 samples/sec accuracy=0.668438 top_k_accuracy_5=0.870938 INFO:root:Epoch[82] Batch [2150] Speed: 587.12 samples/sec accuracy=0.669766 top_k_accuracy_5=0.877266 INFO:root:Epoch[82] Batch [2200] Speed: 590.23 samples/sec accuracy=0.664297 top_k_accuracy_5=0.874922 INFO:root:Epoch[82] Batch [2250] Speed: 584.57 samples/sec accuracy=0.672266 top_k_accuracy_5=0.876836 INFO:root:Epoch[82] Batch [2300] Speed: 590.03 samples/sec accuracy=0.674492 top_k_accuracy_5=0.876172 INFO:root:Epoch[82] Batch [2350] Speed: 588.57 samples/sec accuracy=0.670820 top_k_accuracy_5=0.874453 INFO:root:Epoch[82] Batch [2400] Speed: 587.81 samples/sec accuracy=0.673672 top_k_accuracy_5=0.876094 INFO:root:Epoch[82] Batch [2450] Speed: 591.53 samples/sec accuracy=0.671406 top_k_accuracy_5=0.873828 INFO:root:Epoch[82] Batch [2500] Speed: 582.21 samples/sec accuracy=0.671992 top_k_accuracy_5=0.874805 INFO:root:Epoch[82] Train-accuracy=0.663086 INFO:root:Epoch[82] Train-top_k_accuracy_5=0.849609 INFO:root:Epoch[82] Time cost=2180.302 INFO:root:Saved checkpoint to "pvanet-models/pvanet-0083.params" INFO:root:Epoch[82] Validation-accuracy=0.640804 INFO:root:Epoch[82] Validation-top_k_accuracy_5=0.854931

Result of rpn and faster-rcnn training:
INFO:root:Epoch[9] Batch [9940] Speed: 2.57 samples/sec RPNAcc=0.991533 RPNLogLoss=0.023411 RPNL1Loss=0.322666 RCNNAcc=0.943286 RCNNLogLoss=0.157866RCNNL1Loss=0.834379 INFO:root:Epoch[9] Batch [9960] Speed: 2.66 samples/sec RPNAcc=0.991529 RPNLogLoss=0.023437 RPNL1Loss=0.322645 RCNNAcc=0.943291 RCNNLogLoss=0.157865RCNNL1Loss=0.834185 INFO:root:Epoch[9] Batch [9980] Speed: 2.48 samples/sec RPNAcc=0.991519 RPNLogLoss=0.023454 RPNL1Loss=0.322520 RCNNAcc=0.943320 RCNNLogLoss=0.157764RCNNL1Loss=0.833864 INFO:root:Epoch[9] Batch [10000] Speed: 2.63 samples/sec RPNAcc=0.991529 RPNLogLoss=0.023437 RPNL1Loss=0.322378 RCNNAcc=0.943317 RCNNLogLoss=0.157765 RCNNL1Loss=0.833716 INFO:root:Epoch[9] Batch [10020] Speed: 2.49 samples/sec RPNAcc=0.991519 RPNLogLoss=0.023461 RPNL1Loss=0.322260 RCNNAcc=0.943335 RCNNLogLoss=0.157711 RCNNL1Loss=0.833482 INFO:root:Epoch[9] Train-RPNAcc=0.991520 INFO:root:Epoch[9] Train-RPNLogLoss=0.023459 INFO:root:Epoch[9] Train-RPNL1Loss=0.322241 INFO:root:Epoch[9] Train-RCNNAcc=0.943339 INFO:root:Epoch[9] Train-RCNNLogLoss=0.157700 INFO:root:Epoch[9] Train-RCNNL1Loss=0.833458 INFO:root:Epoch[9] Time cost=3940.801 INFO:root:Saved checkpoint to "model/e2e-0010.params"

Result of the whole Object Detection network:
INFO:root:reading annotations for 4301/4952
INFO:root:reading annotations for 4401/4952
INFO:root:reading annotations for 4501/4952
INFO:root:reading annotations for 4601/4952
INFO:root:reading annotations for 4701/4952
INFO:root:reading annotations for 4801/4952
INFO:root:reading annotations for 4901/4952
INFO:root:saving annotations cache to data/cache/voc_2007_test_annotations.pkl
INFO:root:AP for aeroplane = 0.5641
INFO:root:AP for bicycle = 0.6811
INFO:root:AP for bird = 0.5965
INFO:root:AP for boat = 0.4123
INFO:root:AP for bottle = 0.3641
INFO:root:AP for bus = 0.7021
INFO:root:AP for car = 0.7254
INFO:root:AP for cat = 0.7801
INFO:root:AP for chair = 0.3233
INFO:root:AP for cow = 0.5812
INFO:root:AP for diningtable = 0.5688
INFO:root:AP for dog = 0.7511
INFO:root:AP for horse = 0.7783
INFO:root:AP for motorbike = 0.6953
INFO:root:AP for person = 0.6612
INFO:root:AP for pottedplant = 0.2878
INFO:root:AP for sheep = 0.5344
INFO:root:AP for sofa = 0.5900
INFO:root:AP for train = 0.6550
INFO:root:AP for tvmonitor = 0.6214
INFO:root:Mean AP = 0.5937

Mark:
1 Currently, the classification network result is 64%(70.6% as article), result of object detection is poor than article's, I will improve it in the days ahead
2 There has no 4 contributors as show, actually, it is just me , but I used different computers and proxy, next time I will pay attention to it

piiswrong · 2017-09-07T03:59:46Z

example/image-classification/symbols/pvanet.py

+import mxnet as mx
+def get_symbol(num_classes, **kwargs):
+
+    data = mx.sym.Variable(name='data')


why put the same file at two places?

pvanet is a classification net, it is a little different with it's counterpart as in symbol_pvanet, And secondly, I cannot upload pre-trained model(pvanet.py) for it is too large, thus some one else could pretrain with pvanet.py on ImageNet

qingzhouzhen · 2017-09-07T04:25:44Z

How I do:
1 pretrain pvanet with ImageNet, it is image classification, to specify, cd /path/to/example/image-calssification python train_imagenet.py --data-train=/data/ILSVRC2012_img_train.rec --data-val=/data/ILSVRC2012_img_val.rec --model-prefix=pvanet-models/pvanet --network=pvanet --gpus=0,1,2,3 --disp-batch=50 --batch-size=512 --lr=0.1 --top-k=5 , as refered in article, reshape image to 192*192
2 when "1" step finished, copy the model to /path/to/incubator-mxnet/example/rcnn/model, then bash script/pvanet_voc07.sh 0

…caffe

chinakook · 2017-09-07T07:22:57Z

A nice feature!

piiswrong · 2017-09-07T17:39:32Z

Could you add pvanet to here: https://mxnet.incubator.apache.org/model_zoo/index.html
and explain how to use it in rcnn's readme?

@mli Can we give @qingzhouzhen access to the modelzoo hosting server?

qingzhouzhen · 2017-09-08T06:20:12Z

OK, I'd like to, but I did not find where to push code in this link https://mxnet.incubator.apache.org/model_zoo/index.html @piiswrong

piiswrong · 2017-09-10T07:33:53Z

@winstywang Anyone from your side can review this?

piiswrong · 2017-09-10T17:32:00Z

@qingzhouzhen We decided to postpone merge and pretrained model upload until the result is reasonably close to the original paper.

qingzhouzhen · 2017-09-11T01:02:22Z

Ok, I will work on this to improve the result, if have any suggestion to my problem(mAP is low), please inform me, I am new to deep learning and mxnet @piiswrong

qingzhouzhen · 2017-09-11T02:46:39Z

I think the first part(image classfication net) is OK, currently the accuracy is 64.08%(70.6% as the result of the article), Can I push the first part only? @piiswrong

szha · 2017-09-11T02:58:23Z

6% is a bit far...

huanghanqing and others added 12 commits August 26, 2017 15:24

add pvanet first 225

b4a1ebb

add pvanet last part

0253f05

revert to vgg

ea6e530

harmonized with primary code

d8ab527

add pvanet_pretrain.py

3f740fa

pvanet end2end success

ca1fe65

modify symbol_pvanet end2end only

f72ce98

change fc6 fc7 name in pvanet_pretrain

a732127

Merge remote-tracking branch 'origin/pvanet-e2e' into pvanet-e2e

8c90b21

Update symbol_pvanet.py

602e17f

Update symbol_pvanet.py

de78900

Rename pvanet_pretrain.py to pvanet.py

9d1501c

qingzhouzhen force-pushed the pvanet_as_caffe branch from d20721e to 33d6db8 Compare September 7, 2017 03:56

piiswrong reviewed Sep 7, 2017

View reviewed changes

Hanqing Huang and others added 3 commits September 7, 2017 12:55

finetune faster-rcnn as caffe

33d6db8

Merge remote-tracking branch 'upstream/master' into pvanet-e2e

2391ad3

Merge remote-tracking branch 'origin/pvanet_as_caffe' into pvanet_as_…

afd0e05

…caffe

qingzhouzhen mentioned this pull request Sep 9, 2017

Low accuracy of pvanet #7820

Closed

qingzhouzhen closed this Oct 25, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Pvanet:Deep but Lightweight Neural Neural Networks for Real-time Object Detection #7786

Pvanet:Deep but Lightweight Neural Neural Networks for Real-time Object Detection #7786

qingzhouzhen commented Sep 7, 2017 •

edited

piiswrong Sep 7, 2017

qingzhouzhen Sep 7, 2017

qingzhouzhen commented Sep 7, 2017

chinakook commented Sep 7, 2017

piiswrong commented Sep 7, 2017

qingzhouzhen commented Sep 8, 2017 •

edited

piiswrong commented Sep 10, 2017

piiswrong commented Sep 10, 2017

qingzhouzhen commented Sep 11, 2017

qingzhouzhen commented Sep 11, 2017

szha commented Sep 11, 2017

Pvanet:Deep but Lightweight Neural Neural Networks for Real-time Object Detection #7786

Pvanet:Deep but Lightweight Neural Neural Networks for Real-time Object Detection #7786

Conversation

qingzhouzhen commented Sep 7, 2017 • edited

piiswrong Sep 7, 2017

Choose a reason for hiding this comment

qingzhouzhen Sep 7, 2017

Choose a reason for hiding this comment

qingzhouzhen commented Sep 7, 2017

chinakook commented Sep 7, 2017

piiswrong commented Sep 7, 2017

qingzhouzhen commented Sep 8, 2017 • edited

piiswrong commented Sep 10, 2017

piiswrong commented Sep 10, 2017

qingzhouzhen commented Sep 11, 2017

qingzhouzhen commented Sep 11, 2017

szha commented Sep 11, 2017

qingzhouzhen commented Sep 7, 2017 •

edited

qingzhouzhen commented Sep 8, 2017 •

edited