The layer before the auxiliary loss layer #21

Open
huaxinxiao opened this issue Jan 23, 2017 · 4 comments

@huaxinxiao

Hi @hszhao, what layers sit before the auxiliary loss layer? A simple 3x3 conv layer, or another pyramid pooling module?


authman commented Jul 14, 2017

I've tried {ProjectionConv, BN, ReLU, Dropout, Conv2d->NumClasses} as an aux branch. The results were 'meh'.


qizhuli commented Jul 14, 2017

@authman Hi, I don't think that is what they used. I actually loaded their released weights with their deploy prototxt and inspected the "ignored layers" reported by Caffe.
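
In pycaffe, that inspection amounts to just constructing the net with the weights; a minimal sketch, with placeholder file names standing in for the released PSPNet prototxt and caffemodel:

```python
import caffe

# Loading released weights into the deploy net makes Caffe log
# "Ignoring source layer ..." for every layer present in the
# .caffemodel but absent from the deploy prototxt (data layers,
# losses, and the auxiliary branch). File names are placeholders.
caffe.set_mode_cpu()
net = caffe.Net('pspnet101_deploy.prototxt', 'pspnet101.caffemodel', caffe.TEST)
```

The load then logs, among other things: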

I0718 17:39:40.174931  9953 net.cpp:816] Ignoring source layer data
I0718 17:39:40.174958  9953 net.cpp:816] Ignoring source layer label_gather
I0718 17:39:40.221314  9953 net.cpp:816] Ignoring source layer conv6_gather
I0718 17:39:40.221345  9953 net.cpp:816] Ignoring source layer conv6_gather_conv6_gather_0_split
I0718 17:39:40.221349  9953 net.cpp:816] Ignoring source layer label_shrink
I0718 17:39:40.221354  9953 net.cpp:816] Ignoring source layer label_shrink_label_shrink_0_split
I0718 17:39:40.221357  9953 net.cpp:816] Ignoring source layer loss
I0718 17:39:40.221361  9953 net.cpp:816] Ignoring source layer accuracy
I0718 17:39:40.221366  9953 net.cpp:816] Ignoring source layer conv4_24
I0718 17:39:40.221372  9953 net.cpp:816] Ignoring source layer conv4_24/bn
I0718 17:39:40.221377  9953 net.cpp:816] Ignoring source layer conv4_24/relu
I0718 17:39:40.221382  9953 net.cpp:816] Ignoring source layer conv4_24/dropout
I0718 17:39:40.221387  9953 net.cpp:816] Ignoring source layer conv6_1
I0718 17:39:40.221392  9953 net.cpp:816] Ignoring source layer conv6_1_gather
I0718 17:39:40.221396  9953 net.cpp:816] Ignoring source layer loss_1

Apparently it was a conv/bn/relu/dropout block (the conv4_24s) followed by a single conv (conv6_1) in the auxiliary branch. Would that have made the difference between their reported result and the "meh" results you and I got? Not likely.
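
For anyone reimplementing it, here is a minimal PyTorch sketch of that auxiliary branch. Only the layer ordering and the names conv4_24/conv6_1 come from the log above; the input width (assuming the branch taps the 1024-channel stage-4 output of ResNet-101), the intermediate width, the number of classes, and the dropout rate are assumptions on my part:

```python
import torch.nn as nn

class AuxHead(nn.Module):
    """Auxiliary classifier inferred from the ignored-layer log:
    conv4_24 (3x3 conv) -> BN -> ReLU -> Dropout -> conv6_1 (1x1 conv).
    Channel widths, num_classes, and dropout rate are guesses, not
    values read from the released prototxt."""
    def __init__(self, in_channels=1024, mid_channels=256,
                 num_classes=21, p_drop=0.1):
        super().__init__()
        self.conv4_24 = nn.Conv2d(in_channels, mid_channels,
                                  kernel_size=3, padding=1, bias=False)
        self.bn = nn.BatchNorm2d(mid_channels)
        self.relu = nn.ReLU(inplace=True)
        self.dropout = nn.Dropout2d(p_drop)
        self.conv6_1 = nn.Conv2d(mid_channels, num_classes, kernel_size=1)

    def forward(self, x):
        x = self.relu(self.bn(self.conv4_24(x)))
        return self.conv6_1(self.dropout(x))
```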

Personally, I think the key to achieving their stellar accuracy is finetuning the BN parameters on VOC across multiple GPUs. That is also corroborated by DeepLab v3, which used the same BN trick and obtained almost the same accuracy as PSPNet.
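
As a rough illustration of that multi-GPU BN idea (in PyTorch rather than the authors' Caffe setup, and only a sketch of the general technique), synchronized BN computes its statistics over the global batch instead of the tiny per-GPU batch:

```python
import torch.nn as nn

# Assumes torch.distributed.init_process_group() has already been
# called; `model` and `local_rank` are placeholders for your own
# segmentation net and this process's GPU index.
def finetune_with_sync_bn(model: nn.Module, local_rank: int) -> nn.Module:
    # Replace every BatchNorm layer with a cross-GPU synchronized one.
    model = nn.SyncBatchNorm.convert_sync_batchnorm(model)
    model = model.cuda(local_rank)
    return nn.parallel.DistributedDataParallel(model, device_ids=[local_rank])
```

With only a crop or two per GPU, unsynchronized BN statistics are very noisy, which is the usual explanation for why this finetuning matters so much for segmentation.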

@mjohn123

@qizhuli: do you know where I can get the DeepLab v3 source code? I couldn't find it. Thanks


qizhuli commented Jul 15, 2017

@mjohn123 I don't think they have released it yet.
