
How to use OHEM loss function? #15

Closed
qijiezhao opened this issue Dec 26, 2018 · 12 comments
Labels
result reproduced The results are well reproduced

Comments

@qijiezhao

qijiezhao commented Dec 26, 2018

Hi, Zilong:

Thanks for your contribution to this amazing repo! I really appreciate it!
By the way, I want to reproduce the highest score in your paper: 81.4 on the Cityscapes test set (with 4 V100 GPUs).
I have already reproduced the default-setting single-scale result on the val set: 79.7 on my machine.
To reach 81.4, as far as I know, besides the default settings, the OHEM loss function and multi-scale inference should also be used. But when I retrain the code with --ohem True, the evaluation results drop a lot, by about 10 points.
So my question is: what else should I modify to achieve the result reported in the original paper?

Thanks a lot for your help!

@speedinghzl
Owner

@qijiezhao Thanks for your attention. I have updated run_local.sh. The hyperparameters of OHEM are --ohem-thres 0.7 --ohem-keep 100000.
To achieve 81.4 on the test set, you need to train the model on both the training and validation sets for 100000 iterations, then finetune that model with a low learning rate and frozen sync BN.
Also, can you report your result in the Reproduce Performance Discussion along with your environment information? Thank you very much.
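For readers wondering what these two flags control: --ohem-thres is the probability threshold below which a pixel counts as a hard example, and --ohem-keep is the minimum number of pixels retained in the loss. Below is a minimal NumPy sketch of that selection logic. This is illustrative only, with hypothetical function and variable names; it is not the repo's actual PyTorch implementation.

```python
import numpy as np

def ohem_cross_entropy(probs, labels, thres=0.7, min_keep=100000, ignore_label=255):
    """Illustrative OHEM cross-entropy over per-pixel predictions.

    probs:  (N, C, H, W) softmax probabilities
    labels: (N, H, W) integer ground-truth labels
    """
    n, c, h, w = probs.shape
    flat_labels = labels.reshape(-1)
    flat_probs = probs.transpose(0, 2, 3, 1).reshape(-1, c)

    valid = flat_labels != ignore_label
    # probability the model assigns to the ground-truth class at each pixel
    gt_prob = flat_probs[np.arange(flat_labels.size), np.where(valid, flat_labels, 0)]
    gt_prob = np.where(valid, gt_prob, 2.0)  # ignored pixels sort last

    # a pixel is "hard" if its ground-truth probability is below thres;
    # always keep at least min_keep pixels (capped at the valid-pixel count)
    n_hard = int((gt_prob < thres).sum())
    keep = max(min(min_keep, int(valid.sum())), n_hard)

    hardest = np.argsort(gt_prob)[:keep]  # lowest-confidence pixels first
    losses = -np.log(np.clip(gt_prob[hardest], 1e-12, None))
    return losses.mean()
```

Intuitively, this is why a bad threshold can hurt: if too few pixels are hard, the min_keep floor backfills with easier pixels, but an overly aggressive selection can starve the loss of well-classified context.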

@qijiezhao
Author

Yeah, I reproduced the results of PSPNet, FCN_NonLocal, Deeplabv3, and CCNet with a 769x769 input size (within ±0.2%). I think these steps are not that troublesome.

@speedinghzl
Owner

Great! Does FCN_NonLocal use ResNet-101 as the backbone? Can you tell me the performance of FCN_NonLocal? I don't have a V100 to run FCN_NonLocal with ResNet-101.

@qijiezhao
Author

qijiezhao commented Dec 26, 2018

> Great! Does FCN_NonLocal use Resnet-101 as the backbone? Can you tell me the performance of FCN_NonLocal? I don't have V100 to run FCN_NonLocal with Resnet-101.

Yes, FCN_NonLocal with ResNet-101. I have tuned a lot of different settings to maximize the performance; the highest is around 79.97 (val set, single scale), although with many more parameters than CCNet. The tricks include tuning the key channels (pruning), reducing unnecessary channels, etc.

In addition, if I use the segmentation toolbox code to run the OHEM loss function, is the modification also --ohem-thres 0.7 --ohem-keep 100000?

@speedinghzl
Owner

OK, thanks for the information.
Yes, the segmentation toolbox can also use the same OHEM parameters: --ohem-thres 0.7 --ohem-keep 100000.

@qijiezhao
Author

qijiezhao commented Dec 26, 2018

Thanks for your help; my command is:
python train.py --data-dir /path/to/my/data_dir/ --random-mirror --random-scale --restore-from ./dataset/resnet101-imagenet.pth --gpu 0,1 --learning-rate 0.01 --weight-decay 0.0005 --batch-size 8 --num-steps 40000 --ohem True --ohem-thres 0.7 --ohem-keep 100000
This is for training only on the train set. If I get an improvement, I will try adding the val set and training for more iterations.
I will report my final OHEM results here when I get them.

@speedinghzl speedinghzl added the result reproduced The results are well reproduced label Dec 26, 2018
@qijiezhao
Author

qijiezhao commented Jan 4, 2019

Hi, Zilong:

I have evaluated plain FCN-non-local with OHEM and it improves by 0.7 points over training without OHEM. However, when I switch to my own method, it gets no improvement; I think my method may conflict with OHEM. Have you ever met this problem when designing CCNet?

I would like to keep in long-term communication with you about semantic segmentation. Could you add me as a WeChat friend? My ID is zhaoqijie8356. Thanks.

@lxtGH

lxtGH commented Jan 6, 2019

Hi, I also ran into the same problem: using OHEM gave no improvement for my own method. It seems that OHEM may be sensitive to the network architecture.

@EthanZhangYi

Hi, @qijiezhao
Can you report the results of PSPNet, FCN_NonLocal, Deeplabv3, and CCNet with a 769x769 input size, trained on the Cityscapes train set and evaluated on the val set? Also, can you report your environment info here?

Thanks

@qijiezhao
Author

> Hi, @qijiezhao
> Can you report the results of PSPNet, FCN_NonLocal, Deeplabv3 and CCNet with 769x769 input size, trained on Cityscapes train set and evaluated on val set. Moreover, can you report your environment info here?
>
> Thanks

Sure.
PSPNet: 78.6
Deeplabv3: 78.8
CCNet: 79.8
FCN_NonLocal: 79.2

The env is: 2x V100 GPUs, PyTorch 0.4.1, 4 images per GPU.

@EthanZhangYi

@qijiezhao Thanks very much.

@zhangpj

zhangpj commented Nov 1, 2020

> @qijiezhao Thanks for your attention. I have updated run_local.sh. The hyperparameters of OHEM are --ohem-thres 0.7 --ohem-keep 100000.
> To achieve the 81.4 on the testing set, you need to train the model with both the training set and validation set for 100000 iterations, then you need to finetune this model with a low learning rate and frozen sync bn.
> Besides, Can you report your result in Reproduce Performance Discussion with your env information? Thank you very much.

@speedinghzl Can you share the exact learning rate and the number of training steps for finetuning this model?
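For what it's worth, "frozen sync BN" is commonly achieved in PyTorch by putting the BatchNorm layers in eval mode and disabling gradients on their affine parameters. The sketch below uses vanilla BatchNorm2d as a generic stand-in for the repo's synchronized BN; the exact finetuning learning rate and step count are not stated in this thread, and the helper name is mine.

```python
import torch.nn as nn

def freeze_bn(model: nn.Module) -> nn.Module:
    """Freeze every BatchNorm layer: stop updating its running
    statistics (eval mode) and its affine weight/bias (no grad)."""
    for m in model.modules():
        if isinstance(m, nn.modules.batchnorm._BatchNorm):
            m.eval()                      # running mean/var no longer update
            for p in m.parameters():
                p.requires_grad = False   # weight/bias no longer train
    return model
```

Note that a later `model.train()` call flips BN layers back into training mode, so `freeze_bn` would need to be re-applied after each such call, and the optimizer should be built over `filter(lambda p: p.requires_grad, model.parameters())`.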
