Accuracy results on cifar100 #4

Closed
shuikehuo opened this issue Sep 14, 2019 · 4 comments

@shuikehuo commented Sep 14, 2019

The paper reports 75.30% accuracy on the clean test set, but I obtain 78.16% accuracy on the same test set. I use ResNet-50 with an SGD + momentum optimizer, trained for 350 epochs.

@rohan-anil (Collaborator) commented Sep 14, 2019

Hi Shuikehuo,

We used a Resnet-56 without batch norm from [1], which explains the accuracy difference (it is a weaker baseline). It was trained with the SGD optimizer for 50k steps at batch size 128.

The experiment shows the effect of noisy labels on test accuracy when training with the logistic loss versus the bi-tempered logistic loss. We expect the accuracy delta to remain similar even when training with a Resnet-50 (with batch norm) or a model of similar capacity. We will soon make the code for the Resnet-56 model without batch norm from [1] available to reproduce the results.

Thanks,

[1] Identity Matters in Deep Learning, Moritz Hardt, Tengyu Ma, https://arxiv.org/pdf/1611.04231.pdf
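
For reference, a minimal NumPy sketch of the bi-tempered logistic loss as described in the paper (the official implementation lives in the google/bi-tempered-loss repository; the function names and iteration count below are illustrative choices, not that repo's API):

```python
import numpy as np

def log_t(x, t):
    """Tempered logarithm; reduces to log(x) as t -> 1."""
    if t == 1.0:
        return np.log(x)
    return (x ** (1.0 - t) - 1.0) / (1.0 - t)

def exp_t(x, t):
    """Tempered exponential, the inverse of log_t; heavier-tailed than exp for t > 1."""
    if t == 1.0:
        return np.exp(x)
    return np.maximum(1.0 + (1.0 - t) * x, 0.0) ** (1.0 / (1.0 - t))

def tempered_softmax(activations, t, num_iters=20):
    """exp_t of the activations shifted by a normalization constant,
    found by fixed-point iteration (for t > 1 there is no closed form)."""
    mu = np.max(activations, axis=-1, keepdims=True)
    a0 = activations - mu
    a = a0
    for _ in range(num_iters):
        z = np.sum(exp_t(a, t), axis=-1, keepdims=True)
        a = a0 * z ** (1.0 - t)
    z = np.sum(exp_t(a, t), axis=-1, keepdims=True)
    lam = -log_t(1.0 / z, t) + mu
    return exp_t(activations - lam, t)

def bi_tempered_logistic_loss(activations, labels, t1, t2):
    """Bi-tempered loss for one-hot labels, with t1 <= 1 <= t2.
    t1 < 1 makes the loss bounded (robust to outliers); t2 > 1 makes
    the tempered softmax heavy-tailed (robust to noisy labels)."""
    probs = tempered_softmax(activations, t2)
    # The tiny epsilon only guards t1 == 1, where log_t(0) = -inf.
    loss = (labels * (log_t(labels + 1e-10, t1) - log_t(probs, t1))
            - (labels ** (2.0 - t1) - probs ** (2.0 - t1)) / (2.0 - t1))
    return np.sum(loss, axis=-1)
```

Setting t1 = t2 = 1 recovers ordinary softmax cross entropy, which makes for an easy sanity check.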

@Charles-Xie

@rohan-anil Is there any reason to use ResNet-56 without batch normalization? This network does not seem to be used much in experiments.

When I use ResNet-110 with BN (as introduced in the ResNet v1 paper), the accuracy delta (improvement) does not seem very pronounced, for either clean or noisy labels.

[attached image: results]

@eamid (Collaborator) commented Jan 15, 2020

Hi Chi,

Thank you for your interest in our method.

We used the Resnet-56 model because we had the baseline readily available (Moritz was at Google, and we used his codebase). I noticed that the bi-tempered loss still gives some improvement in your case. You might achieve an even larger improvement by tuning t1 and t2 (I would suggest trying a larger t2 value).

Ehsan
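
To make the tuning advice concrete, a quick illustration reusing bi_tempered_logistic_loss from the sketch above (the temperature pairs are illustrative values, not tuned recommendations): on an example the network classifies confidently but whose label is wrong, lowering t1 and raising t2 shrinks the loss, so noisy labels pull less on the model.

```python
import numpy as np

activations = np.array([8.0, 0.0, 0.0])  # network is confident about class 0
wrong_label = np.array([0.0, 1.0, 0.0])  # ...but the (noisy) label says class 1

# t1 = t2 = 1 is plain softmax cross entropy; the other pairs are illustrative.
for t1, t2 in [(1.0, 1.0), (0.9, 1.5), (0.5, 4.0)]:
    loss = bi_tempered_logistic_loss(activations, wrong_label, t1, t2)
    print(f"t1={t1}, t2={t2}: loss={loss:.3f}")
```

With these inputs the loss drops from about 8 (the logit gap, as with cross entropy) to well under 1 at (0.5, 4.0), which is the mechanism behind the robustness to label noise.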

@Charles-Xie

@eamid
Thanks a lot!

eamid closed this as completed Apr 23, 2020