
New SOTA reported by original TF repo, any plans to sync-up the changes? #70

Closed
factplay1 opened this issue Aug 12, 2020 · 8 comments

@factplay1

Thanks for your good work.

https://github.com/google/automl/tree/master/efficientdet

They now report 34.3 val mAP for EffDet-D0. I'm not sure what exactly they did, but it seems they made some changes to their model definition, etc. It'd be great if you could have a look at their changes and potentially sync up your repo so it reproduces those numbers.

Thanks a million :)

@rwightman
Owner

Just weight changes from a new set of runs; it's also not clear to me what they changed for the better result. I have D0-D5 converted and evaluated, but haven't had a moment to validate D6. I'd already done the D3, D7, D7X updates, which were trained for more epochs.

@factplay1
Author

Longer epochs? How many epochs? More than 300?

On D0, I have experimented a lot with the hyper-parameters in your code; to be honest, I don't think I can reach 34.3 with hyper-parameter changes alone.

@hal-314 commented Aug 19, 2020

@factplay1 Reviewing the changes between versions 6 and 7 of the paper, it seems that they now use soft-NMS for all models instead of NMS. They didn't train for more epochs. Here is an excerpt from the paragraph (section 5.1, first paragraph, at the end):

During training, we apply horizontal flipping and scale jittering [0.1, 2.0], which randomly resizes images between 0.1x and 2.0x of the original size before cropping. We apply soft-NMS [3] for eval. For D0-D6, each model is trained for 300 epochs with total batch size 128 on 32 TPUv3 cores, but to push the envelope, we train D7/D7x for 600 epochs on 128 TPUv3 cores.

While the old version read:

We use RetinaNet [23] preprocessing with training-time flipping and scaling. For D0-D6, each model is trained for 300 epochs with total batch size 128 on 32 TPUv3 cores and evaluated with standard NMS. To further push the envelope, we train D7 for 600 epochs and apply soft-NMS [3].
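
For illustration, since the quoted change is standard NMS → soft-NMS at eval time: below is a minimal sketch of the Gaussian soft-NMS variant from Bodla et al. [3], written from the paper's description rather than taken from either repo, so treat it as an approximation of the idea, not the exact implementation. Instead of discarding boxes that overlap the current top detection, it decays their scores, which is where a small eval-time mAP gain typically comes from.

```python
import torch


def soft_nms(boxes: torch.Tensor, scores: torch.Tensor,
             sigma: float = 0.5, score_thresh: float = 0.001):
    """Gaussian soft-NMS sketch: decay overlapping scores instead of dropping boxes.

    boxes: (N, 4) in (x1, y1, x2, y2); scores: (N,).
    Returns kept indices and their decayed scores.
    """
    scores = scores.clone()  # don't modify the caller's tensor
    idxs = torch.arange(boxes.size(0))
    areas = (boxes[:, 2] - boxes[:, 0]).clamp(min=0) * (boxes[:, 3] - boxes[:, 1]).clamp(min=0)
    keep, keep_scores = [], []

    while idxs.numel() > 0:
        # select the remaining box with the highest (possibly decayed) score
        top = scores[idxs].argmax().item()
        i = idxs[top]
        if scores[i] < score_thresh:
            break
        keep.append(i.item())
        keep_scores.append(scores[i].item())
        idxs = torch.cat([idxs[:top], idxs[top + 1:]])
        if idxs.numel() == 0:
            break

        # IoU of the selected box against all remaining boxes
        xx1 = torch.maximum(boxes[i, 0], boxes[idxs, 0])
        yy1 = torch.maximum(boxes[i, 1], boxes[idxs, 1])
        xx2 = torch.minimum(boxes[i, 2], boxes[idxs, 2])
        yy2 = torch.minimum(boxes[i, 3], boxes[idxs, 3])
        inter = (xx2 - xx1).clamp(min=0) * (yy2 - yy1).clamp(min=0)
        iou = inter / (areas[i] + areas[idxs] - inter)

        # Gaussian decay: high-overlap boxes are down-weighted, not removed
        scores[idxs] = scores[idxs] * torch.exp(-(iou ** 2) / sigma)

    return torch.tensor(keep, dtype=torch.long), torch.tensor(keep_scores)
```

A smaller `sigma` makes the decay sharper (closer to hard NMS). Real implementations run per class and use vectorized/batched routines; this loop is only meant to show the scoring rule.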

@rwightman
Owner

soft-NMS is the default for the eval results on the latest models, but most of the gains aren't due to that; they're due to the retraining. So it's still not clear what they changed for D0-D6 that improved the accuracy with the same number of epochs.

@factplay1
Author

It'd be very interesting to find that out. I'll try to also investigate the difference, will report if I find anything useful.

@Naivepro1990

Hi @rwightman

How did you train D7? As far as I know, it is not possible to fit it on a 16GB GPU even with a batch size of one.

@afaq-ahmad

Hi @rwightman

How did you train D7? As far as I know, it is not possible to fit it on a 16GB GPU even with a batch size of one.

It's the same problem I am facing.

@rwightman
Owner

@afaq-ahmad @Naivepro1990 I have never tried to train a D7. With the resolution in the paper you'd need 4-8 32GB V100 or A100 cards (if that even works) and many weeks. The originals were trained with (a lot of) TPUs. I have no plans to spend $100K+ on a system or $20-50K on cloud compute, so no, there is no point. I think a few people have fine-tuned a D7 at lower resolutions for some Kaggle challenges.
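
As a rough way to check that limit on a given card, here is a sketch that measures peak CUDA memory for one forward/backward pass at a chosen resolution. It is an assumption-laden probe, not part of this repo: `build_d7_train_bench()` is a hypothetical placeholder for however you construct the training-mode model (a real detection train bench also needs targets and returns losses); the memory-stat calls themselves are standard PyTorch.

```python
import torch
import torch.nn as nn


def peak_train_memory_mb(model: nn.Module, input_size: int, batch_size: int = 1) -> float:
    """Peak CUDA memory (in MB) for one forward/backward pass at the given resolution."""
    device = torch.device('cuda')
    model = model.to(device).train()
    torch.cuda.reset_peak_memory_stats(device)

    images = torch.randn(batch_size, 3, input_size, input_size, device=device)
    out = model(images)
    # Reduce whatever the model returns to a scalar so backward() runs;
    # a real train bench would give you a loss (and would also need targets).
    if torch.is_tensor(out):
        loss = out.sum()
    elif isinstance(out, dict):
        loss = sum(v.sum() for v in out.values() if torch.is_tensor(v))
    else:
        loss = sum(o.sum() for o in out)
    loss.backward()

    return torch.cuda.max_memory_allocated(device) / (1024 ** 2)


if __name__ == '__main__':
    # `build_d7_train_bench` is a stand-in name; swap in the actual model construction.
    # model = build_d7_train_bench()
    # print(peak_train_memory_mb(model, input_size=1536))  # D7 trains at 1536x1536 per the paper
    pass
```

If the probe OOMs even at batch size one, the usual fallbacks are gradient checkpointing, mixed precision, or a lower resolution, which matches the lower-resolution fine-tuning mentioned above.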
