
sigma_const hyper parameter #40

Closed
motokimura opened this issue Nov 27, 2019 · 7 comments

Comments

@motokimura
Contributor

Hi, I talked with you at your poster in ICCV2019!
I just released a PyTorch implementation of Gaussian YOLOv3 with training code on the COCO dataset. Would it be possible to include a link to our repo in the third-party implementations section of your README? If that is okay with you, I can send a pull request to update your README.

Though Gaussian YOLOv3 in our repo shows a significant improvement in COCO mAP (2.7 points) on COCO 2017 val, this improvement is still smaller than the one reported in your paper (3.1 points).
I'm wondering if this difference comes from the hyperparameter sigma_const, set to 0.3 in your implementation (our implementation does not have this parameter).

Do you think sigma_const affects the result a lot?
How did you find the value 0.3 for this parameter?

@jwchoi384
Owner

@motokimura
Hi, I remember you! Nice to meet you!
Thanks for your implementation 👍
Please send me a pull request to update the README.

I experimented with several sigma_const values, and there was some difference.
On the BDD validation set (weights after over 200k iterations):
sigma_const: 0.1 -> mAP is 18.49-18.65.
sigma_const: 0.2 -> mAP is 18.68-19.11.
sigma_const: 0.3 -> mAP is 18.92-19.14 (and when we reduce the learning rate during ongoing training, we can get over 19.7 mAP).
Of those, 0.3 was the best value.
When sigma_const was not added to the loss in our C implementation, training did not work well.
I don't know exactly why, but the loss seems to be sensitive to the variance.
So I added it to my model.
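For anyone curious how the stabilization described above might look, here is a minimal PyTorch sketch of a Gaussian negative log-likelihood over one box coordinate. The function name, the sigmoid activation for sigma, and the exact placement of sigma_const are my assumptions for illustration, not the repo's actual code:

```python
import math

import torch

def gaussian_nll(mu, sigma_raw, target, sigma_const=0.3):
    """Negative log-likelihood of a Gaussian over a box coordinate.

    sigma_const is added to the predicted sigma so the variance never
    approaches zero, which would otherwise blow up the 1/sigma^2 term
    (hypothetical formulation of the stabilization discussed above).
    """
    # sigma ends up in (sigma_const, 1 + sigma_const)
    sigma = torch.sigmoid(sigma_raw) + sigma_const
    nll = 0.5 * torch.log(2 * math.pi * sigma ** 2) \
        + (target - mu) ** 2 / (2 * sigma ** 2)
    return nll.mean()
```

With a larger sigma_const, the 1/sigma^2 factor on the squared error is bounded more tightly, which would explain why training is less sensitive to the variance.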

@motokimura
Contributor Author

motokimura commented Nov 28, 2019

Glad to see you there, too! 😄
Thank you so much for merging the PR!

sigma_const looks more important than I thought.
I agree that the loss is sensitive to the variance when sigma_const is small. In our experiments without sigma_const on the COCO dataset, we needed gradient clipping to avoid divergence. Also, mAP increased more slowly than when we trained normal YOLOv3 with the same hyperparameters [link]. These may be caused by the large magnitude of the gradients, and sigma_const seems to mitigate this kind of instability in the Gaussian loss.
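For reference, the gradient clipping mentioned above can be sketched as follows, using PyTorch's standard `clip_grad_norm_` utility. The tiny model here is a hypothetical stand-in for the detector; only the clipping step matters:

```python
import torch
from torch import nn

# hypothetical stand-in for the detector; only the clipping step matters here
model = nn.Linear(8, 4)
optimizer = torch.optim.SGD(model.parameters(), lr=0.01)

x = torch.randn(2, 8)
loss = model(x).pow(2).mean()

optimizer.zero_grad()
loss.backward()
# rescale gradients so their global L2 norm is at most max_norm,
# preventing a single large-gradient batch from diverging training
nn.utils.clip_grad_norm_(model.parameters(), max_norm=1.0)
optimizer.step()
```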

I will try Gaussian YOLOv3 training on COCO again with sigma_const parameter for higher mAP!

@jwchoi384
Owner

@motokimura
I see! Thanks!

@motokimura
Contributor Author

I'll let you know when I get the experiment results! Thanks for your kind answers!

@CuongNguyen218

@motokimura, can you enable issues in your repo?

@NewRGB

NewRGB commented Nov 30, 2019 via email

@motokimura
Contributor Author

@CuongNguyen218
I enabled the issues feature in our repo!
https://github.com/motokimura/PyTorch_Gaussian_YOLOv3/issues

I forgot to enable this feature.
Thanks for your comment!
