Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Performance for RegNetY #2

Open
zhanghang1989 opened this issue May 25, 2020 · 4 comments
Open

Performance for RegNetY #2

zhanghang1989 opened this issue May 25, 2020 · 4 comments

Comments

@zhanghang1989
Copy link

First, thanks for the awesome work of re-implementing RegNet.

I am having difficulty of reproducing the results for RegNetY-0.4GF. The configurations are taken from the original repo:

group_width = 8
initial_width = 48
slope = 27.89
quantized_param = 2.09
network_depth = 16

I only get 72.92 top-1 accuracy, but the original paper reported 74.2. Any thoughts on that?

@uvipen
Copy link
Contributor

uvipen commented May 26, 2020

Hi,
Thanks for your interest. Do you change anything else on our code (e.g data augmentation, schedule, ...)?

@zhanghang1989
Copy link
Author

Thanks for the quick reply!

I used different training HPs learning rate: 0.2, batch size: 512, weight decay: 1e-4

When switching back to paper default: learning rate: 0.8, batch size: 1024, weight decay: 5e-5, I got 73.66 now.

@uvipen
Copy link
Contributor

uvipen commented May 26, 2020

Great. Pls notice that data-preprocessing stage also influences performance. Since the paper didnt explicitly mention this step, we only applied very basic techniques. That's the reason why performance could be slightly different from paper

@zhanghang1989
Copy link
Author

Thanks @uvipen
Could you the results in your experiments? I saw the README is still pending.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants