[QUESTION] Difference 10-epochs, 40-epochs #9

Closed
RGring opened this issue Nov 30, 2020 · 3 comments

@RGring

RGring commented Nov 30, 2020

Hi Ayush and Sayak,
I read your blog on wandb and find it interesting to see the performance on a relatively small dataset. Great work!

I am trying to reproduce your results with the official SwAV implementation, but I only reach the 10-epoch plateau (~2.7).

I was wondering what the differences between the two runs are. I found the following; did I miss any?

  • projection to prototype architecture
  • PolynomialDecay
  • ... further?
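
For reference, the PolynomialDecay item above most likely refers to a polynomial learning-rate schedule of the kind provided by `tf.keras.optimizers.schedules.PolynomialDecay`. The sketch below restates that formula in plain Python so the decay is easy to inspect. The function name `polynomial_decay` and every numeric value are illustrative assumptions and are not taken from either run discussed in this thread.

```python
def polynomial_decay(step, initial_lr, decay_steps, end_lr=0.0, power=1.0):
    """Polynomial learning-rate decay without cycling.

    lr = (initial_lr - end_lr) * (1 - step / decay_steps) ** power + end_lr
    The step is clipped to [0, decay_steps], so the rate stays at end_lr
    once decay_steps has been reached.
    """
    step = min(max(step, 0), decay_steps)
    fraction = 1.0 - step / decay_steps
    return (initial_lr - end_lr) * fraction ** power + end_lr


# Hypothetical settings, chosen only to show the shape of the schedule.
for s in (0, 50, 100, 200):
    print(s, polynomial_decay(s, initial_lr=0.1, decay_steps=100, end_lr=0.001))
```

With `power=1.0` the schedule is a straight line from `initial_lr` down to `end_lr`; larger powers decay faster early on and flatten out near the end.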

Just curious: did you achieve the same results with the original PyTorch implementation?

Thanks in advance :)

@sayakpaul
Collaborator

Thank you for your interest @RGring.

The authors have posted a couple of suggestions in the official repository. We have verified them ourselves, and they work! We have yet to publish the changes, but we wanted to confirm that the suggestions do help.

Just curious, if you achieved the same results with the original pytorch implementation?

Actually, we did not run the official implementation, so we won't be able to comment.

I hope this helps.

@RGring
Author

RGring commented Nov 30, 2020

Thanks for your answer. I've seen the suggestions, and some of them did help. I guess it is now a matter of hyperparameter tuning. Looking forward to your new best score on the flower dataset :).

Thanks again!

@sayakpaul
Collaborator

Yes, that is generally the case. Sometimes simplifying the data augmentation pipeline also helps quite a lot.

You can also take a look at SimSiam (https://github.com/sayakpaul/SimSiam-TF), which I open-sourced yesterday. The method is by FAIR. It's way simpler, and it provides results pretty comparable to SwAV, BYOL, MoCo v2, and SimCLR.
