Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

The setting of random seeds #4

Closed
Vanint opened this issue Apr 20, 2021 · 2 comments
Closed

The setting of random seeds #4

Vanint opened this issue Apr 20, 2021 · 2 comments

Comments

@Vanint
Copy link

Vanint commented Apr 20, 2021

Hi, thanks for your great work. I have a question related to random seeds.
First, I tried the seed=1 and default hyperparameters, and the result of LADE is 52.2, which is actually lower than 53 in the original paper. I have no idea what is the difference.
Then, I tried the seed=2 and default hyperparameters, and the result of LADE is 48.9. Does that mean this method is unstable? Thanks very much if you can reply.

@wade3han
Copy link
Contributor

wade3han commented Apr 20, 2021

Huge thanks for your comments & experiments.

  1. All of our experiments are done by fixing the seed=1. We’re not entirely sure why you’re encountering the difference, but we suspect it may due to different machine settings. We did our experiment on AWS p3.16xlarge instances, and you may try this setting too.

  2. Since the Imagenet-LT dataset is quite large and takes some time to train the model, we haven’t got time to try multiple seeds to check the stableness of our method. We also agree that our method can be somewhat unstable in some cases since it is using the Monte Carlo approximation to calculate the loss. Managing the unstableness might be quite an intriguing direction to work with.

@Vanint
Copy link
Author

Vanint commented Apr 20, 2021

Thanks for your reply. I use TITAN RTX 2080.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants