Add better learning rate annealing scheduler + Learning rate finder #228

Closed
mhham opened this issue Nov 19, 2018 · 4 comments
Labels
enhancement Improving of an existing feature

Comments

@mhham

mhham commented Nov 19, 2018

Two ideas that could improve training:

  1. Add a learning rate finder, as explained here and implemented here

  2. Cosine annealing with restarts: https://github.com/roveo/pytorch/pull/1/commits/35891c46dbdec46e55a5fb725418c2880f631dac
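
For readers unfamiliar with the second idea, here is a minimal sketch of a cosine annealing schedule with warm restarts. It is not taken from the linked commit; the function name and the parameters `base_lr`, `min_lr`, `cycle_len`, and `cycle_mult` are assumptions chosen for illustration. Within each cycle the learning rate decays from `base_lr` to `min_lr` along a half cosine, then jumps back to `base_lr` at the start of the next cycle.

```python
import math


def cosine_annealing_with_restarts(step, base_lr=0.1, min_lr=0.001,
                                   cycle_len=10, cycle_mult=2):
    """Return the learning rate for a 0-indexed training step.

    All parameter names are hypothetical, for illustration only.
    """
    # Find the cycle that contains `step`. Each cycle is `cycle_mult`
    # times longer than the previous one.
    t_cur, t_i = step, cycle_len
    while t_cur >= t_i:
        t_cur -= t_i
        t_i *= cycle_mult
    # Half-cosine decay from base_lr down towards min_lr within the cycle.
    cosine = 0.5 * (1.0 + math.cos(math.pi * t_cur / t_i))
    return min_lr + (base_lr - min_lr) * cosine


# Restarts happen at steps 0, 10 and 30 with the default settings.
schedule = [cosine_annealing_with_restarts(s) for s in range(40)]
```

With `cycle_mult=1` every cycle has the same length; larger values make later cycles longer, so restarts become less frequent as training goes on.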

@alanakbik
Collaborator

That sounds like a great idea. In fact, we've recently been thinking a lot about better ways to find good parameters than a full parameter sweep. Would you be interested in testing/integrating such approaches?

@tabergma tabergma added the enhancement Improving of an existing feature label Nov 20, 2018
@alanakbik
Collaborator

@kashif what do you think?

@kashif
Contributor

kashif commented Nov 21, 2018

@mhham @alanakbik Indeed, I have a PR in preparation that adds weight decay versions of the optimizers. After that, we can add the corresponding schedulers, which change not only the learning rate but also the weight decay factor or momentum.

@alanakbik
Collaborator

The learning rate finder was just added for release-0.4. We'll look into cosine annealing in a later version.
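
For context, the general idea behind a learning rate finder (a learning rate range test) can be sketched as below. This is not the code added for release-0.4; it is a toy illustration on a one-dimensional quadratic, and every name in it (`lr_range_test`, `toy_loss_and_grad`, `diverge_factor`, and so on) is hypothetical. The learning rate grows exponentially after each update, the loss is recorded at each step, and the run stops once the loss clearly blows up.

```python
import math


def lr_range_test(loss_and_grad, w0, start_lr=1e-5, end_lr=10.0,
                  num_steps=100, diverge_factor=4.0):
    """Record (learning rate, loss) pairs while the rate grows exponentially.

    Hypothetical helper for illustration; real finders usually also
    smooth the recorded loss, which is omitted here.
    """
    w = w0
    # Multiplicative factor that takes start_lr to end_lr in num_steps steps.
    growth = (end_lr / start_lr) ** (1.0 / (num_steps - 1))
    lr, best_loss = start_lr, math.inf
    history = []
    for _ in range(num_steps):
        loss, grad = loss_and_grad(w)
        history.append((lr, loss))
        best_loss = min(best_loss, loss)
        # Stop early once the loss is far above the best value seen so far.
        if loss > diverge_factor * best_loss:
            break
        w -= lr * grad  # plain gradient descent update
        lr *= growth
    return history


def toy_loss_and_grad(w):
    # Toy objective f(w) = (w - 3)^2 and its derivative.
    return (w - 3.0) ** 2, 2.0 * (w - 3.0)


history = lr_range_test(toy_loss_and_grad, w0=0.0)
```

In practice the recorded pairs are plotted on a log scale, and a learning rate somewhat below the point where the loss starts to rise is a common starting choice.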
