OneCycleCosine

Implements the modified version of Leslie Smith's 1cycle policy described by Sylvain Gugger and Jeremy Howard for PyTorch. This version uses cosine annealing like the FastAI version but has three phases instead of two:

phase	default	description
warmup	30%	lr_min -> lr_max, momentum_max -> momentum_min
plateau	0%	lr_max, momentum_min - spends more time looking for an optimal minima.
winddown	70%	lr_max -> lr_max / 24e4, momentum_min -> momentum_max

Phases 1 and 3 are the same as phases 1 and 2 FastAI 1cycle policy. Phase 2 is described in the FastAI blogpost.

Usage

OneCycleCosine should be used for optimizers which have a 'momentum' parameter
OneCycleCosineAdam should be used for Adam based optimizers which have a 'betas' parameter tuple

References

FastAI Blogpost - https://www.fast.ai/2018/07/02/adam-weight-decay
Original Paper - https://arxiv.org/abs/1803.09820

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.gitignore		.gitignore
README.md		README.md
onecyclec.py		onecyclec.py
plot.py		plot.py
sched.png		sched.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

OneCycleCosine

Usage

References

About

Releases

Packages

Languages

csvance/onecycle-cosine

Folders and files

Latest commit

History

Repository files navigation

OneCycleCosine

Usage

References

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages