Sets the learning rate of each parameter group using a cosine annealing schedule.
This repository contains the core implementation of SGDR: Stochastic Gradient Descent with Warm Restarts. Please refer to the original paper (https://arxiv.org/pdf/1608.03983.pdf) for more details.
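
As a rough illustration of the schedule described in the paper, the sketch below computes the learning rate for a given epoch using the cosine annealing rule eta_t = eta_min + 0.5 * (eta_max - eta_min) * (1 + cos(pi * T_cur / T_i)), restarting whenever a cycle ends. It is not this repository's code: the function name `sgdr_lr`, its parameter names, and the default values are assumptions chosen for the example.

```python
import math


def sgdr_lr(epoch, eta_max=0.1, eta_min=0.0, t_0=10, t_mult=2):
    """Learning rate at `epoch` under cosine annealing with warm restarts.

    Hypothetical helper for illustration only. Assumes t_0 > 0 and
    t_mult >= 1. `epoch` may be fractional to model per-batch updates.
    """
    t_i = t_0      # length of the current cycle, in epochs
    t_cur = epoch  # epochs elapsed since the last restart

    # Skip over completed cycles; each new cycle is t_mult times longer.
    while t_cur >= t_i:
        t_cur -= t_i
        t_i *= t_mult

    # Cosine decay from eta_max (start of cycle) toward eta_min (end of cycle).
    cosine = 1 + math.cos(math.pi * t_cur / t_i)
    return eta_min + 0.5 * (eta_max - eta_min) * cosine


if __name__ == "__main__":
    # Print the schedule across the first two cycles (10 and 20 epochs long).
    for epoch in range(0, 31, 5):
        print(epoch, round(sgdr_lr(epoch), 4))
```

With these assumed defaults, the rate starts at `eta_max`, decays over 10 epochs, jumps back to `eta_max` at epoch 10, and then decays over a cycle twice as long. Setting `t_mult=1` keeps every cycle the same length.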