
Warm restart policy is available now #6130

Closed
AutuanLiu wants to merge 32 commits

Conversation

AutuanLiu

• Please tell me if I violated any rules.

@ssnl
Collaborator

ssnl commented Mar 30, 2018

Can you also add tests for this in test_optim.py?

@ezyang
Contributor

ezyang commented Mar 30, 2018

@pytorchbot test this please

@AutuanLiu
Author

ok

@ezyang
Contributor

ezyang commented Apr 1, 2018

@pytorchbot test this please

@ezyang
Contributor

ezyang commented Apr 1, 2018

@pytorchbot test this please

@ezyang
Contributor

ezyang commented Apr 2, 2018

@pytorchbot test this please

    (1 + math.cos(math.pi * self.last_epoch / self.T_max)) / 2
    if self.restart and self.last_epoch == self.T_max:
        self.last_epoch = 0
        self.T_max *= self.T_mult
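
For context, a minimal sketch of how that fragment would sit inside the patched get_lr; the surrounding lines are reconstructed from the stock CosineAnnealingLR formula and are an assumption, not the PR's exact diff:

    def get_lr(self):
        # Standard cosine annealing from base_lr down to eta_min.
        lrs = [self.eta_min + (base_lr - self.eta_min) *
               (1 + math.cos(math.pi * self.last_epoch / self.T_max)) / 2
               for base_lr in self.base_lrs]
        # Proposed warm restart: once a cycle ends, reset the epoch counter
        # and stretch the next cycle by T_mult.
        if self.restart and self.last_epoch == self.T_max:
            self.last_epoch = 0
            self.T_max *= self.T_mult
        return lrs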

* Actually, the restart argument is redundant, because T_max will equal the number of training epochs when the warm restart policy is not used.
* If we want to apply the warm restart policy, we need to set T_max < the number of training epochs.

Args:
    optimizer (Optimizer): Wrapped optimizer.
    T_max (int): Maximum number of iterations.
    eta_min (float): Minimum learning rate. Default: 0.
    T_mult (int): Multiplicative factor of T_max. Default: 2.
    restart (bool): If True, the warm restart policy will be used.
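
To make the proposed semantics concrete, a usage sketch of the (never merged) API; the dummy parameter, T_max=10, and the 100-epoch loop are illustrative assumptions:

    import torch

    # Hypothetical usage of the proposed restart flag.
    param = torch.nn.Parameter(torch.zeros(1))   # stand-in for model parameters
    optimizer = torch.optim.SGD([param], lr=0.05)
    # First cycle lasts 10 epochs; each later cycle is T_mult times longer: 10, 20, 40, ...
    scheduler = CosineAnnealingLR(optimizer, T_max=10, eta_min=0, T_mult=2, restart=True)
    for epoch in range(100):
        optimizer.step()    # training step would go here
        scheduler.step()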


    single_targets = [eta_min + (0.05 - eta_min) * (1 + math.cos(math.pi * x / y)) / 2
                      for x, y in zip(T_cur, T_i)]
    targets = [single_targets, list(map(lambda x: x * epochs, single_targets))]
    scheduler = CosineAnnealingLR(self.opt, T_max=T_max, eta_min=eta_min, T_mult=T_mult, restart=True)


@ssnl
Collaborator

ssnl commented Apr 2, 2018

@pytorchbot add to whitelist

@ezyang
Contributor

ezyang commented Apr 16, 2018

@ssnl Do you think this is OK to merge now?

@apaszke
Contributor

apaszke commented Apr 16, 2018

@ezyang it's not. See our discussion above, which hasn't concluded yet.

        self.cycle += 1
    else:
        self.cycle = int(math.floor(math.log(epoch / self.T_max * (self.T_mult - 1) + 1, self.T_mult)))
        epoch -= sum(self.T_max * self.T_mult ** x for x in range(self.cycle))
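
The else branch inverts the geometric series of past cycle lengths: after c full cycles of lengths T_max, T_max * T_mult, ..., T_max * T_mult ** (c - 1), the elapsed epochs total T_max * (T_mult ** c - 1) / (T_mult - 1). Taking the largest integer c with that sum <= epoch yields the floor/log expression above, and the subtraction then rebases epoch to the start of the current cycle. A standalone sanity check (local names here, not the scheduler's attributes):

    import math

    T_max, T_mult = 10, 2   # cycle lengths 10, 20, 40, ...; cycles start at epochs 0, 10, 30
    for epoch in (0, 9, 10, 29, 30, 69):
        cycle = int(math.floor(math.log(epoch / T_max * (T_mult - 1) + 1, T_mult)))
        start = T_max * (T_mult ** cycle - 1) // (T_mult - 1)  # epochs before this cycle
        print(epoch, cycle, epoch - start)  # cycle comes out 0, 0, 1, 1, 2, 2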


@zou3519 added the awaiting response (this tag is deprecated) label Jul 10, 2018
@yf225
Contributor

yf225 commented Jul 10, 2018

@AutuanLiu Let us know if you have time to address @apaszke and @ssnl 's comments, thanks!

@AutuanLiu
Author

@yf225 I'm so sorry, but I don't have time to address and work through these comments.

@loshchil

Sorry for not providing the actual working code, but our implementation of restarts for TensorFlow might be useful as a reference:
https://github.com/tensorflow/tensorflow/blob/25c197e02393bd44f50079945409009dd4d434f8/tensorflow/python/training/learning_rate_decay.py#L514

@AlexMRuch

Looking forward to seeing this function!

@zdevito removed their request for review February 13, 2019 01:23
@gchanan removed their request for review February 28, 2019 16:28
@danieltudosiu

danieltudosiu commented Mar 12, 2020

I took a shot at porting the TF one, but I am not sure it is correct since I just started using PyTorch. Could you please give me your two cents?

import math
import warnings

from torch.optim.lr_scheduler import _LRScheduler


class CosineDecayRestarts(_LRScheduler):
    def __init__(
        self,
        optimizer,
        first_decay_steps,
        t_mul=2.0,
        m_mul=1.0,
        alpha=0.0,
        last_epoch=-1,
    ):
        self.first_decay_steps = first_decay_steps
        self.t_mul = t_mul
        self.m_mul = m_mul
        self.alpha = alpha

        super(CosineDecayRestarts, self).__init__(optimizer, last_epoch)

    def get_lr(self):
        if not self._get_lr_called_within_step:
            warnings.warn("To get the last learning rate computed by the scheduler, "
                          "please use `get_last_lr()`.", DeprecationWarning)

        if self.last_epoch == 0:
            return self.base_lrs

        # Decay the *initial* learning rates, as the TF reference does;
        # decaying group['lr'] in place would compound the factor every step.
        return [self._calculate_decayed_lr(base_lr) for base_lr in self.base_lrs]

    def _calculate_decayed_lr(self, base_lr):
        # Fraction of the first cycle completed so far (may exceed 1.0).
        # Uses last_epoch, consistent with the early return above;
        # _step_count is offset by one relative to it.
        completed_fraction = self.last_epoch / self.first_decay_steps

        if self.t_mul != 1.0:
            # Invert the geometric series of cycle lengths to find the
            # current restart index and the position within that cycle.
            i_restart = math.floor(
                math.log(1 - completed_fraction * (1 - self.t_mul)) / math.log(self.t_mul)
            )
            sum_r = (1.0 - self.t_mul ** i_restart) / (1.0 - self.t_mul)
            completed_fraction = (completed_fraction - sum_r) / self.t_mul ** i_restart
        else:
            # Equal-length cycles: the restart index is just the integer part.
            i_restart = math.floor(completed_fraction)
            completed_fraction -= i_restart

        # Each restart's peak is scaled by m_mul; alpha sets the floor.
        m_fac = self.m_mul ** i_restart
        cosine_decayed = 0.5 * m_fac * (1.0 + math.cos(math.pi * completed_fraction))
        decayed = (1 - self.alpha) * cosine_decayed + self.alpha

        return base_lr * decayed
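
If it helps review, a quick smoke test of the class above; the dummy parameter and step counts are arbitrary assumptions. With m_mul=0.5, the learning rate should jump back to half its previous peak once the first 10-step cycle ends:

    import torch

    param = torch.nn.Parameter(torch.zeros(1))   # dummy parameter
    opt = torch.optim.SGD([param], lr=0.1)
    sched = CosineDecayRestarts(opt, first_decay_steps=10, t_mul=2.0, m_mul=0.5)
    for step in range(30):
        opt.step()
        sched.step()
        print(step, sched.get_last_lr())         # restart visible after step 10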

@github-actions
Contributor

Looks like this PR hasn't been updated in a while so we're going to go ahead and mark this as Stale.
Feel free to remove the Stale label if you feel this was a mistake.
If you are unable to remove the Stale label please contact a maintainer in order to do so.
Stale pull requests will automatically be closed 30 days after being marked Stale

@github-actions bot added the Stale label Mar 23, 2022
@facebook-github-bot
Contributor

Hi @AutuanLiu!

Thank you for your pull request and welcome to our community.

Action Required

In order to merge any pull request (code, docs, etc.), we require contributors to sign our Contributor License Agreement, and we don't seem to have one on file for you.

Process

In order for us to review and merge your suggested changes, please sign at https://code.facebook.com/cla. If you are contributing on behalf of someone else (eg your employer), the individual CLA may not be sufficient and your employer may need to sign the corporate CLA.

Once the CLA is signed, our tooling will perform checks and validations. Afterwards, the pull request will be tagged with CLA signed. The tagging process may take up to 1 hour after signing. Please give it that time before contacting us about it.

If you have received this in error or have any questions, please contact us at cla@fb.com. Thanks!

@github-actions bot removed the Stale label Mar 29, 2022
@github-actions
Contributor

Looks like this PR hasn't been updated in a while so we're going to go ahead and mark this as Stale.
Feel free to remove the Stale label if you feel this was a mistake.
If you are unable to remove the Stale label please contact a maintainer in order to do so.
If you want the bot to never mark this PR stale again, add the no-stale label.
Stale pull requests will automatically be closed after 30 days of inactivity.

@github-actions bot added the Stale label May 28, 2022
@github-actions bot closed this Jun 27, 2022