
Potentially swap to SWA's learning rate schedule for CIFAR baselines #64

Open

dustinvtran opened this issue Aug 16, 2020 · 0 comments

dustinvtran (Member) commented:
From preliminary experiments, the learning rate schedule from the SWA papers (https://github.com/timgaripov/swa/blob/master/train.py#L94) seems to improve the baseline results (at least for the deterministic and dropout baselines). Switching to that schedule may close the gap between our deterministic baseline, which reproduces the original paper's 96.0% accuracy (we also get 0.154 NLL), and the SWA papers' reported baseline of 96.4% accuracy and 0.12 NLL. (The same gap appears on CIFAR-100.)
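For reference, the schedule at the linked line is a piecewise one: constant at the base LR for the first 50% of training, linear decay toward a small floor between 50% and 90%, then constant at the floor. A minimal standalone sketch (argument names like `base_lr`, `total_epochs`, and `lr_ratio` are illustrative, not taken from either repo):

```python
def swa_style_lr(epoch, base_lr, total_epochs, lr_ratio=0.01):
    """Piecewise LR schedule in the style of timgaripov/swa's train.py.

    Holds base_lr for the first 50% of training, decays linearly to
    lr_ratio * base_lr between 50% and 90%, then stays at the floor.
    """
    t = epoch / total_epochs
    if t <= 0.5:
        factor = 1.0
    elif t <= 0.9:
        # Linear interpolation from 1.0 down to lr_ratio over [0.5, 0.9].
        factor = 1.0 - (1.0 - lr_ratio) * (t - 0.5) / 0.4
    else:
        factor = lr_ratio
    return base_lr * factor
```

In the SWA setting the floor is the SWA averaging LR (`swa_lr / lr`); for a plain baseline run, the linked code uses a fixed ratio of 0.01.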

google/edward2#233
