Potentially swap to SWA's learning rate schedule for CIFAR baselines #233

dustinvtran · 2020-03-08T19:28:58Z

From preliminary experiments, the LR schedule from the SWA papers (https://github.com/timgaripov/swa/blob/master/train.py#L94) seems to improve the baseline results (at least for deterministic and dropout). Upgrading to that one may close the gap from our deterministic baseline which reproduces the original paper of 96.0% (and we get 0.154 NLL). Their papers' baseline reports 96.4% and 0.12 NLL. (Same for CIFAR-100.)

dustinvtran · 2020-08-16T21:12:40Z

Moved to google/uncertainty-baselines#64.

dustinvtran mentioned this issue Aug 16, 2020

Potentially swap to SWA's learning rate schedule for CIFAR baselines google/uncertainty-baselines#64

Open

dustinvtran closed this as completed Aug 16, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Potentially swap to SWA's learning rate schedule for CIFAR baselines #233

Potentially swap to SWA's learning rate schedule for CIFAR baselines #233

dustinvtran commented Mar 8, 2020 •

edited

dustinvtran commented Aug 16, 2020

Potentially swap to SWA's learning rate schedule for CIFAR baselines #233

Potentially swap to SWA's learning rate schedule for CIFAR baselines #233

Comments

dustinvtran commented Mar 8, 2020 • edited

dustinvtran commented Aug 16, 2020

dustinvtran commented Mar 8, 2020 •

edited