Stagewise Locally-Regularized LookAhead (SLRLA)

Towards Understanding Why Lookahead Generalizes Better Than SGD and Beyond
Pan Zhou ^*, Hanshu Yan ^*, Xiaotong Yuan ^{^}, Jiashi Feng ^*, Shuicheng Yan ^*
^* Sea AI Lab, Sea Group, ^{^} Nanjing University of Information Science & Technology
Neural Information Processing Systems (NeurIPS), 2021

Requirements

This repository provides codes for SLRLA.

Python = 3.6
Pytorch = 1.6
CUDA = 10.1

Experimental Results

Here we list experimental results on CIFAR10 and CIFAR100. Please refer to the main paper for detailed experimental settings and more results on ImageNet.

CIFAR10

Optimizer	ResNet-18	VGG-16	WRN-16-10
stagewise SGD	95.23	92.13	95.51
stagewise LA	95.27	92.38	95.73
SLRLA	95.47	92.63	96.08

CIFAR100

Optimizer	ResNet-18	VGG-16	WRN-16-10
stagewise SGD	78.24	69.97	78.95
stagewise LA	78.34	70.2	79.54
SLRLA	78.58	70.63	79.85

ImageNet

Optimizer	ResNet-18
stagewise SGD	70.23
stagewise LA	70.30
SLRLA	70.47

Run scripts

CUDA_VISIBLE_DEVICES=0,1 python main.py --dataset CIFAR10 --network ResNet-18 --input_norm --exp_name sgd_vanilla --wd 1e-3 --lr 1e-1

CUDA_VISIBLE_DEVICES=0 python main.py --dataset CIFAR10 --network ResNet-18 --input_norm --exp_name sgd_LA --wd 1e-3 --lr 1e-1 --lookahead 5_0.8

CUDA_VISIBLE_DEVICES=0 python main.py --dataset CIFAR10 --network ResNet-18 --input_norm --exp_name sgd_SLRLA --wd 1e-3 --lr 1e-1 --lookahead 5_0.8 --slr 5_0.2

LICENSE

This repo is under the Apache-2.0 license. For commercial use, please contact the authors.

Bibtex

@inproceedings{Zhou2021LA,
author = {Pan Zhou and Hanshu Yan and Xiaotong Yuan and Jiashi Feng and Shuicheng Yan}
title = {Towards Understanding Why Lookahead Generalizes Better Than SGD and Beyond},
booktitle = {NeurIPS},
year = {2021}
}

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
models		models
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
_preamble_.py		_preamble_.py
dataset.py		dataset.py
main.py		main.py
run.sh		run.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

models

models

utils

utils

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

_preamble_.py

_preamble_.py

dataset.py

dataset.py

main.py

main.py

run.sh

run.sh

Repository files navigation

Stagewise Locally-Regularized LookAhead (SLRLA)

Requirements

Experimental Results

CIFAR10

CIFAR100

ImageNet

Run scripts

LICENSE

Bibtex

About

Releases

Packages

Contributors 2

Languages

License

sail-sg/SLRLA-optimizer

Folders and files

Latest commit

History

Repository files navigation

Stagewise Locally-Regularized LookAhead (SLRLA)

Requirements

Experimental Results

CIFAR10

CIFAR100

ImageNet

Run scripts

LICENSE

Bibtex

About

Resources

License

Stars

Watchers

Forks

Languages