linGrad

linGrad selects the initial stepsize (learning rate) for gradient descent using the linear range.

The linear range is the range of parameter perturbations that lead to approximately linear perturbations in the states of a network. It is computed from the difference between the actual perturbation in the states and the tangent solution. In linGrad, the optimal initial stepsize is one for which the parameter changes on all minibatches stay within the linear range.
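The idea can be illustrated with a minimal, self-contained sketch. It is not the code in `src` and makes several simplifying assumptions: the network is a single tanh layer, the "states" are its hidden activations, one fixed update direction `dw` is reused for every minibatch, the nonlinearity measure is a relative norm of the gap between the actual perturbation and the tangent solution, and the stepsize is found by simple halving. All names (`states`, `tangent`, `nonlinearity`, `initial_stepsize`) and the tolerance `tol` are hypothetical; the paper gives the actual definitions.

```python
import math

def states(w, x):
    """Hidden states of a one-layer tanh network: h_i = tanh(sum_j w[i][j] * x[j])."""
    return [math.tanh(sum(wij * xj for wij, xj in zip(row, x))) for row in w]

def tangent(w, dw, x):
    """Tangent solution: derivative of the states along the parameter direction dw."""
    h = states(w, x)
    return [(1.0 - hi * hi) * sum(dij * xj for dij, xj in zip(drow, x))
            for hi, drow in zip(h, dw)]

def norm(v):
    return math.sqrt(sum(vi * vi for vi in v))

def nonlinearity(w, dw, batch, eta):
    """Relative gap between the actual state perturbation and eta * tangent (assumed measure)."""
    w_new = [[wij + eta * dij for wij, dij in zip(row, drow)]
             for row, drow in zip(w, dw)]
    gap, linear = [], []
    for x in batch:
        h0, h1, t = states(w, x), states(w_new, x), tangent(w, dw, x)
        for a, b, ti in zip(h1, h0, t):
            linear.append(eta * ti)
            gap.append((a - b) - eta * ti)
    # Guard against a vanishing tangent solution.
    return norm(gap) / max(norm(linear), 1e-300)

def initial_stepsize(w, dw, minibatches, eta0=1.0, tol=0.1, max_halvings=50):
    """Halve eta until the perturbation on every minibatch is within the linear range."""
    eta = eta0
    for _ in range(max_halvings):
        if all(nonlinearity(w, dw, batch, eta) <= tol for batch in minibatches):
            return eta
        eta *= 0.5
    return eta

# Toy data: a 2x2 weight matrix, an update direction, and two minibatches.
w = [[0.5, -0.3], [0.2, 0.8]]
dw = [[1.0, 0.0], [0.0, 1.0]]
minibatches = [[[1.0, 2.0]], [[-1.0, 0.5], [0.3, 0.3]]]
eta = initial_stepsize(w, dw, minibatches)
```

In this sketch the returned `eta` is the first stepsize in the halving sequence for which the measured nonlinearity is at most `tol` on every minibatch. In practice the update direction would typically differ from one minibatch to the next.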

For a detailed explanation, see the accompanying paper: https://arxiv.org/abs/1905.04561.

