When More Parameters Reduce Training Performance: Linear Neural Networks

Code for showing that neural networks without activations, or Linear Neural Networks (LNNs), despite being no more expressive than a single linear map, are actually harder to optimize because of their excess parameters. With more parameters, the update to each parameter in an iterative optimization method is determined by other parameters that are themselves still suboptimal. The resulting objective function is nonconvex, which leads to local minima in optimization; the paper demonstrates this empirically.

train_linear_model.py provides all the code required to run the experiments.
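
To make the coupling between parameters concrete, here is a minimal sketch that is not taken from train_linear_model.py. It assumes a scalar toy problem, a noiseless target with slope 2, and plain gradient descent; the function names, learning rate, and step count are all hypothetical choices made for illustration. It compares fitting a single weight `w` against fitting the product `a * b`, where the gradient for `a` is scaled by the current value of `b` and vice versa.

```python
import numpy as np

def grad_linear(w, x, y):
    """Gradient of mean((w*x - y)^2) with respect to w."""
    return 2.0 * np.mean((w * x - y) * x)

def grad_factored(a, b, x, y):
    """Gradients of mean((a*b*x - y)^2) with respect to a and b."""
    shared = 2.0 * np.mean((a * b * x - y) * x)
    # Each parameter's gradient is scaled by the *other* parameter.
    return b * shared, a * shared

def fit_linear(x, y, w=0.0, lr=0.1, steps=200):
    """Gradient descent on the one-parameter model w*x."""
    for _ in range(steps):
        w -= lr * grad_linear(w, x, y)
    return w

def fit_factored(x, y, a=0.0, b=0.0, lr=0.1, steps=200):
    """Gradient descent on the two-parameter model a*b*x."""
    for _ in range(steps):
        ga, gb = grad_factored(a, b, x, y)
        a, b = a - lr * ga, b - lr * gb
    return a, b

# Toy data (an assumption of this sketch): noiseless line with slope 2.
x = np.linspace(-1.0, 1.0, 21)
y = 2.0 * x

w_hat = fit_linear(x, y)                            # starts at w = 0
a_zero, b_zero = fit_factored(x, y)                 # starts at a = b = 0
a_hat, b_hat = fit_factored(x, y, a=0.5, b=0.5)     # nonzero start

print("linear model slope:", w_hat)
print("factored model from zero init:", a_zero * b_zero)
print("factored model from nonzero init:", a_hat * b_hat)
```

In this toy setting the single-weight model recovers the slope from a zero initialization, while the factored model started at `a = b = 0` never moves, because both gradients are multiplied by zero. That point is a stationary point of a nonconvex loss (a saddle here, rather than a local minimum), so the sketch only illustrates the parameter coupling described above and does not reproduce the paper's experiments.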

anish-lakkapragada/Linear-Neural-Nets-Suck
