Skip to content

lukearend/linear-gradient-descent

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

linear-gradient-descent

This repository investigates gradient descent in linear networks as described in the paper Exact solutions to the nonlinear dynamics of learning in deep linear neural networks by Andrew Saxe, James McClelland, and Surya Ganguli.

Everything is in the self-contained notebook linear-gradient-descent.ipynb, which

  1. explains and implements gradient descent for the linear network,
  2. trains a linear network to solve a classification task, and
  3. investigates a learning regime called dynamical isometry.

To run the notebook yourself,

  1. clone this repo: git clone ...
  2. run make env at the top level to build a python environment with the packages listed in requirements.txt
  3. run make notebook to start a local Jupyter notebook server
  4. browse to main.ipynb and open it

About

gradient descent in deep linear networks

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published