# MLRG Deep Curvature

(Updated 19 Apr 2020)

MLRG Deep Curvature is a PyTorch-based [1] package for analysing and visualising the curvature and loss landscapes of neural networks, powered by a GPU-accelerated Lanczos algorithm built on GPyTorch [2].

If you find our package useful for your research, please consider citing:

Diego Granziol\*, Xingchen Wan\*, Timur Garipov\*. MLRG Deep Curvature. arXiv preprint arXiv:1912.09656, 2019.

## Network training and evaluation

The package provides a range of pre-built modern neural network architectures, such as VGG [3] and variants of ResNet [4], along with optimisation schemes beyond those already present in the PyTorch framework, such as K-FAC [5] and SWATS [6]. These facilitate faster training and evaluation of networks, although any PyTorch-compatible optimiser or architecture can be easily integrated into the analysis framework.
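As a rough sketch of the workflow, any standard PyTorch training loop produces a model that can then be handed to the analysis tools. The snippet below uses plain PyTorch only; the package's own training helpers and module names are not reproduced here and may differ.

```python
# A minimal, self-contained PyTorch training step. Any model/optimiser
# defined this way is compatible with the package's analysis framework.
import torch
import torch.nn as nn

model = nn.Sequential(nn.Linear(784, 128), nn.ReLU(), nn.Linear(128, 10))
optimiser = torch.optim.SGD(model.parameters(), lr=0.1, momentum=0.9)
loss_fn = nn.CrossEntropyLoss()

def train_step(x, y):
    optimiser.zero_grad()
    loss = loss_fn(model(x), y)
    loss.backward()
    optimiser.step()
    return loss.item()

x, y = torch.randn(32, 784), torch.randint(0, 10, (32,))  # dummy batch
print(train_step(x, y))
```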

## Eigenspectrum analysis of the curvature matrices

Powered by the Lanczos algorithm seeded with a single random vector, the package uses Pearlmutter's matrix-vector product trick to infer the eigenvalues and eigenvectors of the common curvature matrices of deep neural networks quickly. In addition to the standard Hessian, the Generalised Gauss-Newton (GGN) matrix is also supported.
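To illustrate the underlying idea, the Pearlmutter trick computes a Hessian-vector product Hv with two backward passes, never forming H itself; Lanczos then only requires this matrix-vector closure. The sketch below uses plain `torch.autograd` and does not reproduce the package's GPyTorch-backed Lanczos wrapper.

```python
# Hessian-vector product via double backpropagation (Pearlmutter trick).
import torch

def hessian_vector_product(loss, params, vec):
    """Return Hv, where H is the Hessian of `loss` w.r.t. `params`
    and `vec` is a list of tensors shaped like `params`."""
    grads = torch.autograd.grad(loss, params, create_graph=True)
    dot = sum((g * v).sum() for g, v in zip(grads, vec))
    return torch.autograd.grad(dot, params)

# Check on a quadratic: loss = 0.5 * w^T A w, so Hv = A v.
w = torch.randn(3, requires_grad=True)
A = torch.diag(torch.tensor([2.0, 3.0, 5.0]))
loss = 0.5 * w @ A @ w
print(hessian_vector_product(loss, [w], [torch.ones(3)]))  # (tensor([2., 3., 5.]),)
```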

## Advanced Statistics of Networks

In addition to the statistics commonly used to evaluate network training and performance, such as training and testing losses and accuracy, the package supports the computation of more advanced statistics: the squared mean and variance of gradients and Hessians (and the GGN), the squared norms of the Hessian and GGN, and the L2 and L-infinity norms of the network weights. These statistics are relevant to a wide range of purposes, such as the design of second-order optimisers and of network architectures.
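Some of these quantities are simple to compute directly; the sketch below shows the weight norms and basic gradient moments in plain PyTorch (the function names here are illustrative, not the package's API).

```python
# Illustrative (not the package's API): weight norms and gradient moments.
import torch

def weight_norms(model):
    flat = torch.cat([p.detach().reshape(-1) for p in model.parameters()])
    return {"l2": flat.norm(p=2).item(), "linf": flat.abs().max().item()}

def grad_moments(model):
    g = torch.cat([p.grad.reshape(-1) for p in model.parameters()
                   if p.grad is not None])
    return {"sq_mean": (g.mean() ** 2).item(), "var": g.var().item()}

model = torch.nn.Linear(4, 2)
loss = torch.nn.functional.mse_loss(model(torch.randn(8, 4)), torch.randn(8, 2))
loss.backward()
print(weight_norms(model), grad_moments(model))
```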

## Visualisations

Accompanying visualisation tools are included for all of the main features above. In addition, with the eigen-information obtained, visualisations of the loss landscape are supported by studying the sensitivity of the network to perturbations of its weights. A key difference from some other packages is that, instead of using random directions, this package perturbs the weights explicitly along eigenvector directions.
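The perturbation itself amounts to evaluating the loss at θ + t·v for an eigenvector v and a range of step sizes t. A minimal sketch of that computation is below, using `torch.nn.utils` helpers; the package's plotting utilities are not reproduced.

```python
# Loss along a fixed direction in weight space (e.g. a Hessian eigenvector).
import torch
from torch.nn.utils import parameters_to_vector, vector_to_parameters

def loss_along_direction(model, loss_fn, x, y, direction, ts):
    """Evaluate the loss at theta + t * direction for each t in ts."""
    params = list(model.parameters())
    base = parameters_to_vector(params).detach().clone()
    losses = []
    with torch.no_grad():
        for t in ts:
            vector_to_parameters(base + t * direction, params)
            losses.append(loss_fn(model(x), y).item())
        vector_to_parameters(base, params)  # restore the original weights
    return losses

model = torch.nn.Linear(2, 1)
d = torch.randn(sum(p.numel() for p in model.parameters()))
d /= d.norm()  # unit-norm direction, standing in for an eigenvector
x, y = torch.randn(16, 2), torch.randn(16, 1)
print(loss_along_direction(model, torch.nn.functional.mse_loss,
                           x, y, d, [-1.0, 0.0, 1.0]))
```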

For an illustrated example of its use, please see `example.ipynb`.

## References

1. Paszke, A., Gross, S., Massa, F., Lerer, A., Bradbury, J., Chanan, G., Killeen, T., Lin, Z., Gimelshein, N., Antiga, L. and Desmaison, A., 2019. PyTorch: An imperative style, high-performance deep learning library. In Advances in Neural Information Processing Systems (pp. 8024-8035).
2. Gardner, J., Pleiss, G., Weinberger, K.Q., Bindel, D. and Wilson, A.G., 2018. GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In Advances in Neural Information Processing Systems (pp. 7576-7586).
3. Simonyan, K. and Zisserman, A., 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556.
4. He, K., Zhang, X., Ren, S. and Sun, J., 2016. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 770-778).
5. Martens, J. and Grosse, R., 2015. Optimizing neural networks with Kronecker-factored approximate curvature. In International Conference on Machine Learning (pp. 2408-2417).
6. Keskar, N.S. and Socher, R., 2017. Improving generalization performance by switching from Adam to SGD. arXiv preprint arXiv:1712.07628.
