Add KFAC optimizer #35801
Labels
module: optimizer
Related to torch.optim
needs research
We need to decide whether or not this merits inclusion, based on research world
triaged
This issue has been looked at a team member, and triaged and prioritized into an appropriate module
🚀 Feature
Add the Kronecker-factored Approximate Curvature (KFAC) optimizer and/or E-KFAC.
Motivation
These are both approximate second order methods with promising results in terms of convergence rate. KFAC uses an block-diagonal approximation to the Fisher information matrix to approximate natural gradients. This allows for larger learning rates and thus improved convergence rate. Having a fully features built-in optimizer allows researchers and deep learning engineers to quickly iterate and test.
Pitch
A KFAC implementation that works out of the box for all supported layers should be added to the built-in optimizers.
Additional context
An unmaintained Pytorch implementation
The standard TF implementation
KFAC for CNN
KFAC for RNN
cc @vincentqb
The text was updated successfully, but these errors were encountered: