SINGD: Structured Inverse-Free Natural Gradient Descent

This package contains the official PyTorch implementation of our memory-efficient and numerically stable KFAC variant, termed SINGD (paper).

The main feature is a torch.optim.Optimizer which works like most PyTorch optimizers and is compatible with standard PyTorch training pipelines, including distributed data-parallel (DDP) training (see footnote 1).

The pre-conditioner matrices support different structures that allow reducing memory and compute cost (overview); a hedged sketch of selecting one follows below.
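
The snippet below is a minimal sketch of what picking a cheaper structure might look like. The structures keyword and the "diagonal" identifier are assumptions based on the overview, not a verified API reference; consult the documentation for the supported options.

    from torch import nn

    from singd.optim.optimizer import SINGD  # assumed import path

    model = nn.Sequential(nn.Linear(784, 128), nn.ReLU(), nn.Linear(128, 10))

    # The default uses dense pre-conditioner factors; a structured alternative
    # such as diagonal factors trades pre-conditioner fidelity for lower
    # memory and compute. The `structures` keyword and the "diagonal" value
    # are assumptions here (see the structure overview for what is supported).
    optimizer = SINGD(model, structures=("diagonal", "diagonal"))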

Installation

  • Stable (recommended):

    pip install singd
  • Latest version from GitHub main branch:

    pip install git+https://github.com/f-dangel/singd.git@main

Usage
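
A minimal sketch of a training loop with SINGD, assuming the optimizer is importable as singd.optim.optimizer.SINGD and is constructed from the model itself; apart from construction, it is used like any other torch.optim.Optimizer.

    import torch

    from singd.optim.optimizer import SINGD  # assumed import path

    # Toy setup: a small MLP on random data.
    model = torch.nn.Sequential(
        torch.nn.Linear(10, 32), torch.nn.ReLU(), torch.nn.Linear(32, 2)
    )
    loss_func = torch.nn.CrossEntropyLoss()

    # Constructed from the model (assumption), then used like a regular optimizer.
    optimizer = SINGD(model)

    X, y = torch.randn(64, 10), torch.randint(0, 2, (64,))

    for step in range(10):
        optimizer.zero_grad()
        loss = loss_func(model(X), y)
        loss.backward()
        optimizer.step()  # update parameters and pre-conditioners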

Limitations

  • SINGD does not support graph neural networks (GNNs).

  • SINGD currently does not support gradient clipping.

  • The code has stabilized only recently. Expect things to break and help us improve by filing issues.

Citation

If you find this code useful for your research, consider citing the paper:

@article{lin2023structured,
  title =  {Structured Inverse-Free Natural Gradient: Memory-Efficient \&
            Numerically-Stable KFAC for Large Neural Nets},
  author = {Lin, Wu and Dangel, Felix and Eschenhagen, Runa and Neklyudov, Kirill
            and Kristiadi, Agustinus and Turner, Richard E and Makhzani, Alireza},
  year =   2023,
}

Footnotes

  1. We do support standard DDP, with one crucial difference: the model should not be wrapped with the DDP wrapper; the rest, e.g. launching with the torchrun command, stays the same (see the sketch below).
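
To make footnote 1 concrete, here is a hedged sketch of a script launched via torchrun: everything follows the usual DDP recipe except that the model is not wrapped in torch.nn.parallel.DistributedDataParallel before being handed to SINGD. The import path and SINGD's model argument are assumptions, and the model itself is a placeholder.

    # Hedged sketch of DDP-style training with SINGD, launched via `torchrun`.
    # Key difference from standard DDP: the model is NOT wrapped in
    # torch.nn.parallel.DistributedDataParallel before being passed to SINGD.
    import os

    import torch
    import torch.distributed as dist

    from singd.optim.optimizer import SINGD  # assumed import path

    dist.init_process_group(backend="nccl")
    local_rank = int(os.environ["LOCAL_RANK"])
    torch.cuda.set_device(local_rank)
    device = f"cuda:{local_rank}"

    model = torch.nn.Linear(10, 2).to(device)  # placeholder model
    # Standard DDP would wrap the model here; with SINGD, skip the wrapper.
    optimizer = SINGD(model)
    loss_func = torch.nn.CrossEntropyLoss()

    X = torch.randn(8, 10, device=device)
    y = torch.randint(0, 2, (8,), device=device)

    optimizer.zero_grad()
    loss = loss_func(model(X), y)
    loss.backward()
    optimizer.step()

    dist.destroy_process_group()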