
LBFGS optimizer

This repository provides an improved LBFGS (and LBFGS-B) optimizer for PyTorch. Further details are given in the accompanying paper; see also the linked introduction.


Files included are:

lbfgsnew.py: New LBFGS optimizer

lbfgsb.py: LBFGS-B optimizer (with bound constraints)

lbfgs.py: Symlink to lbfgsnew.py

cifar10_resnet.py: CIFAR10 ResNet training example (see figures below)

kan_pde.py: Kolmogorov-Arnold network (KAN) PDE example

Figure: ResNet18/101 training loss and training time

The figure above shows training loss and training time on Google Colab with a single GPU, using ResNet18 and ResNet101 models. Test accuracy after 20 epochs is 84% for LBFGS and 82% for Adam.

Changing the activation from the commonly used ReLU to alternatives such as ELU gives faster convergence with LBFGS, as seen in the figure below.

Figure: Wide ResNet 50-2 training loss
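
Swapping the activation is a one-line change in a typical PyTorch model definition. A minimal sketch (not the repository's ResNet code):

import torch.nn as nn

# ReLU variant of a simple block
block_relu = nn.Sequential(nn.Linear(128, 128), nn.ReLU())
# ELU variant: only the activation module changes
block_elu = nn.Sequential(nn.Linear(128, 128), nn.ELU())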

Below is a comparison of training loss and test accuracy for ResNet9 trained with LBFGS and Adam.

Figure: ResNet9 training loss and test accuracy

Example usage in full batch mode:

from lbfgsnew import LBFGSNew
optimizer = LBFGSNew(model.parameters(), history_size=7, max_iter=100, line_search_fn=True, batch_mode=False)
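
Since LBFGS re-evaluates the objective several times per step, optimizer.step() is given a closure, as with torch.optim.LBFGS. A minimal full-batch sketch under that assumption; the model, data, and criterion below are placeholders:

import torch
from lbfgsnew import LBFGSNew

# Placeholder model and data; substitute your own.
model = torch.nn.Linear(10, 1)
inputs = torch.randn(100, 10)
targets = torch.randn(100, 1)
criterion = torch.nn.MSELoss()

optimizer = LBFGSNew(model.parameters(), history_size=7, max_iter=100,
                     line_search_fn=True, batch_mode=False)

def closure():
    # Called repeatedly by the optimizer during the line search.
    optimizer.zero_grad()
    loss = criterion(model(inputs), targets)
    loss.backward()
    return loss

optimizer.step(closure)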

Example usage in minibatch mode:

from lbfgsnew import LBFGSNew
optimizer = LBFGSNew(model.parameters(), history_size=7, max_iter=2, line_search_fn=True, batch_mode=True)
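
In minibatch mode the same closure pattern is applied per batch, with max_iter kept small. A sketch under the same assumptions; the network and data loader are synthetic placeholders:

import torch
from lbfgsnew import LBFGSNew

# Placeholder network and synthetic data; substitute your own.
net = torch.nn.Sequential(torch.nn.Linear(10, 32), torch.nn.ELU(), torch.nn.Linear(32, 1))
dataset = torch.utils.data.TensorDataset(torch.randn(256, 10), torch.randn(256, 1))
trainloader = torch.utils.data.DataLoader(dataset, batch_size=32, shuffle=True)
criterion = torch.nn.MSELoss()

optimizer = LBFGSNew(net.parameters(), history_size=7, max_iter=2,
                     line_search_fn=True, batch_mode=True)

for epoch in range(5):
    for inputs, targets in trainloader:
        def closure():
            optimizer.zero_grad()
            loss = criterion(net(inputs), targets)
            loss.backward()
            return loss
        optimizer.step(closure)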

Note: for certain problems, the gradient itself can be part of the cost function, for example in total variation (TV) regularization. In such cases, pass the option cost_use_gradient=True to LBFGSNew(). This increases the computational cost, so enable it only when needed.
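
As a purely hypothetical illustration of such a cost (the gradient-penalty term and its 0.01 weight are invented for this sketch, not taken from the repository), the closure keeps the graph alive with create_graph=True so the penalty itself is differentiable:

import torch
from lbfgsnew import LBFGSNew

# Placeholder model and data; substitute your own.
model = torch.nn.Linear(10, 1)
inputs = torch.randn(32, 10, requires_grad=True)
targets = torch.randn(32, 1)
criterion = torch.nn.MSELoss()

optimizer = LBFGSNew(model.parameters(), history_size=7, max_iter=2,
                     line_search_fn=True, batch_mode=True,
                     cost_use_gradient=True)

def closure():
    optimizer.zero_grad()
    outputs = model(inputs)
    data_loss = criterion(outputs, targets)
    # The cost depends on a gradient, so the graph is kept (create_graph=True).
    grad_in, = torch.autograd.grad(outputs.sum(), inputs, create_graph=True)
    loss = data_loss + 0.01 * grad_in.pow(2).mean()
    loss.backward()
    return loss

optimizer.step(closure)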