
Improving neural net convergence speeds via post-activation invariant actions with respect to the loss function.


baubels/gradient_teleportation


Unofficial implementation of Symmetry Teleportation for Accelerated Optimization

I implement the paper Symmetry Teleportation for Accelerated Optimization (https://arxiv.org/abs/2205.10637) in TensorFlow and find fruitful results on the Fashion-MNIST dataset using a slightly different teleportation scheme. The image below shows the paper's general training results from applying teleportation to highly non-convex functions.

[Figure: the paper's training results, showing the effect of teleportation on highly non-convex functions]

A personal synopsis:

The loss function of a deep neural net contains symmetries with respect to its weight layers, and under specific conditions these symmetries can be exploited via group-invariant (equi-energy, contour-preserving) actions. These previously unexploited symmetries allow the model to improve convergence speed by effectively teleporting its weights in a way that preserves its output, yet gets it out of non-optimal troughs and poorly learnable regions, hence improving the speed at which it learns. The paper uses Lie groups and gradient ascent to optimise the teleportations; I go a different way by sampling suitable teleports randomly and choosing the best one.
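To make the sampling scheme concrete, here is a minimal sketch. It is not the repository's code; it illustrates the idea under one specific symmetry assumption: the positive-scaling invariance of ReLU, relu(c·z) = c·relu(z) for c > 0, which means scaling a hidden unit's incoming weights by c and its outgoing weights by 1/c leaves the network output (and therefore the loss) unchanged. The helper names loss_and_grad_norm and teleport_by_sampling are hypothetical.

```python
# Minimal sketch (not the repository's actual code) of random-sampling
# teleportation for a two-layer ReLU net. Assumes the positive-scaling
# symmetry of ReLU: replacing (W1, b1, W2) with (c*W1, c*b1, W2/c) for a
# positive per-unit scaling c leaves the output, and hence the loss, unchanged.
import tensorflow as tf

def loss_and_grad_norm(W1, b1, W2, b2, x, y):
    """Cross-entropy loss and the norm of its gradient w.r.t. the weights."""
    with tf.GradientTape() as tape:
        tape.watch([W1, b1, W2, b2])
        h = tf.nn.relu(x @ W1 + b1)
        logits = h @ W2 + b2
        loss = tf.reduce_mean(tf.nn.sparse_softmax_cross_entropy_with_logits(
            labels=y, logits=logits))
    grads = tape.gradient(loss, [W1, b1, W2, b2])
    return loss, tf.linalg.global_norm(grads)

def teleport_by_sampling(W1, b1, W2, b2, x, y, n_samples=20, spread=0.5):
    """Sample loss-preserving rescalings; keep the one with the largest
    gradient norm (a stand-in for the paper's gradient-ascent step)."""
    best = (W1, b1, W2, b2)
    _, best_norm = loss_and_grad_norm(W1, b1, W2, b2, x, y)
    n_hidden = W1.shape[1]
    for _ in range(n_samples):
        # strictly positive per-unit scaling, so relu(c*z) = c*relu(z)
        c = tf.exp(tf.random.normal([n_hidden], stddev=spread))
        W1_t, b1_t = W1 * c, b1 * c    # scale each unit's pre-activation by c
        W2_t = W2 / c[:, None]         # undo the scaling in the next layer
        _, g_norm = loss_and_grad_norm(W1_t, b1_t, W2_t, b2, x, y)
        if g_norm > best_norm:
            best, best_norm = (W1_t, b1_t, W2_t, b2), g_norm
    return best
```

Choosing the candidate with the largest gradient norm is what replaces the paper's gradient ascent over the group element; the loss itself is identical at every candidate.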

In my tests with a small Fashion-MNIST dataset and a feed-forward neural net of size 64, 64, 10, I saw roughly 10% jumps in training performance at each teleport, and convergence improved substantially overall. Because I do not use gradient ascent, the paper's analysis of achieving second-order convergence speeds does not directly apply here; however, since I pick the candidate with the largest gradient among those sampled, I still observe increased convergence speed numerically.
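For illustration, here is a rough (and again hypothetical) picture of where the teleport sits in training, using a single hidden layer of 64 units rather than the full 64, 64, 10 net for brevity:

```python
# Hypothetical training loop: plain SGD on a Fashion-MNIST subset, with a
# loss-preserving teleport every 10 epochs. Reuses the assumed
# teleport_by_sampling helper sketched above.
import tensorflow as tf

(x_tr, y_tr), _ = tf.keras.datasets.fashion_mnist.load_data()
x = tf.constant((x_tr[:1024].reshape(-1, 784) / 255.0).astype("float32"))
y = tf.constant(y_tr[:1024].astype("int64"))

W1 = tf.Variable(tf.random.normal([784, 64], stddev=0.05))
b1 = tf.Variable(tf.zeros([64]))
W2 = tf.Variable(tf.random.normal([64, 10], stddev=0.05))
b2 = tf.Variable(tf.zeros([10]))
opt = tf.keras.optimizers.SGD(learning_rate=0.1)

for epoch in range(30):
    if epoch > 0 and epoch % 10 == 0:
        # jump to output-preserving weights with a larger gradient norm
        for var, new in zip([W1, b1, W2, b2],
                            teleport_by_sampling(W1, b1, W2, b2, x, y)):
            var.assign(new)
    with tf.GradientTape() as tape:
        h = tf.nn.relu(x @ W1 + b1)
        loss = tf.reduce_mean(tf.nn.sparse_softmax_cross_entropy_with_logits(
            labels=y, logits=h @ W2 + b2))
    opt.apply_gradients(zip(tape.gradient(loss, [W1, b1, W2, b2]),
                            [W1, b1, W2, b2]))
```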

The main advantage I see in using teleportation, rather than simply training for more epochs, is the ability to get out of highly non-convex environments, along with the jumps in accuracy when the optimisation is stuck somewhere sub-optimal.
