Relativistic Gradient Descent (RGD)

RGD is a simple optimization method based on the simulation of a relativistic particle under the influence of a potential (objective function) and friction. We use a symplectic integrator to simulate such a physical system.

Gradient descent (GD) is probably the most well-known optimization method. The classical momentum method (CM), also known as Polyak's heavy ball, and Nesterov's accelerated gradient method (NAG) are accelerated variants of GD which are extensively used in machine learning. RGD generalizes both CM and NAG and usually have a superior performance. For instance, its convergence rate in a matrix completion problem (which is nonconvex) is illustrated in the figure below.

This method was proposed in the G. França et. al., "Conformal symplectic and relativistic optimization," J. Stat. Mech. (2020) 124008.
A shorter version of this paper was also published at NeurIPS 2020 (spotlight).
Besides the above papers, see the Presentation for a quick introduction or the Poster for an even quicker one.

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
figs		figs
Franca_talk_NeurIPS2020.pdf		Franca_talk_NeurIPS2020.pdf
README.md		README.md
colors.py		colors.py
corr_quad.py		corr_quad.py
fourth.py		fourth.py
matplotlibrc		matplotlibrc
matrix_compl.py		matrix_compl.py
opt.py		opt.py
plot_surfaces.py		plot_surfaces.py
poster_franca.pdf		poster_franca.pdf
random_quad.py		random_quad.py
rosenbrock.py		rosenbrock.py
solvers_mass.py		solvers_mass.py
solvers_scaled.py		solvers_scaled.py
test.py		test.py
test_suite.py		test_suite.py
test_surface.py		test_surface.py

guisf/rgd

Folders and files

Latest commit

History

Repository files navigation

Relativistic Gradient Descent (RGD)

About

Resources

Stars

Watchers

Forks

Languages