
Improve Bures-Wasserstein distance #468

Merged · 5 commits · May 4, 2023

Conversation

@francois-rozet (Contributor) commented May 2, 2023

Types of changes

  • Fixed typo in the documentation of the Bures-Wasserstein distance ($\Sigma_s$ instead of $\Sigma_s^{1/2}$).
  • Faster way of computing the trace of the square-root of the product of $\Sigma_s$ and $\Sigma_t$.

The implementation is based on two facts:

  1. The trace of $A$ equals the sum of its eigenvalues.
  2. The eigenvalues of $\sqrt{A}$ are the square-roots of the eigenvalues of $A$.

Then, $\mathrm{tr}(\sqrt{A})$ is the sum of the square-roots of the eigenvalues of $A$.
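As a sketch of the idea (hypothetical variable names, NumPy assumed, not the actual PR code), the two facts above give $\mathrm{tr}(\sqrt{\Sigma_s \Sigma_t})$ without ever forming a matrix square root, which can be checked against the explicit scipy.linalg.sqrtm computation:

```python
import numpy as np
import scipy.linalg as sl

rng = np.random.default_rng(0)

def random_spd(n):
    # random symmetric positive definite matrix
    A = rng.standard_normal((n, n)) / n ** 0.5
    return A @ A.T + 1e-6 * np.eye(n)

Sigma_s, Sigma_t = random_spd(64), random_spd(64)
M = Sigma_s @ Sigma_t  # not symmetric in general, but has real positive eigenvalues

# tr(sqrt(M)) = sum of the square roots of the eigenvalues of M
trace_sqrt = np.sqrt(np.maximum(np.linalg.eigvals(M).real, 0.0)).sum()

# reference: trace of the explicit matrix square root
trace_sqrt_ref = np.trace(sl.sqrtm(M)).real
```

The product of two SPD matrices is similar to an SPD matrix, so its eigenvalues are real and positive, which is what makes the shortcut valid here.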

See Lightning-AI/torchmetrics#1705.

Motivation and context / Related issue

Computing the square-root of a matrix is slow and unstable.

How has this been tested (if it applies)

The new implementation still passes the tests (at least with NumPy backend).

PR checklist

  • I have read the CONTRIBUTING document.
  • The documentation is up-to-date with the changes I made (check build artifacts).
  • All tests passed, and additional code has been covered with new tests.
  • I have added the PR and Issue fix to the RELEASES.md file.

@francois-rozet (Contributor, Author) commented May 2, 2023

It seems like the TensorFlow backend does not support .real on a complex tensor. I am not sure how to solve that. There are also issues with older torch versions.

@rflamary (Collaborator) commented May 3, 2023

Hello @francois-rozet, when a specific backend misses a feature, we usually add it to the backend. In this case, maybe we should add nx.real and nx.imag to the backend functions.

@rflamary (Collaborator) commented May 3, 2023

Also @francois-rozet, could you please do a quick timing test before and after your speedup for different backends (at least NumPy and PyTorch) and put it in the text description of the PR?

I like to have quantified performance gains in the history/GitHub so we know why we changed things. Also, maybe add a quick test that checks that the new function returns the same result as np.trace() up to numerical precision. It seems right, but such a test will help detect potential problems in the future.

@francois-rozet (Contributor, Author)

After some tests, I found that this implementation is only faster for NumPy. Computing the square root of a general matrix is indeed slower than computing its eigenvalues. However, computing the square root of a symmetric matrix takes more or less the same time as computing its eigenvalues. In fact, the PyTorch backend uses torch.linalg.eigh to implement sqrtm. So I think instead of modifying the algorithm, we can simply replace the sqrtm of the NumPy backend.
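For reference, a minimal sketch (assumed names, not the actual POT backend code) of how a symmetric-matrix square root can be built from eigh, mirroring what the comment above says the PyTorch backend does with torch.linalg.eigh:

```python
import numpy as np

def sqrtm_sym(C):
    """Square root of a symmetric positive semi-definite matrix via eigh.

    Sketch: C = V diag(w) V^T  =>  sqrt(C) = V diag(sqrt(w)) V^T.
    """
    w, V = np.linalg.eigh(C)
    return (V * np.sqrt(np.maximum(w, 0.0))) @ V.T

# sanity check against the defining property sqrt(C) @ sqrt(C) = C
rng = np.random.default_rng(0)
B = rng.standard_normal((32, 32))
C = B @ B.T
S = sqrtm_sym(C)
```

Clipping the eigenvalues at zero guards against tiny negative values introduced by floating-point round-off in nearly singular matrices.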

@rflamary (Collaborator) commented May 3, 2023

OK, it makes sense; happy I pushed you to investigate. You can leave the eigvals function in the backend, it can be useful in the future (trace norm regularization, for instance).

@francois-rozet (Contributor, Author) commented May 3, 2023

You can leave the eigvals function in the backend

Oops, I already removed it. Are the eigenvalues or the singular values necessary for trace norm regularization? And is it for a symmetric matrix? It turns out that eigh is much faster than eig for a symmetric matrix.
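A quick sanity check (hypothetical sizes, NumPy assumed) that the general and symmetric eigensolvers agree on the spectrum of a symmetric matrix, so swapping in the specialized solver is safe:

```python
import numpy as np

rng = np.random.default_rng(0)
B = rng.standard_normal((64, 64))
A = B @ B.T  # symmetric

w_sym = np.linalg.eigh(A)[0]               # ascending order, guaranteed real
w_gen = np.sort(np.linalg.eig(A)[0].real)  # general solver, sorted to compare

# the spectra match; eigh exploits symmetry and is typically much faster
```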

@rflamary (Collaborator) commented May 3, 2023

OK, no worries, we can add them back later (properly, depending on symmetry or not).

I'm nearly OK for a merge, but please add a short description of the PR to the RELEASES.md file.

@francois-rozet (Contributor, Author) commented May 3, 2023

I added a line to the RELEASES.md file. Also, here is a small notebook demo to show that np.linalg.eigh is much faster than scipy.linalg.sqrtm:

import numpy as np
import scipy.linalg as sl

A = np.random.rand(512, 512) / 512 ** 0.5
A = A @ A.T + np.eye(512) * 1e-6  # positive definite

%timeit np.linalg.eigh(A)
%timeit sl.sqrtm(A)

returns

19.5 ms ± 2.27 ms per loop (mean ± std. dev. of 7 runs, 100 loops each)
166 ms ± 15.2 ms per loop (mean ± std. dev. of 7 runs, 10 loops each)

The time gap increases for larger matrices.

@francois-rozet (Contributor, Author)

All tests have passed except the CircleCI one.

@rflamary merged commit 83dc498 into PythonOT:master on May 4, 2023
17 of 18 checks passed
@francois-rozet deleted the patch branch on May 4, 2023 at 06:33