[MRG] Add implicit Sinkhorn gradients #605

rflamary · 2024-02-19T09:50:24Z

Types of changes

This PR aims to:

Implement the detach function in the backend to allow a speedup on CPU/GPU in some solvers (this was already done in a previous PR, but with limited documentation).
Implement variants of Sinkhorn where the computations are detached and the gradients at convergence are returned instead.

This PR should solve #565 and greatly limit memory usage for Sinkhorn when computing gradients of the value.

In order to use implicit differentiation, one needs to set the grad parameter in ot.solve and ot.solve_sample as such:

sol = ot.solve(M, a, b, reg=10, grad='implicit')
sol.value.backward()
# Beware: with grad='implicit', sol.value_linear and sol.plan are not differentiable (not implemented yet).

On a simple example with PyTorch tensors that require gradients, I observed a 1000x gain in memory when solving a problem that needs a large number of Sinkhorn iterations.
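To illustrate what "gradients at convergence" can mean, here is a minimal dependency-free sketch. It is not the code from this PR and does not use the POT API: the helper names (sinkhorn_plan, entropic_value, finite_difference) and the toy 3x3 problem are assumptions made for illustration only. It relies on the envelope theorem: for the entropic value <P, M> + reg * KL(P | a b^T), the gradient with respect to the cost M at the optimum is the optimal plan itself, so no backpropagation through the Sinkhorn iterations is needed. The sketch checks this against central finite differences.

```python
import math


def sinkhorn_plan(M, a, b, reg, n_iter=500):
    """Run plain Sinkhorn scaling iterations and return the transport plan."""
    n, m = len(a), len(b)
    K = [[math.exp(-M[i][j] / reg) for j in range(m)] for i in range(n)]
    u, v = [1.0] * n, [1.0] * m
    for _ in range(n_iter):
        u = [a[i] / sum(K[i][j] * v[j] for j in range(m)) for i in range(n)]
        v = [b[j] / sum(K[i][j] * u[i] for i in range(n)) for j in range(m)]
    return [[u[i] * K[i][j] * v[j] for j in range(m)] for i in range(n)]


def entropic_value(M, a, b, reg):
    """Value <P, M> + reg * KL(P | a b^T) evaluated at the Sinkhorn plan P."""
    P = sinkhorn_plan(M, a, b, reg)
    n, m = len(a), len(b)
    linear = sum(P[i][j] * M[i][j] for i in range(n) for j in range(m))
    kl = sum(P[i][j] * math.log(P[i][j] / (a[i] * b[j]))
             for i in range(n) for j in range(m))
    return linear + reg * kl


# Toy problem (hypothetical data, chosen only for this illustration).
M = [[0.0, 1.0, 2.0], [1.0, 0.0, 1.0], [2.0, 1.0, 0.0]]
a = [0.5, 0.3, 0.2]
b = [0.2, 0.3, 0.5]
reg = 1.0

# Gradient at convergence: by the envelope theorem, d(value)/dM is the plan,
# so nothing from the iterations has to be kept in memory.
implicit_grad = sinkhorn_plan(M, a, b, reg)


def finite_difference(i, j, eps=1e-4):
    """Central finite difference of the value with respect to M[i][j]."""
    M_plus = [row[:] for row in M]
    M_minus = [row[:] for row in M]
    M_plus[i][j] += eps
    M_minus[i][j] -= eps
    diff = entropic_value(M_plus, a, b, reg) - entropic_value(M_minus, a, b, reg)
    return diff / (2 * eps)


# Largest disagreement between the implicit gradient and finite differences.
max_gap = max(abs(implicit_grad[i][j] - finite_difference(i, j))
              for i in range(3) for j in range(3))
print(max_gap)
```

With an autodiff framework, the unrolled alternative would store every intermediate scaling vector to backpropagate through the loop, which is where the memory cost grows with the number of iterations; the gradient-at-convergence approach only needs the final solution.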

Motivation and context / Related issue

How has this been tested (if it applies)

PR checklist

I have read the CONTRIBUTING document.
The documentation is up-to-date with the changes I made (check build artifacts).
All tests passed, and additional code has been covered with new tests.
I have added the PR and Issue fix to the RELEASES.md file.

codecov · 2024-02-19T10:05:09Z

Codecov Report

Merging #605 (28fe869) into master (c84ef33) will increase coverage by 0.03%.
The diff coverage is 100.00%.

Additional details and impacted files

@@            Coverage Diff             @@
##           master     #605      +/-   ##
==========================================
+ Coverage   96.75%   96.78%   +0.03%     
==========================================
  Files          77       77              
  Lines       15961    16002      +41     
==========================================
+ Hits        15443    15488      +45     
+ Misses        518      514       -4

cedricvincentcuaz

Hello Rémi ! Ready to merge ;)

add detach function to backend

3a29218

rflamary added 3 commits February 19, 2024 14:02

debug function

361c27b

better detach

05346e9

new implementation

6ba0f26

rflamary changed the title ~~[WIP] Add detach function to backend~~ [WIP] Add implicit Sinkhorn gradients Feb 20, 2024

rflamary added 3 commits February 20, 2024 10:58

add test for gradient

37f3eed

better default

2c27a43

update documentation

28fe869

rflamary changed the title ~~[WIP] Add implicit Sinkhorn gradients~~ [MRG] Add implicit Sinkhorn gradients Feb 20, 2024

rflamary requested a review from cedricvincentcuaz February 20, 2024 10:43

cedricvincentcuaz approved these changes Feb 20, 2024

rflamary merged commit 6f35804 into master Feb 20, 2024

cedricvincentcuaz mentioned this pull request Feb 25, 2024

CUDA out of memory when using ot.sinkhorn2 as a loss function #565

Closed
