Add user guide #690

ArinaDanilina · 2024-04-30T06:49:38Z

No description provided.

for more information, see https://pre-commit.ci

…nto add/user_guide

docs/user_guide.md

MUCDK

also add more stuff, please.

giovp · 2024-05-07T07:26:41Z

missing:

hyperparameters section
fill the table of the problems

for more information, see https://pre-commit.ci

docs/user_guide.md

MUCDK

minor changes

MUCDK · 2024-05-14T07:24:04Z

Please also add a general reference to examples / tutorials at the end.

MUCDK · 2024-05-29T06:37:31Z

docs/user_guide.md

+
+In their original formulation, OT algorithms don't scale to large datasets due to their high computational complexity. Moscot overcomes this limitation by allowing for the use of low-rank solvers. In each `solve` method we have the `rank` parameter, by default $-1$ -- the full rank.
+Whenever possible, it's best to start with the full rank, but when needed, the rank should be set to a positive integer. The higher the rank, the better the full-rank approximation. Hence, one should start with a reasonable high rank, e.g. $5000$. Consecutively decrease the rank if needed due to memory constraints. Note that the scale of $\tau_a$ and $\tau_b$ changes whenever we are in the low-rank setting. While they should be still between $0$ and $1$, empirically they should be set in the range between $0.1$ and $0.5$. See {doc}`/notebooks/examples/solvers/100_linear_problems_basic` and {doc}`/notebooks/examples/solvers/300_quad_problems_basic` on how to use low-rank solutions.
+Another option to use the full rank is to specify the `batch_size` parameter of the `solve` method. It determines the number of rows or columns of the cost matrix to materialize during the {term}`Sinkhorn` iterations. Larger values will require more memory and can be adjusted due to memory constraints as well.


Can we have this before low-rank? and then say, for linear problems we can do the batch_size, which reduces memory complexity (but slightly increase time complexity).

and then we motivate the low-rank: Whenever time complexity in a linear problem (e.g. TemporalProblem) should be reduced, or memory/time complexity in a quadratic (enumarte / link quadratic problems here) problem should be reduced , we use low-rank OT.

MUCDK · 2024-05-29T06:38:53Z

docs/user_guide.md

+
+## Hyperparameters
+
+Moscot problems' `solve` methods have the following parameters that can be set depending on the specific task:


The solve method of moscot problems have a wide range of parameters. In the following, we discuss the most relevant ones:

MUCDK · 2024-05-29T06:39:34Z

docs/user_guide.md

+
+Moscot problems' `solve` methods have the following parameters that can be set depending on the specific task:
+
+- $\varepsilon$ - {term}`Entropic regularization`.


[...] This determines the stochasticity of the map. The higher epsilon, the more stochastic the map is.

MUCDK · 2024-05-29T06:42:07Z

docs/user_guide.md

+Moscot problems' `solve` methods have the following parameters that can be set depending on the specific task:
+
+- $\varepsilon$ - {term}`Entropic regularization`.
+- $\tau_a$ and $\tau_b$ - Parameters in $(0, 1]$ that define how {term}`unbalanced <unbalanced OT problem>` is the problem on the source and target {term}`marginals`. If $1$, the problem is {term}`balanced <balanced OT problem>`.


The lower tau, the more "unbalanced" the problem. Unbalancedness allows to automatically discard outliers, compensate for undesired distributional shifts, and model cell proliferation and apoptosis.

MUCDK · 2024-05-29T06:43:07Z

docs/user_guide.md

+
+- $\varepsilon$ - {term}`Entropic regularization`.
+- $\tau_a$ and $\tau_b$ - Parameters in $(0, 1]$ that define how {term}`unbalanced <unbalanced OT problem>` is the problem on the source and target {term}`marginals`. If $1$, the problem is {term}`balanced <balanced OT problem>`.
+- $\alpha$ - Parameter in $(0, 1]$ that interpolates between the {term}`quadratic term` and the {term}`linear term`. $\alpha = 1$ corresponds to the pure {term}`Gromov-Wasserstein` problem while $\alpha \to 0$ corresponds to the pure {term}`linear problem`.


... $\alpha$ (only in problems building upon Fused Gromov-Wasserstein).....

MUCDK · 2024-05-29T06:43:24Z

docs/user_guide.md

+- $\varepsilon$ - {term}`Entropic regularization`.
+- $\tau_a$ and $\tau_b$ - Parameters in $(0, 1]$ that define how {term}`unbalanced <unbalanced OT problem>` is the problem on the source and target {term}`marginals`. If $1$, the problem is {term}`balanced <balanced OT problem>`.
+- $\alpha$ - Parameter in $(0, 1]$ that interpolates between the {term}`quadratic term` and the {term}`linear term`. $\alpha = 1$ corresponds to the pure {term}`Gromov-Wasserstein` problem while $\alpha \to 0$ corresponds to the pure {term}`linear problem`.
+- `batch_size` - Number of rows/columns of the cost matrix to materialize during the solver iterations. Larger value will require more memory.


....See above the scalability

MUCDK · 2024-05-29T06:43:50Z

docs/user_guide.md

+- `batch_size` - Number of rows/columns of the cost matrix to materialize during the solver iterations. Larger value will require more memory.
+- `rank` - Rank of the {term}`low-rank OT` solver {cite}`scetbon:21b`. If $-1$, full-rank solver {cite}`peyre:2016` is used.
+
+For more hyperparameters and their usage please refer to {doc}`/notebooks/examples/solvers/200_linear_problems_advanced` and {doc}`/notebooks/examples/solvers/400_quad_problems_advanced`.


you should also link to basic solve examples.

MUCDK

THanks

Arina Danilina and others added 8 commits April 30, 2024 08:35

user guide

ee004a4

[pre-commit.ci] auto fixes from pre-commit.com hooks

fd8e52a

for more information, see https://pre-commit.ci

module and toctree

a88e2ba

Merge branch 'add/user_guide' of https://github.com/theislab/moscot i…

a262829

…nto add/user_guide

Merge branch 'main' into add/user_guide

7939851

Merge branch 'main' into add/user_guide

e755c47

:mod: and header anchor

82a2bb2

module::

7e78934

MUCDK reviewed May 6, 2024

View reviewed changes

docs/user_guide.md Outdated Show resolved Hide resolved

MUCDK reviewed May 6, 2024

View reviewed changes

docs/user_guide.md Outdated Show resolved Hide resolved

MUCDK reviewed May 6, 2024

View reviewed changes

docs/user_guide.md Outdated Show resolved Hide resolved

MUCDK requested changes May 6, 2024

View reviewed changes

typos and links

a9c2020

giovp added this to the 0.4.0 milestone May 7, 2024

ArinaDanilina and others added 7 commits May 13, 2024 14:00

Merge branch 'main' into add/user_guide

fdbdf2e

tables

4f1c8db

hyperparameters

1d9045e

typo

361324a

OT link

4c2ca76

[pre-commit.ci] auto fixes from pre-commit.com hooks

684243f

for more information, see https://pre-commit.ci

Merge branch 'main' into add/user_guide

ac60f7d

ArinaDanilina requested a review from MUCDK May 13, 2024 15:41

MUCDK reviewed May 14, 2024

View reviewed changes

docs/user_guide.md Outdated Show resolved Hide resolved

MUCDK reviewed May 14, 2024

View reviewed changes

docs/user_guide.md Outdated Show resolved Hide resolved

MUCDK reviewed May 14, 2024

View reviewed changes

docs/user_guide.md Outdated Show resolved Hide resolved

MUCDK reviewed May 14, 2024

View reviewed changes

docs/user_guide.md Outdated Show resolved Hide resolved

MUCDK requested changes May 14, 2024

View reviewed changes

Merge branch 'main' into add/user_guide

1f95b04

ArinaDanilina and others added 3 commits May 27, 2024 13:37

Merge branch 'main' into add/user_guide

6b4914c

edits and links

7242750

general reference to examples / tutorials

9970295

MUCDK self-requested a review May 28, 2024 07:27

Merge branch 'main' into add/user_guide

8ab44e4

MUCDK reviewed May 29, 2024

View reviewed changes

MUCDK requested changes May 29, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add user guide #690

Add user guide #690

ArinaDanilina commented Apr 30, 2024

MUCDK left a comment

giovp commented May 7, 2024 •

edited by ArinaDanilina

MUCDK left a comment

MUCDK commented May 14, 2024

MUCDK May 29, 2024

MUCDK May 29, 2024

MUCDK May 29, 2024

MUCDK May 29, 2024

MUCDK May 29, 2024

MUCDK May 29, 2024

MUCDK May 29, 2024

MUCDK left a comment


		## Hyperparameters

		Moscot problems' `solve` methods have the following parameters that can be set depending on the specific task:


		Moscot problems' `solve` methods have the following parameters that can be set depending on the specific task:

		- $\varepsilon$ - {term}`Entropic regularization`.

Add user guide #690

Are you sure you want to change the base?

Add user guide #690

Conversation

ArinaDanilina commented Apr 30, 2024

MUCDK left a comment

Choose a reason for hiding this comment

giovp commented May 7, 2024 • edited by ArinaDanilina

MUCDK left a comment

Choose a reason for hiding this comment

MUCDK commented May 14, 2024

MUCDK May 29, 2024

Choose a reason for hiding this comment

MUCDK May 29, 2024

Choose a reason for hiding this comment

MUCDK May 29, 2024

Choose a reason for hiding this comment

MUCDK May 29, 2024

Choose a reason for hiding this comment

MUCDK May 29, 2024

Choose a reason for hiding this comment

MUCDK May 29, 2024

Choose a reason for hiding this comment

MUCDK May 29, 2024

Choose a reason for hiding this comment

MUCDK left a comment

Choose a reason for hiding this comment

giovp commented May 7, 2024 •

edited by ArinaDanilina