Local sparsity support with ADTypes #72

gdalle · 2024-05-18T11:48:59Z

I think our TracerSparsityDetector should offer the option to use either local or global patterns.

I don't think it should be the job of ADTypes to provide that distinction, and here's why: the notion of "sparsity" is intrinsically ill-defined.
Our concept sparsity is not the same as that of Symbolics (for instance wrt dependency on control flow if/else), which may not be the same as that of a pre-specified pattern.
It's not up to ADTypes to define a complete classification of the various types of sparsity. Rather, the sparsity detectors themselves should document their approach and limitations, and offer all the necessary toggles

The text was updated successfully, but these errors were encountered:

adrhill · 2024-05-18T11:50:14Z

We could add local tracing as a kwarg to TracerSparsityDetector, or add a separate LocalTracerSparsityDetector, but this might cause downstream packages like DI to not recompute sparsity patterns when they should.

For this reason, I would argue that local and global sparsity detection should be distinguished between on the level of ADTypes.

gdalle · 2024-05-18T11:52:05Z

We could add local tracing as a kwarg to TracerSparsityDetector()

First of all, if we add it it should be as a type parameter to enable dispatch.

this might cause downstream packages like DI to not recompute sparsity patterns when they should.

It's not "packages" who decide when to recompute the sparsity pattern, it's users. They make the call as to whether prepare_jacobian should be run again or not when x changes.

Even what we call "global sparsity" depends on the control flow, so it's not robust to any change in x. Sure, we throw an error "primal value missing" in some cases when we see a comparison, but there must be more cases we missed.

adrhill · 2024-05-18T11:52:09Z

Our concept sparsity is not the same as that of Symbolics (for instance wrt dependency on control flow if/else), which may not be the same as that of a pre-specified pattern.

This is not the issue. The issue is that in DI, LocalTracerSparsityDetector should only be used in combination with prepare_*_same_point and never with prepare_*.

gdalle · 2024-05-18T11:54:08Z

The issue is that in DI, LocalTracerSparsityDetector should only be used in combination with prepare_same_point and never with prepare.

Okay but that is something we want to warn users about, not explicitly forbid.

The same warning holds for other types of preparation, like the tape in ReverseDiff: it depends on the control flow, and thus possibly on some values inside x. We still allow this preparation, we just let users decide how many times they can safely reuse extras

adrhill · 2024-05-18T11:54:14Z

It's not "packages" who decide when to recompute the sparsity pattern, its users. They make the call as to whether prepare_jacobian should be run again or not when x changes.

My point is basically holding the hand of the user: there is never any use to call prepare_jacobian with LocalTracerSparsityDetector, so I personally just wouldn't allow that footgun.

gdalle · 2024-05-18T11:55:13Z

there is never any use to call prepare_jacobian with LocalTracerSparsityDetector

Yes there is, because even when you compute a one-time Jacobian,

jacobian(f, backend, x) = jacobian(f, backend, x, prepare_jacobian(f, backend, x))

gdalle · 2024-05-18T11:56:10Z

In such cases, the user wouldn't call prepare_jacobian explicitly, but the preparation still happens even for one shot

adrhill · 2024-05-18T11:56:32Z

Yeah, now I see the issue.

gdalle · 2024-05-18T11:57:33Z

Basically

ADTypes cannot explicitly distinguish every possible brand of sparsity
We need local sparsity to be supported by ADTypes because that's what DI uses
Thus we need the option in our TracerSparsityDetector, with all the necessary red flags

adrhill · 2024-05-18T11:57:59Z

Then let's make it explicit and add a separate LocalTracerSparsityDetector (or TracerLocalSparsityDetector).

gdalle · 2024-05-18T11:58:25Z

Sure

gdalle mentioned this issue May 18, 2024

Support for linear algebra #68

Closed

adrhill self-assigned this May 21, 2024

adrhill mentioned this issue May 21, 2024

Add TracerLocalSparsityDetector #81

Merged

adrhill closed this as completed in #81 May 21, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Local sparsity support with ADTypes #72

Local sparsity support with ADTypes #72

gdalle commented May 18, 2024

adrhill commented May 18, 2024

gdalle commented May 18, 2024

adrhill commented May 18, 2024

gdalle commented May 18, 2024

adrhill commented May 18, 2024

gdalle commented May 18, 2024

gdalle commented May 18, 2024

adrhill commented May 18, 2024

gdalle commented May 18, 2024

adrhill commented May 18, 2024 •

edited

Loading

gdalle commented May 18, 2024

Local sparsity support with ADTypes #72

Local sparsity support with ADTypes #72

Comments

gdalle commented May 18, 2024

adrhill commented May 18, 2024

gdalle commented May 18, 2024

adrhill commented May 18, 2024

gdalle commented May 18, 2024

adrhill commented May 18, 2024

gdalle commented May 18, 2024

gdalle commented May 18, 2024

adrhill commented May 18, 2024

gdalle commented May 18, 2024

adrhill commented May 18, 2024 • edited Loading

gdalle commented May 18, 2024

adrhill commented May 18, 2024 •

edited

Loading