
Automatically select a backend #164

Closed
prbzrg opened this issue Apr 9, 2024 · 12 comments
Labels
test Related to the testing subpackage

Comments

prbzrg commented Apr 9, 2024

It would be great to have an API that automatically selects a backend. It could be a function that tries each backend and returns the fastest one that works.
One of its usages could be in https://github.com/SciML/SciMLSensitivity.jl/blob/master/src/concrete_solve.jl

gdalle (Owner) commented Apr 9, 2024

DifferentiationInterfaceTest.jl already provides the necessary utilities for users to compare and benchmark backends.
I think doing it in their place would be a step too far, and extremely costly in terms of runtime. What we can do, however, is define utilities to list the available backends, in order to make benchmarking even simpler.

gdalle added the test label (Related to the testing subpackage) on Apr 9, 2024
prbzrg (Author) commented Apr 9, 2024

Selection only happens once, before the optimization starts, so for big optimizations the time cost is negligible.
And if that's a bad idea, what about a function that doesn't try the backends but selects one based on properties, like whether the function mutates its arguments or has branches?

gdalle (Owner) commented Apr 9, 2024

My reasoning was: it's very costly and it's a two-liner, so we might as well let users do it themselves. However, I guess we could expose an interface of the form:

function fastest_backend(backends, scenario)
    results = benchmark_differentiation(backends, scenario)
    best_trial = argmin(trial -> trial.time, results)
    return best_trial.backend  # currently doesn't work, but only needs minor modifications
end

Is that similar to what you had in mind?

gdalle (Owner) commented Apr 9, 2024

As for a heuristic to select backends, I think benchmarking is indeed more reasonable. We do have internal traits to check whether mutation is supported, though; we could expose them.
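
To make this concrete, here is a minimal sketch of how such an exposed trait could be used to filter candidates. The names supports_mutation and mutating_backends are hypothetical, not DI's actual (internal) traits:

using ADTypes: AbstractADType, AutoForwardDiff, AutoReverseDiff

# Hypothetical trait (names assumed; DI keeps its real traits internal):
# true if the backend can differentiate mutating functions f!(y, x).
supports_mutation(::AbstractADType) = false
supports_mutation(::AutoForwardDiff) = true
supports_mutation(::AutoReverseDiff) = true

# Keep only the candidates usable with a mutating function.
mutating_backends(backends) = filter(supports_mutation, backends)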

gdalle (Owner) commented Apr 9, 2024

As for listing the available backends, I thought about it some more, and it's not obvious what the right method is. I can check whether ForwardDiff.jl is loaded, but then what is my "prototypical" ForwardDiff backend object: what chunk size does it use? The same goes for ReverseDiff and compiled tapes.
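
For illustration, a crude sketch of one possible detection heuristic; maybe_forwarddiff_backend is a hypothetical helper, and the chunk-size question raised above is deliberately left open:

using ADTypes: AutoForwardDiff

# Crude sketch (hypothetical helper): treat ForwardDiff as available if it
# has been loaded into Main. AutoForwardDiff() leaves the chunk size
# unspecified, which dodges rather than answers the question above.
maybe_forwarddiff_backend() = isdefined(Main, :ForwardDiff) ? AutoForwardDiff() : nothing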

adrhill (Collaborator) commented Apr 9, 2024

I see how this would be useful for

  • users who want a very high-level interface that abstracts away as much as possible
  • downstream packages that don’t know which functions will be passed to their interface and want to use runtime heuristics for maximum performance (e.g. the SciML ecosystem, as you mentioned)

Both Guillaume and I favor explicitness in DI and try to avoid macros and generated code.
Something I could envision, which is very close to Guillaume's suggestion, is a thin wrapper around DifferentiationInterfaceTest.jl's test_differentiation that returns a backend according to an optimality criterion (runtime, allocs, ...) selected by the user.

The following would allow you to take the human reading a table of benchmarks out of the loop:

backend = autobackend(GradientScenario(f; x=x))

for _ in 1:100_000
    grad = gradient(f, backend, x)
    # ...
end

This autobackend function could be called by downstream packages to define default configurations for e.g. subsequent solver calls.
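
As a rough illustration of what such a function might do internally, here is a sketch. autobackend is not part of DI; this version takes the function, input, and candidate list directly rather than a Scenario object, and a real implementation would rely on DifferentiationInterfaceTest's benchmarking utilities instead of a single timed sample:

using DifferentiationInterface: gradient

# Hypothetical autobackend (sketch): pick the candidate with the fastest
# single gradient evaluation after a warm-up call. Backends that error
# are not handled here; a real version should skip them.
function autobackend(f, x, backends)
    timings = map(backends) do backend
        gradient(f, backend, x)            # warm-up (compilation)
        @elapsed gradient(f, backend, x)   # one timed sample
    end
    return backends[argmin(timings)]
end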

adrhill (Collaborator) commented Apr 9, 2024

I see I wrote too slowly. 😄

adrhill (Collaborator) commented Apr 9, 2024

> I can check whether ForwardDiff.jl is loaded, but then what is my "prototypical" ForwardDiff backend object: what chunk size does it use? The same goes for ReverseDiff and compiled tapes.

I personally would be OK with such a high-level function being "suboptimal", as long as advanced users can manually specify several ForwardDiff backends with different chunk sizes to benchmark.
In this specific case, we could default to the pickchunksize heuristic used by ForwardDiff.

prbzrg (Author) commented Apr 9, 2024

> Is that similar to what you had in mind?

I'm sure it would be helpful for new AD users, but what I wish for is an AutoAuto() or AutoAuto(list_of_backends) backend that selects a backend at runtime and memoizes the choice for future calls.
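
To make the request concrete, one hypothetical shape for such a wrapper (not something DI provides, and the maintainers argue against this kind of hidden mechanism below):

# Hypothetical AutoAuto wrapper (sketch of the request only). The first
# call benchmarks the candidates and memoizes the winner; later calls
# reuse the memoized choice.
mutable struct AutoAuto
    candidates::Vector{Any}
    chosen::Any
end
AutoAuto(candidates) = AutoAuto(collect(Any, candidates), nothing)

function select_backend!(aa::AutoAuto, f, x)
    if aa.chosen === nothing
        aa.chosen = autobackend(f, x, aa.candidates)  # see the sketch above
    end
    return aa.chosen
end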

Vaibhavdixit02 (Contributor)

I think implementing this would make more sense in downstream packages that take DI as a dependency, since the automatic selection is heavily influenced by the type of problem you have.

gdalle (Owner) commented Apr 10, 2024

Adrian and I are both against magic tricks like memoization, so if we do offer this functionality, it will be a separate selection function, not a backend object with a hidden mechanism. But at the moment it doesn't fit well within our benchmark framework, so I would leave it to downstream users.

gdalle (Owner) commented May 28, 2024

Closing this issue, since the latest version of DIT (since #257) includes the specification of benchmarking results in its public API. Users are free to write a three-line snippet to select the best backend according to their own criteria.
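
Roughly along these lines (a sketch: backends and scenarios stand for the user's candidate list and test scenarios, and the :backend and :time column names should be checked against the DIT version in use):

using DataFrames
using DifferentiationInterfaceTest: benchmark_differentiation

# Sketch of the "three-line" selection over DIT's benchmark results.
data = benchmark_differentiation(backends, scenarios)
best_row = argmin(row -> row.time, eachrow(data))
best_backend = best_row.backend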

gdalle closed this as not planned on May 28, 2024