feat: add gradient with AutoReactant #918

gdalle · 2025-11-15T23:57:10Z

Updated experiments with Reactant-accelerated derivatives.

@wsmoses is this still the right paradigm in your opinion? I may not implement every operator right away but I thought starting with a gradient made sense

Re-toggle tests once this is mergeable

DifferentiationInterface/ext/DifferentiationInterfaceReactantExt/onearg.jl

codecov · 2025-11-16T00:16:01Z

Codecov Report

❌ Patch coverage is 94.23077% with 3 lines in your changes missing coverage. Please review.
✅ Project coverage is 3.18%. Comparing base (bbc39fd) to head (c7e7598).

Files with missing lines	Patch %	Lines
...e/ext/DifferentiationInterfaceReactantExt/utils.jl	50.00%	3 Missing ⚠️

❗ There is a different number of reports uploaded between BASE (bbc39fd) and HEAD (c7e7598). Click for more details.

HEAD has 59 uploads less than BASE

Flag BASE (bbc39fd) HEAD (c7e7598)

DIT 12 0

DI 48 1

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #918       +/-   ##
==========================================
- Coverage   98.10%   3.18%   -94.92%     
==========================================
  Files         133     101       -32     
  Lines        7971    5553     -2418     
==========================================
- Hits         7820     177     -7643     
- Misses        151    5376     +5225

Flag	Coverage Δ
DI	`3.18% <94.23%> (-95.65%)`	⬇️
DIT	`?`

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

gdalle · 2025-11-16T20:13:51Z

@wsmoses does this look better to you now?
I'm not sure what we should do in terms of storage versus allocations. We can store xr (and even contextsr) during preparation and then copy to them at execution time instead of generating a new RArray, but that would require a copying method to be defined (which doesn't apply to all non-array objects).

wsmoses · 2025-11-16T20:37:19Z

DifferentiationInterface/ext/DifferentiationInterfaceReactantExt/onearg.jl

+    DI.check_prep(f, prep, rebackend, x)
+    backend = rebackend.mode
+    (; xr, compiled_gradient) = prep
+    copyto!(xr, x)


We should only do this if x is not a reactantarray

wsmoses · 2025-11-16T20:38:17Z

DifferentiationInterface/ext/DifferentiationInterfaceReactantExt/onearg.jl

+    ) where {F, C}
+    _sig = DI.signature(f, rebackend, x; strict)
+    backend = rebackend.mode
+    xr = to_reac(x)


We shouldn't save anything as a prep argument if a reactant array, I would keep this as if reactant array then xr is nothing otherwise to_rarray(x)

Sounds reasonable

wsmoses · 2025-11-16T20:38:49Z

DifferentiationInterface/ext/DifferentiationInterfaceReactantExt/onearg.jl

+    DI.check_prep(f, prep, rebackend, x)
+    backend = rebackend.mode
+    (; xr, compiled_value_and_gradient) = prep
+    copyto!(xr, x)


Same comment here

wsmoses · 2025-11-16T20:39:17Z

DifferentiationInterface/ext/DifferentiationInterfaceReactantExt/onearg.jl

+    DI.check_prep(f, prep, rebackend, x)
+    backend = rebackend.mode
+    (; xr, gr, compiled_gradient!) = prep
+    copyto!(xr, x)


wsmoses · 2025-11-16T20:39:54Z

DifferentiationInterface/test/Back/Reactant/test.jl

+@test check_inplace(backend)
+
+test_differentiation(
+    backend, DifferentiationInterfaceTest.default_scenarios(;


Can you add a test that the prep contains no data except the compiled fn if compiled for a reactant array

wsmoses · 2025-11-16T20:45:25Z

Modulo some comments being addressed above this looks reasonable to me.

However note that there may be a potential mismatch in expectations behind what prepare gradient defines and what reactant compile defines.

See https://enzymead.github.io/Reactant.jl/dev/tutorials/partial-evaluation

Currently any data inside a constant or cache will be baked into the compiled function and will not be re read in later evaluation.

Enzyme.jl in particular does not have any such constraint (as it will always re run with live data as prep is nothing).

Something like reversediff compiled probably does bake in the assumption from compilation.

So this is a question of what is the semantics of prep.

If the non differentiated data is the same between prep and evaluation there is no difference between the two

gdalle · 2025-11-16T21:14:19Z

So this is a question of what is the semantics of prep. If the non differentiated data is the same between prep and evaluation there is no difference between the two

The semantics of prep: differentiated and non-differentiated data are free to change between preparation and execution, as long as they keep the same types and sizes. See here for details.

Currently any data inside a constant or cache will be baked into the compiled function and will not be re read in later evaluation.

I thought converting the contexts into reactant arrays inside contextr would allow them to be traced? If that's true, then they won't be partially evaluated into the compiled function, which means the semantics of prep are respected?

wsmoses · 2025-11-16T21:19:04Z

No your to_reac function does not achieve that. If you have a context of Tuple{Int, Int} this will not be converted unless you do to_rarray(context; track_numbers=Number).

However, concurrently, most of the time you actually want to partially evaluate integers in (e.g. for sizes/bounds/etc).

I think the more reasonable setup here is to not to_rarray the context, and instead add a similar warning to the one from reversediff:

These rules hold for the majority of backends, but there are some exceptions. The most important exception is [ReverseDiff.jl](https://github.com/JuliaDiff/ReverseDiff.jl) and its taping mechanism, which is sensitive to control flow inside the function.

gdalle · 2025-11-16T21:46:45Z

Being forced to keep the same context values makes preparation pretty much useless. I think I'd rather have us trace everything in the context, even if it means a slowdown in some cases. Will it lead to actual errors?

gdalle · 2025-11-16T21:49:48Z

Or alternately we could restrict the kind of contexts we allow here

wsmoses · 2025-11-16T21:50:48Z

Yes unnecessarily tracing objects can lead to errors that would fail to compile otherwise

wsmoses · 2025-11-16T21:51:27Z

and it doesn't make it useless, it just means that the user is responsible for performing the to_rarray themselves for things that may change

gdalle · 2025-11-17T09:30:51Z

Yes unnecessarily tracing objects can lead to errors that would fail to compile otherwise

Can you give an example so that I wrap my mind around this?

and it doesn't make it useless, it just means that the user is responsible for performing the to_rarray themselves for things that may change

That would be a Reactant-specific workaround, which doesn't fit other DI-supported backends. The whole point of DI is to enable easy backend switch, so I'd love to find a solution that doesn't expect users to wrap some of the arguments in Reactant-specific types when they want to switch to AutoReactant.

gdalle · 2025-11-17T14:38:51Z

Besides, the problem is not specific to contexts: x itself can contain integers we don't necessarily want to track. And a preparation that can only be reused if nothing at all changes will only ever be used once anyway, so it is pointless.

Maybe DI could expose a function like trace(a, backend) or translate(a, backend) which takes care of populating values in the correct way for differentiation / Reactant compilation? I already use such a function internally anyway, especially in ForwardDiff and other operator overloading-based backends.

wsmoses · 2025-11-17T16:25:33Z

julia> using Reactant; x = Reactant.to_rarray(ones(10)); s = Reactant.ConcreteRNumber(2); e = Reactant.ConcreteRNumber(5);

julia> f(x, s, e) = x[s:e]
f (generic function with 1 method)

julia> @jit f(x, s, e)
ERROR: TypeError: non-boolean (Reactant.TracedRNumber{Bool}) used in boolean context
Stacktrace:
  [1] getindex_linear
    @ ~/git/Reactant.jl/src/Indexing.jl:340 [inlined]
  [2] (::Nothing)(none::typeof(Reactant.TracedIndexing.getindex_linear), none::Reactant.TracedRArray{Float64, 1}, none::Reactant.TracedUnitRange{Reactant.TracedRNumber{Int64}})
    @ Reactant ./<missing>:0
  [3] getindex_linear
    @ ~/git/Reactant.jl/src/Indexing.jl:339 [inlined]
  [4] call_with_reactant(::Reactant.MustThrowError, ::typeof(Reactant.TracedIndexing.getindex_linear), ::Reactant.TracedRArray{…}, ::Reactant.TracedUnitRange{…})
    @ Reactant ~/git/Reactant.jl/src/utils.jl:0
  [5] getindex
    @ ~/git/Reactant.jl/src/Indexing.jl:75 [inlined]
  [6] (::Nothing)(none::typeof(getindex), none::Reactant.TracedRArray{Float64, 1}, none::Reactant.TracedUnitRange{Reactant.TracedRNumber{Int64}})
    @ Reactant ./<missing>:0
  [7] getindex
    @ ~/git/Reactant.jl/src/Indexing.jl:75 [inlined]
  [8] call_with_reactant(::Reactant.MustThrowError, ::typeof(getindex), ::Reactant.TracedRArray{Float64, 1}, ::Reactant.TracedUnitRange{Reactant.TracedRNumber{Int64}})
    @ Reactant ~/git/Reactant.jl/src/utils.jl:0
  [9] f
    @ ./REPL[5]:1 [inlined]
 [10] (::Nothing)(none::typeof(f), none::Reactant.TracedRArray{Float64, 1}, none::Reactant.TracedRNumber{Int64}, none::Reactant.TracedRNumber{Int64})
    @ Reactant ./<missing>:0
 [11] TracedUnitRange
    @ ~/git/Reactant.jl/src/Types.jl:108 [inlined]
 [12] TracedUnitRange
    @ ~/git/Reactant.jl/src/TracedRange.jl:124 [inlined]
 [13] Colon
    @ ~/git/Reactant.jl/src/TracedRange.jl:181 [inlined]
 [14] f
    @ ./REPL[5]:1 [inlined]
 [15] call_with_reactant(::typeof(f), ::Reactant.TracedRArray{Float64, 1}, ::Reactant.TracedRNumber{Int64}, ::Reactant.TracedRNumber{Int64})
    @ Reactant ~/git/Reactant.jl/src/utils.jl:0
 [16] make_mlir_fn(f::typeof(f), args::Tuple{…}, kwargs::@NamedTuple{}, name::String, concretein::Bool; toscalar::Bool, return_dialect::Symbol, args_in_result::Symbol, construct_function_without_args::Bool, do_transpose::Bool, input_shardings::Nothing, output_shardings::Nothing, runtime::Val{…}, verify_arg_names::Nothing, argprefix::Symbol, resprefix::Symbol, resargprefix::Symbol, num_replicas::Int64, optimize_then_pad::Bool)
    @ Reactant.TracedUtils ~/git/Reactant.jl/src/TracedUtils.jl:345
 [17] make_mlir_fn
    @ ~/git/Reactant.jl/src/TracedUtils.jl:275 [inlined]
 [18] compile_mlir!(mod::Reactant.MLIR.IR.Module, f::typeof(f), args::Tuple{…}, compile_options::CompileOptions, callcache::Dict{…}, sdycache::Dict{…}, sdygroupidcache::Tuple{…}; fn_kwargs::@NamedTuple{}, backend::String, runtime::Val{…}, legalize_stablehlo_to_mhlo::Bool, client::Reactant.XLA.PJRT.Client, kwargs::@Kwargs{})
    @ Reactant.Compiler ~/git/Reactant.jl/src/Compiler.jl:1608
 [19] compile_mlir!
    @ ~/git/Reactant.jl/src/Compiler.jl:1570 [inlined]
 [20] compile_xla(f::Function, args::Tuple{…}; before_xla_optimizations::Bool, client::Nothing, serializable::Bool, kwargs::@Kwargs{…})
    @ Reactant.Compiler ~/git/Reactant.jl/src/Compiler.jl:3516
 [21] compile_xla
    @ ~/git/Reactant.jl/src/Compiler.jl:3488 [inlined]
 [22] compile(f::Function, args::Tuple{…}; kwargs::@Kwargs{…})
    @ Reactant.Compiler ~/git/Reactant.jl/src/Compiler.jl:3592
 [23] top-level scope
    @ ~/git/Reactant.jl/src/Compiler.jl:2661
Some type information was truncated. Use `show(err)` to see complete types.

julia> @jit f(x, 2, 5)
4-element ConcretePJRTArray{Float64,1}:
 1.0
 1.0
 1.0
 1.0

wsmoses · 2025-11-17T16:27:58Z

and I don't think this is terribly reactant-specific. The same core issue here equally applies to reversediff compiled [where the context will equally be baked it]. Just because reactant also has a way to circumvent the problem in special cases shouldn't mean it is treated differently here.

wsmoses · 2025-11-18T14:59:24Z

and I don't think this is terribly reactant-specific. The same core issue here equally applies to reversediff compiled [where the context will equally be baked it]. Just because reactant also has a way to circumvent the problem in special cases shouldn't mean it is treated differently here.

bumping this @gdalle are you okay not to trace the contexts?

gdalle · 2025-11-18T15:06:19Z

Not really. Many use cases of DI that I can think of require changing contexts, and Reactant has to be relevant for these cases too. Re-compiling the derivatives for each context changes is very impractical. On the other hand, asking users to pass RArrays instead of their normal arguments might make other backends fail, so the code on which DI runs is no longer fully generic, and I want to avoid that too. It would be like asking ForwardDiff users to pass arrays of Dual numbers.
Besides, that problem is not specific to contexts: what is stopping x itself (the active argument) from containing scalars that users may or may not want to trace?

I don't have a lot of bandwith these days, but I think the right solution might be to expose something like DI.to_reactant, telling users that the function will be called on every argument before Reactant compilation. That way, if they want to enforce specific tracing behavior, they can wrap their argument in a custom type and overload to_reactant, but it doesn't force them to

wsmoses · 2025-11-18T15:08:37Z

In order to be differentiated the data must be a reactant array. If we assume that DI only officially supports array inputs this is fine

gdalle · 2025-11-18T16:20:58Z

In order to be differentiated the data must be a reactant array. If we assume that DI only officially supports array inputs this is fine

"array" as in "RArray only" or as in "any nested struct of RArrays?

gdalle · 2025-11-18T16:41:06Z

By the way, this Reactant issue prevented me from testing DI.Cache here, if you happen to have a quick fix lying around. I can also modify the DI EnzymeExt source code if this is expected behavior

feat: add gradient with AutoReactant

2f13597

wsmoses reviewed Nov 16, 2025

View reviewed changes

DifferentiationInterface/ext/DifferentiationInterfaceReactantExt/onearg.jl Outdated Show resolved Hide resolved

wsmoses reviewed Nov 16, 2025

View reviewed changes

DifferentiationInterface/ext/DifferentiationInterfaceReactantExt/onearg.jl Outdated Show resolved Hide resolved

Include contexts

c7e7598

gdalle mentioned this pull request Nov 16, 2025

Compiled autodiff returns more stuff? EnzymeAD/Reactant.jl#1875

Open

wsmoses reviewed Nov 16, 2025

View reviewed changes

gdalle mentioned this pull request Nov 17, 2025

Force fields to have the same eltype albertomercurio/DeviceSparseArrays.jl#10

Open

feat: add gradient with AutoReactant #918

Are you sure you want to change the base?

feat: add gradient with AutoReactant #918

Uh oh!

Conversation

gdalle commented Nov 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

codecov bot commented Nov 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

gdalle commented Nov 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

wsmoses Nov 16, 2025

Choose a reason for hiding this comment

Uh oh!

wsmoses Nov 16, 2025

Choose a reason for hiding this comment

Uh oh!

gdalle Nov 16, 2025

Choose a reason for hiding this comment

Uh oh!

wsmoses Nov 16, 2025

Choose a reason for hiding this comment

Uh oh!

wsmoses Nov 16, 2025

Choose a reason for hiding this comment

Uh oh!

wsmoses Nov 16, 2025

Choose a reason for hiding this comment

Uh oh!

wsmoses commented Nov 16, 2025

Uh oh!

gdalle commented Nov 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

wsmoses commented Nov 16, 2025

Uh oh!

gdalle commented Nov 16, 2025

Uh oh!

gdalle commented Nov 16, 2025

Uh oh!

wsmoses commented Nov 16, 2025

Uh oh!

wsmoses commented Nov 16, 2025

Uh oh!

gdalle commented Nov 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gdalle commented Nov 17, 2025

Uh oh!

wsmoses commented Nov 17, 2025

Uh oh!

wsmoses commented Nov 17, 2025

Uh oh!

wsmoses commented Nov 18, 2025

Uh oh!

gdalle commented Nov 18, 2025

Uh oh!

wsmoses commented Nov 18, 2025

Uh oh!

gdalle commented Nov 18, 2025

Uh oh!

gdalle commented Nov 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

gdalle commented Nov 15, 2025 •

edited

Loading

codecov bot commented Nov 16, 2025 •

edited

Loading

gdalle commented Nov 16, 2025 •

edited

Loading

gdalle commented Nov 16, 2025 •

edited

Loading

gdalle commented Nov 17, 2025 •

edited

Loading

gdalle commented Nov 18, 2025 •

edited

Loading