Get rid of ugly macro? #123

gdalle · 2023-12-21T09:28:36Z

I was chatting with @adrhill and he suggested that the macro @primitive could be discarded if each backend simply implemented some methods from AbstractDifferentiation, mostly jacobian and a pushforward or pullback. Thoughts?


adrhill · 2023-12-22T01:01:28Z

To elaborate on this:

What the macro does

As far as I understand, the @primitive macro is used on pullbacks/pushforwards from individual backends to generate the following AD.jacobian functions:

Forward-mode AD:

AbstractDifferentiation.jl/src/AbstractDifferentiation.jl

Lines 600 to 634 in 211b675

    
    function define_pushforward_function_and_friends(fdef)
        fdef[:name] = :($(AbstractDifferentiation).pushforward_function)
        args = fdef[:args]
        funcs = quote
            $(ExprTools.combinedef(fdef))
            function $(AbstractDifferentiation).jacobian($(args...))
                identity_like = $(identity_matrix_like)($(args[3:end]...))
                pff = $(pushforward_function)($(args...))
                if eltype(identity_like) <: Tuple{Vararg{Union{AbstractMatrix,Number}}}
                    return map(identity_like) do identity_like_i
                        return mapreduce(hcat, $(_eachcol).(identity_like_i)...) do (cols...)
                            pff(cols)
                        end
                    end
                elseif eltype(identity_like) <: AbstractMatrix
                    # needed for the computation of the Hessian and Jacobian
                    ret = hcat.(mapslices(identity_like[1]; dims=1) do cols
                        # cols loop over basis states
                        pf = pff((cols,))
                        if typeof(pf) <: AbstractVector
                            # to make the hcat. work / get correct matrix-like, non-flat output dimension
                            return (pf,)
                        else
                            return pf
                        end
                    end...)
                    return ret isa Tuple ? ret : (ret,)
                else
                    return pff(identity_like)
                end
            end
        end
        return funcs
    end

Reverse-mode AD:

AbstractDifferentiation.jl/src/AbstractDifferentiation.jl

Lines 636 to 663 in 211b675

    
    function define_value_and_pullback_function_and_friends(fdef)
        fdef[:name] = :($(AbstractDifferentiation).value_and_pullback_function)
        args = fdef[:args]
        funcs = quote
            $(ExprTools.combinedef(fdef))
            function $(AbstractDifferentiation).jacobian($(args...))
                value, pbf = $(value_and_pullback_function)($(args...))
                identity_like = $(identity_matrix_like)(value)
                if eltype(identity_like) <: Tuple{Vararg{AbstractMatrix}}
                    return map(identity_like) do identity_like_i
                        return mapreduce(vcat, $(_eachcol).(identity_like_i)...) do (cols...)
                            pbf(cols)'
                        end
                    end
                elseif eltype(identity_like) <: AbstractMatrix
                    # needed for Hessian computation:
                    # value is a (grad,). Then, identity_like is a (matrix,).
                    # cols loops over columns of the matrix
                    return vcat.(mapslices(identity_like[1]; dims=1) do cols
                        adjoint.(pbf((cols,)))
                    end...)
                else
                    return adjoint.(pbf(identity_like))
                end
            end
        end
        return funcs
    end

These functions compute full Jacobians by evaluating the pullbacks/pushforwards on the standard basis (identity_like).
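To make the standard-basis idea concrete, here is a minimal sketch that is independent of AbstractDifferentiation. It assumes a toy function f(x) = A * x whose pushforward and pullback are written by hand; the names `pushforward`, `pullback`, `jacobian_from_pushforward` and `jacobian_from_pullback` are hypothetical and not part of any package.

```julia
using LinearAlgebra

# Toy setup (assumption): f(x) = A * x, so the Jacobian is A everywhere.
A = [1.0 2.0 3.0; 4.0 5.0 6.0]
pushforward(v) = A * v    # JVP: J * v, hand-written stand-in for a forward-mode backend
pullback(w) = A' * w      # VJP: J' * w, hand-written stand-in for a reverse-mode backend

# Forward mode: one pushforward per input dimension, each giving one column of J.
function jacobian_from_pushforward(pushforward, n_inputs)
    basis = Matrix{Float64}(I, n_inputs, n_inputs)
    return reduce(hcat, [pushforward(e) for e in eachcol(basis)])
end

# Reverse mode: one pullback per output dimension, each giving one row of J.
function jacobian_from_pullback(pullback, n_outputs)
    basis = Matrix{Float64}(I, n_outputs, n_outputs)
    return reduce(vcat, [pullback(e)' for e in eachcol(basis)])
end

J_fwd = jacobian_from_pushforward(pushforward, 3)
J_rev = jacobian_from_pullback(pullback, 2)
```

Both routes recover the same matrix; they differ only in how many primitive calls are needed (number of inputs versus number of outputs).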

Fallback behavior

By default, the fallback jacobian function is empty (maybe this should be replaced by a NotImplementedError):

AbstractDifferentiation.jl/src/AbstractDifferentiation.jl

Line 82 in 211b675

    function jacobian(ab::AbstractBackend, f, xs...) end
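A small sketch of why the empty fallback is risky and what the alternatives could look like. The function names below are hypothetical. Julia's Base has no NotImplementedError type as far as I am aware, so the sketch uses MethodError (from a method-less generic function) and ArgumentError instead.

```julia
# Mirrors the current fallback: an empty body returns `nothing` without complaint.
function silent_jacobian(backend, f, xs...) end

# Alternative 1: declare the generic function with no methods, so that calling
# it for an unsupported backend raises a MethodError.
function undefined_jacobian end

# Alternative 2: keep a fallback method, but make it fail with an explicit message.
function loud_jacobian(backend, f, xs...)
    throw(ArgumentError("jacobian is not implemented for $(typeof(backend))"))
end

result = silent_jacobian(:toy_backend, sin, 1.0)  # silently `nothing`
```

With the silent version, the failure only shows up later, when some caller tries to use `nothing` as a Jacobian.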

As shown in the implementer guide, this jacobian function is the fallback at the core of most functions exported by AbstractDifferentiation.

Taking reverse-mode AD as an example, the function dependency graph of value_and_pullback_function would look as follows:

- value_and_pullback_function calls jacobian
- jacobian is an empty function

Now, once a reverse-mode AD backend is loaded, value_and_pullback_function is defined for that backend, and @primitive is applied to this definition, the function dependency graph is inverted:

- value_and_pullback_function calls the backend
- a new generated jacobian calls value_and_pullback_function

The second behaviour is desired, as we wouldn't want to compute a full Jacobian just to compute a VJP when we can instead evaluate the pullback directly.
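The two dependency graphs can be reproduced with plain dispatch. This is only a sketch under strong assumptions: the backends are toys that can only "differentiate" a hypothetical `LinearMap` type with a known derivative, and every name here (`ToyBackend`, `JacobianOnlyBackend`, `PullbackBackend`, `toy_jacobian`, `toy_value_and_pullback_function`) is made up for illustration, not taken from AbstractDifferentiation.

```julia
using LinearAlgebra

# Toy differentiable function with a known derivative (assumption for illustration).
struct LinearMap
    A::Matrix{Float64}
end
(m::LinearMap)(x) = m.A * x

abstract type ToyBackend end
struct JacobianOnlyBackend <: ToyBackend end  # implements only a Jacobian
struct PullbackBackend <: ToyBackend end      # implements a pullback primitive

# Graph 1: the generic fallback derives the pullback from a full Jacobian.
function toy_value_and_pullback_function(b::ToyBackend, f, x)
    J = toy_jacobian(b, f, x)
    return f(x), w -> J' * w
end
toy_jacobian(::JacobianOnlyBackend, f::LinearMap, x) = f.A

# Graph 2: the backend supplies the pullback directly ...
function toy_value_and_pullback_function(::PullbackBackend, f::LinearMap, x)
    return f(x), w -> f.A' * w
end

# ... and the Jacobian is assembled from it, one pullback call per output
# (this plays the role of the method that the macro generates).
function toy_jacobian(b::PullbackBackend, f, x)
    y, pb = toy_value_and_pullback_function(b, f, x)
    basis = Matrix{Float64}(I, length(y), length(y))
    return reduce(vcat, [pb(e)' for e in eachcol(basis)])
end

m = LinearMap([1.0 2.0; 3.0 4.0; 5.0 6.0])
x = [1.0, 1.0]
w = [1.0, 0.0, 0.0]
_, pb_via_jacobian = toy_value_and_pullback_function(JacobianOnlyBackend(), m, x)
_, pb_direct = toy_value_and_pullback_function(PullbackBackend(), m, x)
```

Both pullbacks return the same VJP, but only the first one materializes a Jacobian to get there.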

The fact that the function dependency graph is flipped was very confusing to me at first. A lot of hidden control flow is added via package extensions and the @primitive macro, which currently isn't documented in the implementer guide.

Back to the question

Why is AD.jacobian so central to AbstractDifferentiation.jl and why does it have to be generated via a macro? Can't it be implemented in a more generic way by making sure pullbacks and pushforward wrappers have consistent output types?

The only advantage I currently see is that it allows users to

- compute VJPs by constructing a full Jacobian using JVPs
- compute JVPs by constructing a full Jacobian using VJPs

but those sound like things that should usually be avoided.
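A rough illustration of the cost involved, assuming a toy function f(x) = A * x with hand-written JVP and VJP (all names below are hypothetical): obtaining one VJP through a Jacobian built from JVPs takes one pushforward call per input dimension, whereas the direct pullback takes a single call.

```julia
# Toy setup (assumption): f(x) = A * x with 3 inputs and 2 outputs.
A = [1.0 2.0 3.0; 4.0 5.0 6.0]

# Hand-written JVP that counts how often it is called.
jvp_calls = Ref(0)
function toy_jvp(v)
    jvp_calls[] += 1
    return A * v
end

# Indirect VJP: build the full Jacobian column by column from JVPs, then multiply.
function vjp_via_jvps(w, n_inputs)
    cols = [toy_jvp([i == j ? 1.0 : 0.0 for i in 1:n_inputs]) for j in 1:n_inputs]
    J = reduce(hcat, cols)
    return J' * w
end

# Direct VJP from a hand-written pullback: a single call.
toy_vjp(w) = A' * w

w = [1.0, -1.0]
indirect = vjp_via_jvps(w, 3)
direct = toy_vjp(w)
```

After running this, `jvp_calls[]` equals the number of inputs, which is the overhead paid for a result that one pullback call already provides.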

Why isn't AbstractDifferentiation.jl built around two primitives, value_and_pullback_function and value_and_pushforward ¹, with more liberal use of dispatch on the AbstractReverseMode and AbstractForwardMode types?

¹ Ideally with in-place mutating variants.
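A sketch of what such a design could look like, without any macro. Everything here is an assumption for illustration: the `Sketch*` types only mimic the role of AbstractForwardMode and AbstractReverseMode, the `sketch_*` functions are hypothetical, and the toy backends can only handle a hypothetical `ToyLinear` function type with a known derivative.

```julia
using LinearAlgebra

# Hypothetical mode hierarchy, mimicking AbstractForwardMode / AbstractReverseMode.
abstract type SketchBackend end
abstract type SketchForwardMode <: SketchBackend end
abstract type SketchReverseMode <: SketchBackend end

struct ToyForward <: SketchForwardMode end
struct ToyReverse <: SketchReverseMode end

# Toy function with a known derivative, so the primitives can be written by hand.
struct ToyLinear
    A::Matrix{Float64}
end
(f::ToyLinear)(x) = f.A * x

# Primitive for forward mode: returns f(x) and the JVP J * v.
sketch_value_and_pushforward(::ToyForward, f::ToyLinear, x, v) = (f(x), f.A * v)

# Primitive for reverse mode: returns f(x) and a closure computing the VJP J' * w.
sketch_value_and_pullback_function(::ToyReverse, f::ToyLinear, x) = (f(x), w -> f.A' * w)

# Generic Jacobian, written once per mode and selected by dispatch.
function sketch_jacobian(b::SketchForwardMode, f, x)
    basis = Matrix{Float64}(I, length(x), length(x))
    cols = [last(sketch_value_and_pushforward(b, f, x, e)) for e in eachcol(basis)]
    return reduce(hcat, cols)
end

function sketch_jacobian(b::SketchReverseMode, f, x)
    y, pb = sketch_value_and_pullback_function(b, f, x)
    basis = Matrix{Float64}(I, length(y), length(y))
    return reduce(vcat, [pb(e)' for e in eachcol(basis)])
end

f = ToyLinear([1.0 2.0 3.0; 4.0 5.0 6.0])
x = [1.0, 2.0, 3.0]
J_forward = sketch_jacobian(ToyForward(), f, x)
J_reverse = sketch_jacobian(ToyReverse(), f, x)
```

In this layout a backend only implements the primitive for its mode, and the Jacobian never has to be generated per backend.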

devmotion · 2024-01-07T21:16:17Z

Duplicate of #13, or at least #13 (comment) and the following discussion?

oxinabox · 2024-01-08T02:32:24Z

> Why is AD.jacobian so central to AbstractDifferentiation.jl
> Why isn't AbstractDifferentiation.jl built around two primitives value_and_pullback_function and value_and_pushforward

Historical reasons, IIRC: mainly that the original author had a strong enough understanding of the calculus involved, but not such a strong understanding of autodiff or Julia abstractions, and that the priority was on getting something out that worked and was usable. It should be.

mohamed82008 · 2024-01-08T05:40:33Z

This issue is my fault. Feel free to remove the macro if it makes things simpler.

devmotion · 2024-01-08T09:01:27Z

BTW, regarding

> Why is AD.jacobian so central to AbstractDifferentiation.jl and why does it have to be generated via a macro? Can't it be implemented in a more generic way by making sure pullbacks and pushforward wrappers have consistent output types?

#95 trimmed down the macro; it can now only be used to implement jacobian based on a pushforward_function or a value_and_pullback_function. Support for ReverseDiff and FiniteDifferences is already implemented without the macro, and ForwardDiff, for example, uses the automatically constructed jacobian function only for functions with multiple arguments (the single-argument version just calls ForwardDiff.jacobian).

mohamed82008 · 2024-03-13T11:47:13Z

As I mentioned in #13 (comment) and #123 (comment), I am ok with removing the macro. It is currently a thin wrapper over a pushforward or pullback definition. Feel free to open a PR.

gdalle added the question Inquiries and discussions label Dec 21, 2023

gdalle mentioned this issue Feb 5, 2024

Maybe AbstractDifferentiation should shrink to a collection of names? #129

Open

gdalle mentioned this issue Mar 13, 2024

Comparison with DifferentiationInterface.jl #131

Open
