Rules for turning lazy broadcasted to eager one? #104

tkf · 2019-09-13T08:36:51Z

Zygote.jl has some definitions for broadcasted such as this:

@adjoint function broadcasted(::typeof(tanh), x::Numeric)
  y = tanh.(x)
  y, ȳ -> (nothing, ȳ .* conj.(1 .- y.^2))
end

Notice ed, not broadcast. It's converting a lazy broadcast to eager one to store the temporary values.

I'm wondering if this kind of rules should be ported to ChainRules.jl. I think the way ChainRulesCore.jl express the rules cannot be used to auto-generate this kind of specializations (both before and after JuliaDiff/ChainRulesCore.jl#30). However, implementing this rule in ChainRules.jl means that AD engines cannot choose to use it or not.

Does it make sense to have it in ChainRules.jl? Or should it be done for each AD implementation?

(Maybe related to @willtebbutt's question here #12 (comment) ?)

The text was updated successfully, but these errors were encountered:

oxinabox · 2019-09-13T09:33:46Z

Yes, this is a reasonable rule to have.

The reason rrule (and similarly frule)
allows for having the first return value not always be directly calling the function,
is that sometimes there is a different way to call it that makes doing the pullback (similar pushforward) much faster,
and without unreasonable overhead (for some definition of unreasonable)

However, implementing this rule in ChainRules.jl means that AD engines cannot choose to use it or not.

Not entirely true, AD engines can opt out of particular ChainRules rules,
e.g. at least initially Zygote will be opting out of the 6 that return Wirtinger.
And also probably a bunch of other things that rely on the specifics of the AD internals too.
(Most of this is done by just defining things in Zygotes rule system (i.e. directly overloading _forward, which takes precidence over using ChainRules' definitions))

While opting out has to more or less be on a case by case basic,
I think that is actually fine for this case.
Someone notices a particular performance issue that is tied to a particular behavour in a particular instancem, and they opt out of thjat instance.

At some point we might want to consider attaching metadata via say traits to rules to make opt-ing out of particular groups easier. (Or maybe to allow use to define a meta_rrule(traits...) that gives back an new general purpose rrule function that will follow.).
But that is a way down the road.

tkf · 2019-09-13T18:34:13Z

Not entirely true, AD engines can opt out of particular ChainRules rules,

i.e. directly overloading _forward, which takes precidence over using ChainRules' definitions

I see. This is a really nice way of implementing it. Thanks for pointing this out.

tkf · 2019-09-27T06:10:36Z

Is it solved?

oxinabox · 2019-09-27T12:09:24Z

I think so?
The question was:

Does it make sense to have it in ChainRules.jl?

the answer was:

Yes, this is a reasonable rule to have.

tkf · 2019-09-27T22:51:27Z

I see. That makes sense. I was somehow thinking this would be closed by porting some related rules from Zyogte. But that's not the OP said and I guess we don't need an individual issue to track it.

oxinabox · 2019-09-28T08:35:40Z

I do need to sit down and write a script to find all rules from Zygote that we haven't ported from Nabla alrrady.
But yes that is another issue.

oxinabox mentioned this issue Sep 13, 2019

introdocs + demo + instructions on writing good rules #103

Merged

oxinabox closed this as completed Sep 26, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Rules for turning lazy broadcasted to eager one? #104

Rules for turning lazy broadcasted to eager one? #104

tkf commented Sep 13, 2019

oxinabox commented Sep 13, 2019

tkf commented Sep 13, 2019

tkf commented Sep 27, 2019

oxinabox commented Sep 27, 2019

tkf commented Sep 27, 2019

oxinabox commented Sep 28, 2019

Rules for turning lazy broadcasted to eager one? #104

Rules for turning lazy broadcasted to eager one? #104

Comments

tkf commented Sep 13, 2019

oxinabox commented Sep 13, 2019

tkf commented Sep 13, 2019

tkf commented Sep 27, 2019

oxinabox commented Sep 27, 2019

tkf commented Sep 27, 2019

oxinabox commented Sep 28, 2019