
Make broadcast recursion in flatten structural #29816

Merged 2 commits from kf/broadcastflatten into master on Oct 28, 2018
Conversation

@Keno (Member) commented Oct 26, 2018

The inference enhancements in #29294 work quite well to prevent limiting
on many kinds of code. However, targeting TPUs, one code pattern it
struggled with was a fairly large broadcast fusion in Flux:

    λ.(reshape(γ, affine_shape...) .* ((x .- μ) ./ σ) .+ reshape(β, affine_shape...))

The reason #29294 doesn't trigger is that the make_makeargs function used by the
implementation of Broadcast.flatten (which the TPU backend uses) had
a non-decreasing first argument (passing the return value of a previous
invocation of make_makeargs back in as the first argument). However,
that's not a fundamental limitation of the operation, but rather an
implementation choice. This PR switches that function's recursion pattern
to be purely structural, allowing inference to infer through it (with
the changes in #29294). As a result, ResNet50 infers properly.

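For context, here is a minimal sketch in plain Julia of the two recursion styles (toy code, not the actual make_makeargs implementation; the names build_grow and build_structural are made up for illustration):

    # Accumulator-passing, like the old make_makeargs: the partial result `f`
    # is threaded back in as the first argument, so that argument's type never
    # shrinks across recursive calls and the recursion-limiting heuristic
    # widens the inferred result.
    build_grow(f, t::Tuple{}) = f
    build_grow(f, t::Tuple) = build_grow(args -> (f(args)..., first(t)), Base.tail(t))

    # Purely structural, like this PR: each call recurses only on Base.tail(t),
    # a strictly smaller tuple type, so inference can walk the whole recursion.
    build_structural(t::Tuple{}) = ()
    build_structural(t::Tuple) = (first(t), build_structural(Base.tail(t))...)

    # Both produce the same flat tuple; only the structural version infers precisely.
    @assert build_grow(args -> (), (1, 2.0, 3))(nothing) == (1, 2.0, 3)
    @assert build_structural((1, 2.0, 3)) == (1, 2.0, 3)

And a small usage example of the entry point this feeds into, using only the existing Broadcast.flatten API:

    using Base.Broadcast: broadcasted, flatten, materialize

    x, y = [1, 2, 3], [10, 20, 30]
    bc  = broadcasted(+, x, broadcasted(*, y, 2))  # nested lazy broadcast tree
    fbc = flatten(bc)                              # one flat call over (x, y, 2)
    @assert materialize(fbc) == x .+ y .* 2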
@Keno requested a review from @mbauman October 26, 2018 18:24
@Keno (Member, Author) commented Oct 26, 2018

@nanosoldier runbenchmarks(ALL, vs = ":master")

Review thread on base/broadcast.jl (outdated, resolved)
@mbauman self-assigned this Oct 26, 2018
Co-Authored-By: mbauman <mbauman@gmail.com>
@mbauman (Member) left a comment

Thank you for these comments!

@nanosoldier (Collaborator) commented

Your benchmark job has completed - possible performance regressions were detected. A full report can be found here. cc @ararslan

@Keno merged commit b84fe52 into master Oct 28, 2018
@vtjnash deleted the kf/broadcastflatten branch October 28, 2018 19:26
@andreasnoack (Member) commented

This PR seems to have broken Flux. cc @MikeInnes
