Inlining arithmetic can produce slowdowns #13350

timholy · 2015-09-28T22:03:14Z

I've got an application where I'm seeing a big performance difference between

cm*vm + c*v + cp*vp

and

(cm*vm + c*v) + cp*vp

This is related to previous issues #6193 (comment) #5011 #7075 (comment) #10278. However, they're all closed, so I thought it would be better to start fresh.

This demo relies on ForwardDiff.jl, and uses the code in this gist.

Timing results:

julia> include("perf_parens.jl")
Warmup @time
  0.000001 seconds (3 allocations: 144 bytes)
Without parentheses:
  0.350804 seconds (16.00 M allocations: 488.282 MB, 19.45% gc time)
With parentheses:
  0.085496 seconds (15 allocations: 624 bytes)

Considerably more discussion can be found here.

Anything to do here? Or just tweak Interpolations.jl (which is where those functions get @generated) to insert the parentheses?

The text was updated successfully, but these errors were encountered:

jrevels · 2015-09-28T22:37:28Z

To add a little context, GradientNumber is the number type with forcibly inlined arithmetic. With that in mind, I think the most useful comparisons here might be found by calling the below code (after loading the gist linked by @timholy above):

G = ForwardDiff.GradientNumber{1,Float64,Tuple{Float64}}
gdx, gdy = rand(G), rand(G)

@code_llvm mygetindex_slow(A, gdx, gdy)
@code_llvm mygetindex_fast(A, gdx, gdy)

In Tim's actual gist, ForwardDiff uses G = ForwardDiff.GradientNumber{2,Float64,Tuple{Float64, Float64}}, but the single-component case shown above is more minimal for the sake of demonstrating the performance differences.

timholy · 2015-09-29T00:09:00Z

Actually, @jrevels, since this affects even the 1d case, we can make this even easier. It really is just the two arithmetic operations at the top. Would you be able to put together a demonstration in "raw" julia, without using ForwardDiff at all? Might help debug this.

jrevels · 2015-09-29T01:13:15Z

Would you be able to put together a demonstration in "raw" julia, without using ForwardDiff at all?

I wasn't able to reproduce this when the involved number types are Base number types; or do you mean an example with a simpler wrapper type than GradientNumber, on which some inlined arithmetic functions are defined?

These were broken by #11274 but no one noticed. Fixes #13350.

timholy · 2015-09-29T08:20:48Z

Yes, or just a version for ForwardDiff itself that strips the operations down to the essentials. If it can be reduced to ~20 loc or something, that might make it a lot easier for folks. (I frequently do that when I report bugs, but in this case you understand ForwardDiff far better than I.)

timholy · 2015-09-29T08:22:01Z

Ah, but see #13355!

These were broken by #11274 but no one noticed. Fixes #13350. (cherry picked from commit 7dfcf70) ref #13355

These were broken by #11274 but no one noticed. Fixes #13350.

JeffBezanson · 2015-10-22T21:46:59Z

I am now seeing much worse performance here, and it does not seem to be fixed by #13355.

pao · 2015-10-23T14:45:54Z

Reopening based on @JeffBezanson's comment from yesterday--this got autoclosed by #13355.

These were broken by JuliaLang#11274 but no one noticed. Fixes JuliaLang#13350.

yuyichao · 2015-12-15T22:47:43Z

Is this a dup of #12219 ?

simonster · 2015-12-16T02:17:55Z

The original issue was kind of the opposite of #12219: we had type information, but we weren't inlining aggressively enough to avoid calling * with a suboptimal calling convention. It was basically the same issue as #5011, and my fix was to fix the heuristic that @JeffBezanson introduced in 1316f81, which had been broken by inference refactoring in #11274. But I think there was a (different?) regression on master, which I haven't had time to take a look at.

simonster · 2015-12-17T18:36:11Z

The new regression that @JeffBezanson saw may be #14294.

vtjnash · 2016-02-27T21:10:59Z

confirmed the new issue was #14294, not the original report here

timholy mentioned this issue Sep 28, 2015

Disable forced inlining in arithmetic JuliaDiff/ForwardDiff.jl#58

Closed

tkelman added the performance Must go faster label Sep 28, 2015

simonster added a commit that referenced this issue Sep 29, 2015

Fix inference heuristic hacks

7dfcf70

These were broken by #11274 but no one noticed. Fixes #13350.

simonster mentioned this issue Sep 29, 2015

Fix inference heuristic hacks #13355

Merged

JeffBezanson added the kind:regression Regression in behavior compared to a previous version label Sep 29, 2015

simonster added a commit that referenced this issue Sep 30, 2015

Fix inference heuristic hacks

fef1fb8

These were broken by #11274 but no one noticed. Fixes #13350. (cherry picked from commit 7dfcf70) ref #13355

simonster added a commit that referenced this issue Oct 22, 2015

Fix inference heuristic hacks

d1e1fc3

These were broken by #11274 but no one noticed. Fixes #13350.

simonster added a commit that referenced this issue Oct 22, 2015

Fix inference heuristic hacks

9822e93

These were broken by #11274 but no one noticed. Fixes #13350.

simonster added a commit that referenced this issue Oct 22, 2015

Fix inference heuristic hacks

f8d2294

These were broken by #11274 but no one noticed. Fixes #13350.

This was referenced Oct 22, 2015

inlined code line numbers need work #13725

Closed

perf regression in @pure changes #13735

Closed

JeffBezanson closed this as completed in #13355 Oct 23, 2015

pao reopened this Oct 23, 2015

bjarthur pushed a commit to bjarthur/julia that referenced this issue Oct 27, 2015

Fix inference heuristic hacks

98f74d6

These were broken by JuliaLang#11274 but no one noticed. Fixes JuliaLang#13350.

jrevels mentioned this issue Nov 6, 2015

CI Performance Tracking for v0.5 #13893

Closed

4 tasks

jrevels added the kind:potential benchmark Could make a good benchmark in BaseBenchmarks label Nov 13, 2015

vtjnash closed this as completed Feb 27, 2016

KristofferC removed the kind:potential benchmark Could make a good benchmark in BaseBenchmarks label Oct 31, 2018

vtjnash mentioned this issue Nov 8, 2021

simplify inline cost computation, update docs #42997

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Inlining arithmetic can produce slowdowns #13350

Inlining arithmetic can produce slowdowns #13350

timholy commented Sep 28, 2015

jrevels commented Sep 28, 2015

timholy commented Sep 29, 2015

jrevels commented Sep 29, 2015

timholy commented Sep 29, 2015

timholy commented Sep 29, 2015

JeffBezanson commented Oct 22, 2015

pao commented Oct 23, 2015

yuyichao commented Dec 15, 2015

simonster commented Dec 16, 2015

simonster commented Dec 17, 2015

vtjnash commented Feb 27, 2016

Inlining arithmetic can produce slowdowns #13350

Inlining arithmetic can produce slowdowns #13350

Comments

timholy commented Sep 28, 2015

jrevels commented Sep 28, 2015

timholy commented Sep 29, 2015

jrevels commented Sep 29, 2015

timholy commented Sep 29, 2015

timholy commented Sep 29, 2015

JeffBezanson commented Oct 22, 2015

pao commented Oct 23, 2015

yuyichao commented Dec 15, 2015

simonster commented Dec 16, 2015

simonster commented Dec 17, 2015

vtjnash commented Feb 27, 2016