Automatic interpolation to avoid global variable issues #65

rdeits · 2017-07-07T21:56:05Z

After approximately the zillionth time seeing people get confusing or incorrect benchmark results because they did:

@benchmark foo(x)

instead of

@benchmark foo($x)

I started wondering if maybe we could do something to avoid forcing this cognitive burden on users.

As inspiration, I've used the following macro in the unit tests to measure "real" allocations from a single execution of a function:

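macro wrappedallocs(expr)
    # One fresh name per element of the call (the callee plus each argument)
    argnames = [gensym() for a in expr.args]
    quote
        # Wrapper whose body repeats the call using only its own arguments
        function g($(argnames...))
            @allocated $(Expr(expr.head, argnames...))
        end
        # Call the wrapper with the original callee and arguments
        $(Expr(:call, :g, [esc(a) for a in expr.args]...))
    end
end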

@wrappedallocs f(x) turns @allocated f(x) into something more like:

function g(_x)
  @allocated f(_x)
end
g(x)

which does the same computation but measures the allocations inside the wrapped function instead of at global scope.

It might be possible to do something like this for benchmarking. This particular implementation is wrong: in @wrappedallocs f(g(x)), the inner call g(x) is evaluated outside the wrapper and passed in as an argument, so only the allocations of f() are measured, not those of g(). A similar approach, which walks the expression to collect all the symbols and then passes those symbols through a new outer function, might work.
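The expression walk could look roughly like the following. This is only a sketch: `collect_symbols` is a hypothetical helper name, not part of BenchmarkTools, and it naively treats every symbol as something to pass in, including function names, operators, keyword-argument names, and variables assigned inside the expression itself.

```julia
# Hypothetical helper: gather every distinct symbol in an expression,
# in order of first appearance.
function collect_symbols!(syms::Vector{Symbol}, ex)
    if ex isa Symbol
        ex in syms || push!(syms, ex)
    elseif ex isa Expr
        for arg in ex.args
            collect_symbols!(syms, arg)
        end
    end
    # Literals, QuoteNodes, and LineNumberNodes are ignored.
    return syms
end

collect_symbols(ex) = collect_symbols!(Symbol[], ex)

collect_symbols(:(f(g(y), x)))  # returns [:f, :g, :y, :x]
```

A real implementation would need to filter this list, for example by skipping names that are bound inside the benchmarked code.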

The result would be that

@benchmark f(g(y), x)

would turn into something like

function _wrapper(_f, _g, _y, _x)
  @_benchmark _f(_g(_y), _x)
end
_wrapper(f, g, y, x)

where @_benchmark does basically what regular @benchmark does right now. Passing _f and _g as arguments is not necessary if they're regular functions, but it is necessary if they're arbitrary callable objects.
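As a rough sketch of that transformation, the code below builds the wrapper definition and the matching call as plain expressions. All names here (`collect_symbols`, `rename`, `wrap_in_function`, `wrapper`) are made up for illustration and are not BenchmarkTools API. The body is left bare where the proposal would place `@_benchmark`, and the `_`-prefixed parameter names stand in for the `gensym()` names a real macro would use.

```julia
# Hypothetical helpers for illustration only.

# Collect distinct symbols in order of first appearance.
function collect_symbols(ex, syms = Symbol[])
    if ex isa Symbol
        ex in syms || push!(syms, ex)
    elseif ex isa Expr
        foreach(arg -> collect_symbols(arg, syms), ex.args)
    end
    return syms
end

# Substitute symbols according to `renames`; leave all other nodes untouched.
rename(ex, renames) = ex
rename(s::Symbol, renames) = get(renames, s, s)
rename(ex::Expr, renames) = Expr(ex.head, [rename(a, renames) for a in ex.args]...)

# Build (definition, call) expressions for a wrapper around `ex`.
function wrap_in_function(ex; name = :wrapper)
    syms = collect_symbols(ex)
    # A real macro would use gensym() here to avoid name clashes.
    renames = Dict(s => Symbol("_", s) for s in syms)
    params = [renames[s] for s in syms]
    def = Expr(:function, Expr(:call, name, params...),
               Expr(:block, rename(ex, renames)))
    call = Expr(:call, name, syms...)
    return def, call
end

def, call = wrap_in_function(:(f(g(y), x)))
# def  is equivalent to: function wrapper(_f, _g, _y, _x); _f(_g(_y), _x); end
# call is equivalent to: wrapper(f, g, y, x)
```

Evaluating `def` and then `call` at global scope, with `f`, `g`, `y`, and `x` defined there, would run the original computation inside the wrapper.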

The question is: is this a good idea? This makes BenchmarkTools more complicated, and might involve too much magic. I also haven't thought through how to integrate this with the setup arguments. I'm mostly just interested in seeing if this is something that's worth spending time on.

One particular concern I have is if the user tries to benchmark a big block of code, we may end up with the wrapper function taking a ridiculous number of arguments, which I suspect is likely to be handled badly by Julia. Fortunately, the macro can at least detect that case and demand that the user manually splice in their arguments.


rdeits · 2017-07-07T22:00:19Z

Or, to summarize all of the above: "transforming foo(x) into foo($x) is tedious, so we should make computers do it"

jrevels · 2017-07-09T15:18:15Z

I'm worried the payoff here wouldn't be worth the pain. This is what some older iterations of BenchmarkTools (Benchmarks.jl, BenchmarkTrackers.jl, etc.) tried to do, and I recall it being quite tricky to get right. Note also that the primary reason interpolation exists isn't necessarily performance, but transporting locally-scoped variables into benchmark scope (which is always top-level on purpose):

julia> using BenchmarkTools

julia> for i in 1:3
           @benchmark println(i) evals=1 samples=1
       end
ERROR: UndefVarError: i not defined

julia> for i in 1:3
           @benchmark println($i) evals=1 samples=1
       end
1
1
2
2
3
3
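The scoping point can be illustrated without BenchmarkTools. The sketch below is an analogy using Base's `eval`, which likewise runs code at top level. It does not show how `@benchmark` is implemented, and the function names are made up for the example.

```julia
# Analogy only: `eval` always runs code at module (top-level) scope,
# so expressions evaluated this way cannot see a function's local variables.

function without_interpolation()
    for local_i in 1:3
        try
            eval(:(local_i + 1))   # looks up a *global* named `local_i`
        catch err
            return err             # an UndefVarError when no such global exists
        end
    end
end

function with_interpolation()
    results = Int[]
    for local_i in 1:3
        # `$local_i` splices the current value into the expression
        # before it is evaluated at top level.
        push!(results, eval(:($local_i + 1)))
    end
    return results
end

without_interpolation() isa UndefVarError  # true, assuming no global `local_i`
with_interpolation()                       # returns [2, 3, 4]
```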

From my experience teaching people BenchmarkTools (or correcting their usage), the interpolation feature isn't hard to use or understand - as soon as users know about it, they pick it up easily. The problem is that people just don't know about it because they don't read the docs.

A less heavy-handed solution might be to put a short "Quick Start Example" or something similar in the README and have it use interpolation everywhere. Or we could print a warning on package load (easily disabled via a .juliarc.jl flag).

Also, I actually use the interpolation feature pretty often to measure the effects of "toggling" the global-ness of specific variables. This is more useful for package authors than end-users, but if we did automatic interpolation, we'd need some per-variable way of toggling it (maybe just via $, but that would be inverting the normal Julia meaning of $).

rdeits · 2017-07-09T18:33:03Z

Ok, that all makes sense, thanks. I still want this feature for my own work, so I may try implementing it in a new package (which would use BenchmarkTools under the hood).

rdeits closed this as completed Jul 9, 2017

jrevels mentioned this issue Aug 14, 2017

broken on master: invalid syntax #69

Closed

jrevels mentioned this issue Sep 10, 2018

Document necessity to interpolate variables into top-level execution environment #111

Open

jlperla mentioned this issue Sep 21, 2018

Simple version of @btime with automatic interpolation #118

Open
