Make Iterators.partition split arrays into views for faster and easier parallelism #33533

mbauman · 2019-10-11T18:59:56Z

I've frequently wanted to use Iterators.partition to subdivide an iteration space into chunks amenable to parallelism, but its implementation — which defaulted to copying all elements of each partition into a new Vector — left some efficiencies to be desired. It did have one special case — wherein a partition of a Vector itself would helpfully use views. This PR makes that the default for all AbstractArrays. It goes further and adds very helpful performance specializations for some ranges and CartesianIndices essential for doing loops over partition(eachindex(...)).

This PR is divided into three commits:

The minor change, which makes all AbstractArrays return views
The optimization, which speeds up partitions of common ranges and CartesianIndices
The use-case, which makes BitArray broadcasts simpler, simdier, and faster. This was the use-case that drove my optimizations, and I put together a series of benchmarks. The short version is that by using partition we can easily split the algorithm into a @simdable section, leading to up to 10x speedups for code like a .== 0:

base/broadcast.jl

JeffBezanson · 2019-10-11T19:40:12Z

Cool!!

StefanKarpinski · 2019-10-11T20:19:50Z

Argument for why this change is an acceptable minor change even though it is technically breaking:

since the partitions were previously copies, mutating them was almost always pointless
the main case that could break is if someone was hanging onto a partition and then mutating the main array or keeping the partition as an independent array from the original and mutating that

Both of these potential usages seem pretty unlikely. The main usage of this seems like it would be to just look at different partitions of the original array without doing any mutation of the original or the partition. With this change, a new potential use case becomes possible: modifying the original array by operating on partitions. Going back from views to copies would be more breaking since it would make code that used that use case stop working.

StefanKarpinski · 2019-10-11T20:21:09Z

Another nice coincidence: @vtjnash is doing compiler work that will make taking views usually non-allocating and therefore much more efficient in many more cases, so this could get even more efficient by the time 1.4 is actually ready.

ararslan · 2019-10-11T20:33:14Z

FWIW I've been using partition a lot lately and I didn't even realize it wasn't using views (nor did I do any mutation), so I'd second Stefan's comment:

Both of these potential usages seem pretty unlikely

This is great!

StefanKarpinski · 2019-10-15T19:50:24Z

Hasn't been officially triaged, but if anyone has any objections, please post them here. I think this seems popular enough based on the reactions to be merged without a triage debate.

Previously only `::Vector` was special-cased to use views. The trade-off here is that we lose the ability to predict the concrete eltype -- since arrays can potentially choose to return something different from `vec` or `view`. Generic iterables still collect their elements into a freshly-allocated `Vector`, like before.

And also recompute ranges instead of using views for partitions of ranges. Since `Iterators.partition` is so handy for dividing up iteration spaces, it makes sense to optimize this as much as possible. While it is enherently a "linear" operation, it is a batched linear operation that allows us to skip doing all the effective ind2sub work on every single iteration.

base/multidimensional.jl

vtjnash · 2019-10-15T21:56:29Z

base/iterators.jl

+function iterate(itr::PartitionIterator{<:AbstractRange}, state=1)
+    state > length(itr.c) && return nothing
+    r = min(state + itr.n - 1, length(itr.c))
+    return @inbounds itr.c[state:r], r + 1


OT: Are we missing an abstraction here: should we define that view(::AbstractRange, slice::AbstractRange) isa AbstractRange, or would that confuse consumes or view?

That's #26872 — looks like it got stalled because it was proposed before we really had a handle on minor changes.

shorter lines, more spaces, and comments (because I had already forgotten how this worked myself)

mbauman · 2019-10-31T20:54:16Z

Triage was in favor and I addressed the style review... and while I was at it I added a few more comments because I didn't understand how this worked anymore.

Sacha0 · 2019-11-05T20:32:39Z

Very nice! :)

KristofferC reviewed Oct 11, 2019

View reviewed changes

base/broadcast.jl Show resolved Hide resolved

mbauman added 3 commits October 15, 2019 15:29

Simplify, simdify, and speed up BitArray broadcasts with partitions

9e10015

mbauman force-pushed the mb/fast-partitions branch from 6f6c88e to 9e10015 Compare October 15, 2019 20:29

vtjnash reviewed Oct 15, 2019

View reviewed changes

base/multidimensional.jl Outdated Show resolved Hide resolved

vtjnash reviewed Oct 15, 2019

View reviewed changes

mbauman and others added 2 commits October 31, 2019 15:05

NFC: style changes per review

e472b14

shorter lines, more spaces, and comments (because I had already forgotten how this worked myself)

Merge branch 'master' into mb/fast-partitions

47e49ab

mbauman removed the triage This should be discussed on a triage call label Oct 31, 2019

mbauman merged commit 3f0e7d6 into master Nov 1, 2019

mbauman deleted the mb/fast-partitions branch November 1, 2019 22:30

omus mentioned this pull request Mar 5, 2020

Differences in Base.Iterators.partition and IterTools.partition JuliaCollections/IterTools.jl#67

Open

jonas-schulze mentioned this pull request May 12, 2020

Should sinpi and cospi return integers for integer input? #35820

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make Iterators.partition split arrays into views for faster and easier parallelism #33533

Make Iterators.partition split arrays into views for faster and easier parallelism #33533

mbauman commented Oct 11, 2019 •

edited

Loading

JeffBezanson commented Oct 11, 2019

StefanKarpinski commented Oct 11, 2019

StefanKarpinski commented Oct 11, 2019 •

edited

Loading

ararslan commented Oct 11, 2019 •

edited

Loading

StefanKarpinski commented Oct 15, 2019

vtjnash Oct 15, 2019

mbauman Oct 15, 2019

mbauman commented Oct 31, 2019 •

edited

Loading

Sacha0 commented Nov 5, 2019

Make Iterators.partition split arrays into views for faster and easier parallelism #33533

Make Iterators.partition split arrays into views for faster and easier parallelism #33533

Conversation

mbauman commented Oct 11, 2019 • edited Loading

JeffBezanson commented Oct 11, 2019

StefanKarpinski commented Oct 11, 2019

StefanKarpinski commented Oct 11, 2019 • edited Loading

ararslan commented Oct 11, 2019 • edited Loading

StefanKarpinski commented Oct 15, 2019

vtjnash Oct 15, 2019

Choose a reason for hiding this comment

mbauman Oct 15, 2019

Choose a reason for hiding this comment

mbauman commented Oct 31, 2019 • edited Loading

Sacha0 commented Nov 5, 2019

mbauman commented Oct 11, 2019 •

edited

Loading

StefanKarpinski commented Oct 11, 2019 •

edited

Loading

ararslan commented Oct 11, 2019 •

edited

Loading

mbauman commented Oct 31, 2019 •

edited

Loading