RFC: Deprecate implicit scalar broadcasting in setindex! #26347

mbauman · 2018-03-07T00:52:20Z

The current setindex! function (the A[i] = b syntax) does three very different operations:

Given an index i that refers to only one location (scalar indexing), A[i] = b modifies A in that location to hold b.
Given an index I that refers to more than one location (non-scalar indexing), A[I] = b can mean one of two things:
- If b is an AbstractArray (multi-value), assign the values it contains to those locations I in A. That is, essentially, [A[i] = bᵢ for (i, bᵢ) in zip(I, b)].
- If b is not an AbstractArray (scalar-value), then broadcast its value to every location selected by I in A.

These two different behaviors in the non-scalar indexing case basically make using this function in a generic way impossible. If you want to always set the value b to many indices of A, you cannot use this syntax because b might happen to be an array in some cases, radically changing the meaning of the expression. You need to manually iterate over the indices, using scalar setindex methods. But we now also have the new broadcast! syntax, A[I] .= B, which uses the usual broadcasting semantics to determine how B should fill into the values of A.

This PR deprecates the implicit broadcasting of scalar values in non-scalar indexing in favor of an explicit .= broadcast syntax, leaving multi-value non-scalar indexing intact. This is the exact opposite of PR #24368.

There are still a few things left to do here, making this still a WIP:

[-] ~~Make setindex_shape_check more stringent~~ To be done in a separate PR. Or maybe not.
Examine cases where the deprecation message — as suggested — did not work and try to find less-verbose alternatives than an explicit for loop. Edit: This is much improved now that .= nothing works.
Re-implement the old setindex! methods as either fill! or broadcast! specializations (or both — having broadcast fall back to fill for Scalar() styles)
Check performance

mbauman · 2018-03-13T21:30:56Z

Alright, so I hit a fairly major performance snag that is inherent to either of the suggested replacement APIs (fill!(view(A, I...), value) or A[I...] .= value). BitArray had been specializing setindex!(A::BitArray, value, I::BitArray) to operate chunk-wise, 64 bits at a time. Unfortunately, both fill! and broadcast! require constructing a SubArray to encapsulate the indices, and SubArrays require collecting the indices selected by logical masks in order to maintain O(1) indexing. This means that we have a ~100x perf regression in this specific case.

@oschulz had suggested a fill!(A, value, I...) method, but that feels a little strange to me since no indices means filling the whole array. Is this use-case (and deprecation message) worth exporting a whole new function? fillindices!?

JeffBezanson · 2018-03-13T22:07:37Z

Maybe we can somehow specialize dotview for BitArrays?

oschulz · 2018-03-15T17:01:11Z

@oschulz had suggested a fill!(A, value, I...) method, but that feels a little strange to me since no indices means filling the whole array.

I think fill! would come quite naturally, and be a good fit, semantically - as long as there are no problems with dispatch, of course. copyto!, for example, also operates on the whole array, unless additional parameters are given.

mbauman · 2018-03-15T17:53:25Z

That's a compelling argument for fill!.

I'd love to implement a special MaskedBitArray view — it'd be super fast on vectorized functions. But accessing single elements would be pretty darn slow. It would compound the general iterative slowness of BitArrays with an O(~N) lookup. It's slightly improved because we can do a bunch of popcnts to get to the right chunk, but I think that's still too slow for general purpose use.

mbauman · 2018-04-12T04:29:35Z

Good news! The new optimizer has fixed my performance issues! Running BaseBenchmarks locally with JuliaCI/BaseBenchmarks.jl#195 applied leaves me with just two regressions (with default tuning settings) at the moment:

julia> pairs = leaves(regs)
2-element Array{Any,1}:
 (Any["linalg", "blas", "axpy!"], TrialJudgement(+43.17% => regression))
 (Any["problem", "laplacian", "laplace_sparse_matvec"], TrialJudgement(+74.37% => regression))

The other issue here is that I have an ambiguity problem with the BitMaskedBitArray. That one is easier to fix on #25377, but I'll try to find an amenable solution here (in isolation), too.

The current `setindex!` function (the `A[i] = b` syntax) does three very different operations: * Given an index `i` that refers to only one location (scalar indexing), `A[i] = b` modifies `A` in that location to hold `b`. * Given an index `I` that refers to more than one location (non-scalar indexing), `A[I] = b` can mean one of two things: * If `b` is an AbstractArray (multi-value), assign the values it contains to those locations `I` in `A`. That is, essentially, `[A[i] = bᵢ for (i, bᵢ) in zip(I, b)]`. * If `b` is not an AbstractArray (scalar-value), then broadcast its value to every location selected by `I` in `A`. These two different behaviors in the non-scalar indexing case basically make using this function in a generic way impossible. If you want to _always_ set the value `b` to many indices of `A`, you _cannot_ use this syntax because `b` might happen to be an array in some cases, radically changing the meaning of the expression. You need to manually iterate over the indices, using scalar setindex methods. But we now also have the new `broadcast!` syntax, `A[I] .= B`, which uses the usual broadcasting semantics to determine how `B` should fill into the values of `A`. This PR deprecates the implicit broadcasting of scalar values in non-scalar indexing in favor of an explicit `.=` broadcast syntax, leaving multi-value non-scalar indexing intact. This is the _exact opposite_ of PR #24368.

mbauman · 2018-04-26T20:13:52Z

@nanosoldier runbenchmarks(ALL, vs = ":master")

nanosoldier · 2018-04-27T01:50:20Z

Your benchmark job has completed - possible performance regressions were detected. A full report can be found here. cc @ararslan

mbauman · 2018-04-27T15:47:31Z

Well rebasing this on top of #26891 solved multiple issues. The performance is looking pretty good (I had been seeing 10x regressions before), and I think it's technically no longer breaking.

That is, the deprecations will use the correct replacement, but we're splitting a dual behavior into two discrete APIs. If you depend upon this dual behavior, then neither of the static replacements will be correct and replacing it will be more challenging… and we cannot know at the time of the depwarn what other call sites might exist.

mbauman · 2018-05-01T17:13:44Z

I'll merge this tomorrow barring any further comments here.

oschulz · 2018-05-02T08:41:55Z

Thanks again, @mbauman !

mbauman · 2018-05-02T20:10:10Z

Travis failures were: Linux x86_64 failed due to some httpbin.org issue; macOS was a LLVM build issue.

Appveyor was the httpbin.org issue and a timeout.

mschauer · 2018-05-02T22:59:17Z

julia/base/indices.jl

Line 158 in b5d4f3c

setindex_shape_check(X, I...) = nothing # Non-arrays broadcast to all idxs

looks like it should not exist now.

mbauman added domain:arrays [a, r, r, a, y, s] kind:deprecation This change introduces or involves a deprecation kind:breaking This change will break code status:needs news A NEWS entry is required for this change labels Mar 7, 2018

JeffBezanson added this to the 1.0 milestone Mar 7, 2018

JeffBezanson added the status:triage This should be discussed on a triage call label Mar 8, 2018

JeffBezanson mentioned this pull request Mar 8, 2018

RFC: Deprecate multi-value non-scalar indexed assignment #24368

Closed

JeffBezanson removed the status:triage This should be discussed on a triage call label Mar 8, 2018

mbauman force-pushed the mb/deprecatesinglevaluebroadcastingwithinsetindex branch from e472a09 to a5a8fd6 Compare March 13, 2018 20:47

mbauman mentioned this pull request Mar 22, 2018

Deprecated syntax using the value of .= in the REPL. #26516

Closed

mbauman force-pushed the mb/deprecatesinglevaluebroadcastingwithinsetindex branch from 7cfbfcd to ae186be Compare March 27, 2018 22:55

mbauman mentioned this pull request Apr 4, 2018

Revert #190 and anticipate deprecating implicit scalar broadcasting in setindex! JuliaCI/BaseBenchmarks.jl#195

Merged

mbauman force-pushed the mb/deprecatesinglevaluebroadcastingwithinsetindex branch from ae186be to b2c5372 Compare April 11, 2018 23:13

mbauman force-pushed the mb/deprecatesinglevaluebroadcastingwithinsetindex branch from b2c5372 to 8ecfdb4 Compare April 26, 2018 20:11

mbauman changed the title ~~WIP: Deprecate implicit scalar broadcasting in setindex!~~ RFC: Deprecate implicit scalar broadcasting in setindex! Apr 27, 2018

Add NEWS.md

4a9c1ff

mbauman removed status:needs news A NEWS entry is required for this change kind:breaking This change will break code labels Apr 27, 2018

mbauman merged commit 7e2ce0e into master May 2, 2018

mbauman deleted the mb/deprecatesinglevaluebroadcastingwithinsetindex branch May 2, 2018 20:11

mbauman mentioned this pull request May 2, 2018

What to do about multivalue setindex!? #24086

Open

mbauman mentioned this pull request May 3, 2018

Move scalar broadcast setindex_shape_check method to deprecated #26967

Merged

martinholters mentioned this pull request May 7, 2018

sparse setindex! with boolean indices broken #27013

Closed

Tchanders mentioned this pull request Jul 26, 2020

GRN inference error when getting nodes. Tchanders/InformationMeasures.jl#29

Closed

dkarrasch mentioned this pull request Oct 14, 2020

Fix sparse Subarray fill! #38021

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RFC: Deprecate implicit scalar broadcasting in setindex! #26347

RFC: Deprecate implicit scalar broadcasting in setindex! #26347

mbauman commented Mar 7, 2018 •

edited

mbauman commented Mar 13, 2018

JeffBezanson commented Mar 13, 2018

oschulz commented Mar 15, 2018

mbauman commented Mar 15, 2018 •

edited

mbauman commented Apr 12, 2018

mbauman commented Apr 26, 2018

nanosoldier commented Apr 27, 2018

mbauman commented Apr 27, 2018

mbauman commented May 1, 2018

oschulz commented May 2, 2018

mbauman commented May 2, 2018

mschauer commented May 2, 2018

RFC: Deprecate implicit scalar broadcasting in setindex! #26347

RFC: Deprecate implicit scalar broadcasting in setindex! #26347

Conversation

mbauman commented Mar 7, 2018 • edited

mbauman commented Mar 13, 2018

JeffBezanson commented Mar 13, 2018

oschulz commented Mar 15, 2018

mbauman commented Mar 15, 2018 • edited

mbauman commented Apr 12, 2018

mbauman commented Apr 26, 2018

nanosoldier commented Apr 27, 2018

mbauman commented Apr 27, 2018

mbauman commented May 1, 2018

oschulz commented May 2, 2018

mbauman commented May 2, 2018

mschauer commented May 2, 2018

mbauman commented Mar 7, 2018 •

edited

mbauman commented Mar 15, 2018 •

edited