RFC: Drop dimensions indexed by scalars #13612

mbauman · 2015-10-14T18:39:55Z

This is the first step towards APL indexing semantics. It does not yet allow indexing with multiple multi-dimensional arrays (the new feature with the catchy "rank of the result is the sum of the rank of the indices" semantic); it simply drops all scalar dimensions (a major breaking change).

There is an interesting interaction between these array semantics and transposes that I didn't fully appreciate until looking at this more in-depth. The tricky part about APL indexing is that it loses the orientation of row vectors, and so we can no longer define complex matrix multiplication in terms of slices directly:

This makes A[1,:] * B[:,1] an illegal (n,) × (n,) operation.
Similarly, dot(A[1,:], B[:,1]) and A[1,:]' * B[:,1] erroneously conjugate the row.

Working with complex row slices becomes a little dicier. When do you conjugate? Or (c)transpose?

Fortunately, the behavior here is a strict superset of our current indexing rules. You can explicitly request that the shapes of the slices be maintained in both dimensions: A[1:1,:] * B[:,1:1] if you wish to use the result as a linear algebra construct.

mbauman · 2015-10-14T19:59:32Z

AppVeyor failure looks like #9572.

IainNZ · 2015-10-15T01:44:28Z

base/linalg/diagonal.jl

@@ -121,7 +121,10 @@ function A_ldiv_B!{T}(D::Diagonal{T}, V::AbstractMatrix{T})
        if d == zero(T)
            throw(SingularException(i))
        end
-        V[i,:] *= inv(d)
+        d⁻¹ = inv(d)


andreasnoack · 2015-11-04T14:15:40Z

@mbauman Would you rebase this one. I think we should consider merging this soon such that people will start using it.

mbauman · 2015-11-04T19:43:19Z

Done.

mschauer · 2015-11-05T13:06:32Z

Thanks for keeping it rolling here

andreasnoack · 2015-11-05T14:00:40Z

Okay to merge this?

mbauman · 2015-11-05T14:03:26Z

Sure, let's give it a shot.

timholy · 2015-11-06T09:59:47Z

Sorry I've not had time to review this. Looks fine to me. As you noted, it depends on a lot more splatting (esp in checksize), but all that is O(1) and not O(N) in the array elements. So I'm fine with seeing how it goes (and it might be further incentive to fix the splatting penalty), though if you've run any benchmarks it might be interesting to learn more about any performance hits.

RFC: Drop dimensions indexed by scalars

yuyichao · 2015-11-16T01:06:49Z

Should we also drop the scalar dimension for SubArray (especially if we'd like to return SubArray as the result of indexing).

This breaks the equivalence between getindex and sub

julia> a = rand(2, 2)
2x2 Array{Float64,2}:
 0.377862   0.85667 
 0.0177678  0.695366

julia> a[2, 1:2] == sub(a, 2, 1:2)
false

timholy · 2015-11-16T01:54:09Z

slice already does that---essentially, now getindex is equivalent to slice rather than sub.

mschauer · 2016-01-07T14:15:05Z

Are there plans to adapt mapslices to slice rather than to sub?

In JuliaLang#13612 I converted checksize from a generated function to a recursive lispy definition, but the methods are too complicated to be automatically inlined. Manually adding the inline annotation fixes this performance regression JuliaLang#14594. Master is now faster than 0.4.0 on most of the array perf tests.

andreasnoack · 2016-01-09T17:10:20Z

I don't think it has been discussed. Can you explain further?

mschauer · 2016-01-09T19:08:03Z

Currently, mapslices and special cased reductions over dimensions keep the reduced dimension as singleton dimension. As the indexing behavior now changed I thought mapslices and friends should follow soonish. This would mean changing the standard behavior of mapslices to dimension dropping as hinted at by

julia/base/abstractarray.jl

Line 1217 in dc5c974

# TODO: maybe support removing dimensions

timholy · 2016-01-09T22:35:02Z

I'm not sure I agree that those two design decisions are coupled. Meaning, I don't think it's inconsistent for getindex to drop dimensions but reductions to retain them: they are different function calls, and reductions are obviously different from indexing.

It's so common to say

Xnorm = X ./ maximum(X, 2)

that it would be a shame to make that any harder. (And dropping dimensions definitely does make that harder.) When you want to drop the dimensions, it's easier than ever to do so.

StefanKarpinski · 2016-01-11T02:39:23Z

I agree – I would prefer that reductions keep reduced dimensions for exactly this reason.

mschauer · 2016-01-11T10:58:34Z

I kind of agree for the monadic case, for a dyadic mapslices function the current scheme does not translate well. I now recall why I noted this, I have even RFC #13317 but I forgot about it. Maybe someone could comment there? Some feedback would be appreciated.

stevengj · 2016-08-02T02:45:29Z

I think this should be moved to the "Breaking Changes" part of the NEWS file. This just caused silently incorrect results in one of my packages (Hadamard.jl)

ref #13612 (comment)

that have non-Int indices, introduced by #13612 and which has caused `make -C test/perf` to fail for the last 9 months searchsortedfirst does not have methods for general Integer indices

davidavdav · 2016-08-19T10:07:32Z

Hello,

For NamedArrays I need to copy the exact behaviour of the dropping of dimensions (for julia-0.4 and 0.5). The AbstractArray contained in a n::NamedArray is sliced by calling getindex(n.array, index...)---so there I get the same behaviour automatically. But for the names of the dimensions in n, I need to know exactly which types of index... I need to keep.

Is there a Base function that will tell me this? I was looking at Base.index_shape but that isn't defined for all index types (e.g., CartesianIndex). Currently I use a helper function

dimkeepingtype(x) = false
dimkeepingtype(x::Vector) = true
dimkeepingtype(x::Range) = true
dimkeepingtype(x::BitVector) = true

that tells me if an index type x (∈ index) is non-scalar or not. I am not sure this covers all index types in the same way that AbstractArray keeps dimensions, so I'd rather refer similar functionality in Base.

timholy · 2016-08-19T17:07:23Z

Can't you add the missing index_shape_dim method(s) to NamedArrays? For julia-0.5 we could add such a method to Base, though---probably a good idea.

that have non-Int indices, introduced by #13612 and which has caused `make -C test/perf` to fail for the last 9 months searchsortedfirst does not have methods for general Integer indices (cherry picked from commit b1d5321) ref #18022

davidavdav · 2016-08-20T09:11:48Z

Thanks. But I do not really understand the logic in index_shape_dim---it seems a recursive helper function for index_shape. It doesn't look like the way towards telling me which dimensions are kept in slices. I need these to know which dimension names I need to select, and from which dimensions I have to slice the names of the indices in the NamedArray.

Somewhat related, I saw that in Julia-0.5 the dimension of a sliced array is the sum of the dimensions of the indices. That is great, but I don't really know yet how to come up with dimension and index names, with cases like NamedArray[Array] and NamedArray[NamedArray]. Does anybody know of a precedence for this case? I only know of R named arrays, but R doesn't have this fancy indexing scheme.

mbauman · 2016-08-21T04:12:09Z

I don't know about precedence in other languages, but perhaps you can take some inspiration from AxisArrays:

AxisArray[Array1, Array2] returns an AxisArray with axis names row_1, row_2, …, row_$(ndims(Array1)), and col_1, col_2, …, col_$(ndims(Array2)). The left-hand side of the _ comes from the names of the parent AxisArray.
AxisArray[AxisArray] is similar, except instead of using numbers on the right-hand side of the _, it uses the axis names of the index. This is where the axis names time_sub and time_rep come from on the README example.

davidavdav · 2016-08-21T11:50:22Z

Thanks, that is indeed a useful way of naming the dimensions.

For the names of the indexes along the dimensions, though, things are more complicated. I think I will opt for copying the names of the indexes of the indexing array (if this has ndims() > 1), or defaulting to 1..size(index) if the indexing array itself doesn't have names.

Further challenges will occur when indexing a NamedArray with an AxisArray or vice versa. I don't even want to think about this at this moment...

ref JuliaLang#13612 (comment)

that have non-Int indices, introduced by JuliaLang#13612 and which has caused `make -C test/perf` to fail for the last 9 months searchsortedfirst does not have methods for general Integer indices

mbauman added kind:breaking This change will break code domain:linear algebra Linear algebra labels Oct 14, 2015

IainNZ reviewed Oct 15, 2015
View reviewed changes

This was referenced Oct 19, 2015

Arraypocalypse Now and Then #13157

Closed

Taking vector transposes seriously #4774

Closed

mbauman force-pushed the mb/drop-bear branch from 037bb58 to 3396010 Compare November 4, 2015 19:42

mbauman force-pushed the mb/drop-bear branch from 3396010 to 6817fc5 Compare November 4, 2015 21:19

mbauman added 2 commits November 4, 2015 20:31

Drop dimensions indexed by scalars

4d3abff

Doc and news for dropping scalar dimensions

3709e54

mbauman force-pushed the mb/drop-bear branch from 6817fc5 to 3709e54 Compare November 5, 2015 01:32

andreasnoack added a commit that referenced this pull request Nov 9, 2015

Merge pull request #13612 from JuliaLang/mb/drop-bear

12dbaab

RFC: Drop dimensions indexed by scalars

andreasnoack merged commit 12dbaab into master Nov 9, 2015

tkelman deleted the mb/drop-bear branch November 10, 2015 14:06

jiahao mentioned this pull request Nov 10, 2015

Fix harmonic restart svdl on master JuliaLinearAlgebra/IterativeSolvers.jl#59

Merged

yuyichao mentioned this pull request Nov 16, 2015

Fix test for array indexing breakage JuliaArrays/ArrayViews.jl#40

Merged

c42f mentioned this pull request Dec 26, 2015

Arraypocalypse slicing compatibility JuliaLang/Compat.jl#158

Closed

mbauman mentioned this pull request Jan 8, 2016

Ensure checksize inlines #14609

Merged

Jutho mentioned this pull request Mar 25, 2016

julia resizing to n-1 dimension array in singleton dimensions #15621

Closed

andreasnoack mentioned this pull request May 11, 2016

Row extraction does not work as expected anymore #16317

Closed

mschauer mentioned this pull request May 27, 2016

array reductions (sum, mean, etc.) and dropping dimensions #16606

Open

simonbyrne mentioned this pull request Jun 9, 2016

Change sub behaviour to match getindex, possibly rename. #16846

Closed

blakejohnson mentioned this pull request Jun 28, 2016

Handling new indexing behavior in Julia 0.5 lindahua/Devectorize.jl#51

Open

tkelman mentioned this pull request Aug 2, 2016

NEWS: document command-line option changes #17740

Closed

tkelman added a commit that referenced this pull request Aug 3, 2016

NEWS: Move scalar index dropping to breaking changes

6d72687

ref #13612 (comment)

tkelman added a commit that referenced this pull request Aug 3, 2016

NEWS: Move scalar index dropping to breaking changes

282c3e8

ref #13612 (comment)

tkelman mentioned this pull request Aug 14, 2016

Fix a row-indexing bug with sparse matrices that have non-Int indices #18022

Merged

mfasi pushed a commit to mfasi/julia that referenced this pull request Sep 5, 2016

NEWS: Move scalar index dropping to breaking changes

6ba4922

ref JuliaLang#13612 (comment)

andreasnoack mentioned this pull request Oct 5, 2016

Row vector issue in v0.5 #18805

Closed

floswald mentioned this pull request Oct 12, 2016

findmax does not squeeze out singleton index #18884

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RFC: Drop dimensions indexed by scalars #13612

RFC: Drop dimensions indexed by scalars #13612

mbauman commented Oct 14, 2015

mbauman commented Oct 14, 2015

IainNZ Oct 15, 2015

malmaud Oct 15, 2015

andreasnoack commented Nov 4, 2015

mbauman commented Nov 4, 2015

mschauer commented Nov 5, 2015

andreasnoack commented Nov 5, 2015

mbauman commented Nov 5, 2015

timholy commented Nov 6, 2015

yuyichao commented Nov 16, 2015

timholy commented Nov 16, 2015

mschauer commented Jan 7, 2016

andreasnoack commented Jan 9, 2016

mschauer commented Jan 9, 2016

timholy commented Jan 9, 2016

StefanKarpinski commented Jan 11, 2016

mschauer commented Jan 11, 2016

stevengj commented Aug 2, 2016

davidavdav commented Aug 19, 2016

timholy commented Aug 19, 2016

davidavdav commented Aug 20, 2016

mbauman commented Aug 21, 2016 •

edited

Loading

davidavdav commented Aug 21, 2016

RFC: Drop dimensions indexed by scalars #13612

RFC: Drop dimensions indexed by scalars #13612

Conversation

mbauman commented Oct 14, 2015

mbauman commented Oct 14, 2015

IainNZ Oct 15, 2015

Choose a reason for hiding this comment

malmaud Oct 15, 2015

Choose a reason for hiding this comment

andreasnoack commented Nov 4, 2015

mbauman commented Nov 4, 2015

mschauer commented Nov 5, 2015

andreasnoack commented Nov 5, 2015

mbauman commented Nov 5, 2015

timholy commented Nov 6, 2015

yuyichao commented Nov 16, 2015

timholy commented Nov 16, 2015

mschauer commented Jan 7, 2016

andreasnoack commented Jan 9, 2016

mschauer commented Jan 9, 2016

timholy commented Jan 9, 2016

StefanKarpinski commented Jan 11, 2016

mschauer commented Jan 11, 2016

stevengj commented Aug 2, 2016

davidavdav commented Aug 19, 2016

timholy commented Aug 19, 2016

davidavdav commented Aug 20, 2016

mbauman commented Aug 21, 2016 • edited Loading

davidavdav commented Aug 21, 2016

mbauman commented Aug 21, 2016 •

edited

Loading