Add slicing functionality #165

dkarrasch · 2021-12-02T13:47:03Z

This is an alternative approach to #145, without adding yet another LinearMap subtype. As it stands now, there is no switch for turning indexing/slicing on or off. I was hoping one could have a submodule and "hide" it from the user unless they ask for it via using LinearMaps.GetIndex, but that doesn't seem to work as easily as I was hoping. OTOH, we might as well trust our users to be cautious about indexing/slicing and not put additional stones in their way.

This may require a few more tests to keep coverage high, some announcement in the docs, and some information on how to extend that functionality to custom map types.

Closes #145, closes #38.

codecov · 2021-12-02T17:12:52Z

Codecov Report

Merging #165 (93270b6) into master (7363170) will increase coverage by 0.02%.
The diff coverage is 100.00%.

@@            Coverage Diff             @@
##           master     #165      +/-   ##
==========================================
+ Coverage   99.63%   99.65%   +0.02%     
==========================================
  Files          14       15       +1     
  Lines        1089     1167      +78     
==========================================
+ Hits         1085     1163      +78     
  Misses          4        4

Impacted Files	Coverage Δ
src/LinearMaps.jl	`100.00% <ø> (ø)`
src/getindex.jl	`100.00% <100.00%> (ø)`

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data

codecov · 2022-01-25T10:08:18Z

Codecov Report

Merging #165 (d378b04) into master (12c13fb) will increase coverage by 0.06%.
The diff coverage is 100.00%.

@@            Coverage Diff             @@
##           master     #165      +/-   ##
==========================================
+ Coverage   98.54%   98.61%   +0.06%     
==========================================
  Files          15       16       +1     
  Lines        1307     1372      +65     
==========================================
+ Hits         1288     1353      +65     
  Misses         19       19

Impacted Files	Coverage Δ
src/LinearMaps.jl	`100.00% <ø> (ø)`
src/getindex.jl	`100.00% <100.00%> (ø)`

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data

dkarrasch · 2022-06-03T09:18:23Z

Anyone interested in indexing and slicing LinearMaps? 😛 @oschulz

@JeffFessler: if I forward indexing for LinearMapsA* to the wrapped LinearMap, LinearMapsAA.jl's tests pass for me locally. Could you please double-check?

If this looks good, we could get this in v3.7 soon, otherwise I'd go and make a release with the other currently merged PRs.

oschulz · 2022-06-03T09:25:37Z

> Anyone interested in indexing and slicing LinearMaps?

Very much so! With this, could we do LinearMap <: AbstractMatrix (#180) as well, to help with dispatch for "non-LinearMap-aware" code?

dkarrasch · 2022-06-03T10:30:24Z

I'll comment on that in the other issue. For the time being, I'd be interested in getting feedback on the implementation. ;-)

oschulz · 2022-06-03T11:22:46Z

Ok ;-)

JeffFessler · 2022-06-03T13:49:26Z

Wow, this is a very comprehensive implementation with several features I didn't know about like Base.Slice and diagind (that therefore are not supported in LinearMapsAA).

I'm not exactly sure how to test this independently.

When you say "forward indexing" do you mean something like this?
Base.getindex(A::LinearMapAX, args...) = Base.getindex(A._lmap, args...)

src/getindex.jl

dkarrasch · 2022-06-03T15:03:32Z

> When you say "forward indexing" do you mean something like this?
> Base.getindex(A::LinearMapAX, args...) = Base.getindex(A._lmap, args...)

Exactly.

JeffFessler · 2022-06-05T19:33:40Z

> Personally, I would prefer that anything throws that's "inefficient by construction", so to speak. My suggestion would be any single-element/scalar getindex, as well as row-access in the no-adjoint case.

I think I agree, as long as "scalar" means A[i,j] or A[k] - those are the cases that need warnings (if any).
I consider A[:,j] and probably even A[:,J] to be fine (i.e., no warning needed). Let's also keep in mind that Matrix(A) is same as A[:,:] and there has never been any warning for that case and I'd prefer to retain that behavior.
The general case of A[I,J] also reverts to scalar so also could warn.

oschulz · 2022-06-06T08:34:11Z

> I think I agree, as long as "scalar" means A[i,j] or A[k]

Yes, that's what I meant.

> I consider A[:,j] and probably even A[:,J] to be fine (i.e., no warning needed).

I fully agree.

> that Matrix(A) is same as A[:,:] and there has never been any warning

Absolutely - that's an efficient operation, after all.

My intuition would be (with scalar indices i, j and array/range indices I, J) to:

- Allow A[:,j], A[:,J], A[I,J] and A[:,:] in general.
- Only allow A[i,:] for maps that support adjoint (need to be careful with complex numbers).
- Disallow A[i,j] in general.

A[I,J] I would allow in general, I think, to support use cases like A[begin+1:end-1,begin+1:end-1] and similar, which would typically still be very efficient.
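
To make the cost model behind this proposal concrete, here is a minimal sketch in plain Julia. It is not the PR's implementation: `ToyMap`, `column` and `row` are hypothetical names, and the sketch assumes a map that is known only through a forward function and, optionally, a function for its adjoint. A column costs one application of the map, a row costs one application of the adjoint (plus a conjugation, which is where complex numbers need care), and without an adjoint there is no cheap way to get a row.

```julia
using LinearAlgebra

# Hypothetical toy type (not LinearMaps.jl's API): a linear map known only
# through its action `f` and, optionally, the action `fc` of its adjoint.
struct ToyMap{F,G}
    f::F      # x -> A*x
    fc::G     # y -> A'*y, or `nothing` if no adjoint is available
    m::Int    # number of rows
    n::Int    # number of columns
end

# A[:, j]: one application of the map to the j-th unit vector.
function column(A::ToyMap, j::Integer)
    e = zeros(A.n)
    e[j] = 1
    return A.f(e)
end

# A[i, :]: one application of the adjoint to the i-th unit vector,
# conjugated because A[i, k] == conj((A' * e_i)[k]).
function row(A::ToyMap, i::Integer)
    A.fc === nothing && error("row slicing needs an adjoint")
    e = zeros(A.m)
    e[i] = 1
    return conj(A.fc(e))
end

# A small complex matrix serves as ground truth for the toy map.
M = [1.0+2.0im 3.0; 4.0 5.0-1.0im; 0.0 2.0im]
A = ToyMap(x -> M * x, y -> M' * y, 3, 2)

@show column(A, 1) == M[:, 1]
@show row(A, 2) == M[2, :]
```

A scalar entry A[i,j] would need the same single map application as the whole column A[:,j], which is the sense in which it is "inefficient by construction".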

dkarrasch · 2022-06-09T13:14:02Z

The latest commit largely implements the suggestions (thanks for the clear roadmap!). What I currently have throws an error on A[i,j] and suggests using A[:,j][i] if one really needs to. Question: shall we do the analogous thing for A[I,j], i.e., error and suggest A[:,j][I]? It allocates the unit vector, the target vector, and then yet another vector after slicing the target. The horizontal slicing behaviour should perhaps be consistent with regard to throwing or working. I wonder what rationale exactly applies to allowing A[I,J] but not A[i,j]. Is it that when we compute entire columns (or rows, whichever requires fewer map applications) and then slice by some vector, "computing the entire row or column was more worth it" than for just a single component? For the same reason as with A[I,j], A[begin+1:end-1,begin+1:end-1] may not be more efficient than convert(Matrix, A)[begin+1:end-1,begin+1:end-1], especially after #173.

oschulz · 2022-06-09T13:34:40Z

> The latest commit largely implements the suggestions (thanks for the clear roadmap!).

Thanks a lot for doing all of this!

> What I currently have throws an error on A[i,j] and suggests using A[:,j][i] if one really needs to.

Oh, that's a nice approach I think!

> Question: shall we do the analogous thing for A[I,j], i.e., error and suggest A[:,j][I]? ... I wonder what rationale exactly applies to allowing A[I,J] but not A[i,j]

My intuition was that code which iterates over A[I,j] with short ranges of I is comparatively rare, while stuff like A[begin+1:end-1,j] is not uncommon (getting the inner part of things that were padded to fold with kernels and so on). The same goes for A[I,J].

> A[begin+1:end-1,begin+1:end-1] may not be more efficient than convert(Matrix, A)[begin+1:end-1,begin+1:end-1]

Yes, I think so too. I was assuming that the user code wouldn't necessarily want to have the actual matrix in hand. Maybe it would in fact be best if A[I,J] did not evaluate anything but returned something like a view? But the implementation of A[begin+1:end-1,begin+1:end-1] * x wouldn't be trivial: one would have to pad x first, do the mul, and then do a getindex on the result.

Maybe it's best if we disallow A[I,j], A[i,J] and A[I,J] for now (error with a suggestion of alternative code, as you already have) until we come across valid use cases?

dkarrasch · 2022-06-09T13:55:29Z

> Maybe it's best if we disallow A[I,j], A[i,J] and A[I,J] for now (error with a suggestion of alternative code, as you already have) until we come across valid use cases?

Yes, I guess that's much better than first allowing it and then handling potential "Why is XYZ so slow?" issues... or even disallowing it later. I think the main point that the user needs to understand is that these cases correspond to first computing entire slices and then slicing again. If users have to write that out, they will naturally think twice about whether that's really what they want or whether they were writing it just out of convenience.
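
Written out by hand, a partial slice such as A[I,j] looks roughly like the following sketch. It assumes a hypothetical matrix-free map `applymap` (here `cumsum`, which acts like a lower-triangular matrix of ones) and made-up sizes; it only illustrates the three allocations mentioned earlier in the thread and is not code from the PR.

```julia
# Hypothetical stand-in for a matrix-free map: only its action is available.
# cumsum acts like multiplication by a lower-triangular matrix of ones.
applymap(x) = cumsum(x)

n = 6          # size of the (square) map, chosen for illustration
j = 2          # requested column
rows = 2:5     # requested part of that column

e = zeros(n)           # allocation 1: the unit vector
e[j] = 1
col = applymap(e)      # allocation 2: the entire column A[:, j]
part = col[rows]       # allocation 3: the requested part A[rows, j]

@show col
@show part
```

The map is applied to a full-length vector no matter how short `rows` is, so the cost of the partial slice equals the cost of the complete column plus one extra copy.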

JeffFessler · 2022-06-09T14:01:52Z

Makes sense.
Then probably A[:,:] also should throw for now, whereas Matrix(A) continues to work silently (and possibly slowly)?

dkarrasch · 2022-06-09T14:04:05Z

Ha, that will make a fantastic PR: basically add a whole bunch of methods that throw! Progress! Yippieh! 🤣

oschulz · 2022-06-09T15:22:15Z

> Then probably A[:,:] also should throw for now,

I think A[:,:] should be allowed. It's very efficient, and it would be surprising for the user if A[:,i] is allowed but A[:,:] isn't.

JeffFessler · 2022-06-09T15:38:55Z

> It's very efficient

I guess that is in the eye of the beholder. If A is the N-point DFT implemented via an FFT (a typical use case for me), then A*x is $O(N \log N)$ whereas A[:,:] is $O(N^2 \log N)$ which is more expensive than the $O(N^2)$ needed to simply construct the matrix in the first place! So my view is that it should be discouraged a bit (if we are also discouraging A[I,J]), but I don't feel too strongly about it.
Perhaps ideally there would be a user switch (like with GPU scalars) but it's not a high enough priority for me to want to try to figure that out.

oschulz · 2022-06-09T15:44:43Z

> then A*x is $O(N \log N)$ whereas A[:,:] is $O(N^2 \log N)$

Yes, that's what I mean - it scales proportionally to the number of outputs that you want to produce, both for A[:,i] and A[:,:]. An overhead of log(N) is typically still considered quite efficient in numerics, after all.

I would put it like this: A[:,i] is one mul for an output vector with N numbers - that's definitely Ok, it's the bare performance of the linear map. And A[:,:] is N muls for an output matrix of N^2 numbers. That's very Ok too, I would say.
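
The counting argument in the last two comments can be checked with a small sketch. It assumes a hypothetical matrix-free map `applymap` (here just `reverse`, standing in for an FFT-like operation) and a hypothetical helper `materialize`; neither is part of LinearMaps.jl. Building all N columns takes exactly N applications of the map, so the total cost is N times the cost of one application, whichever way one judges that number.

```julia
# Counter for how often the map is applied (illustration only).
const napplications = Ref(0)

# Hypothetical matrix-free map; `reverse` stands in for something like an FFT.
function applymap(x)
    napplications[] += 1
    return reverse(x)
end

# Build the dense n-by-n matrix of `f` column by column, reusing one unit vector.
function materialize(f, n)
    out = zeros(n, n)
    e = zeros(n)
    for j in 1:n
        e[j] = 1
        out[:, j] = f(e)   # one map application per column
        e[j] = 0
    end
    return out
end

N = 5
out = materialize(applymap, N)
@show napplications[]
```

With a per-application cost of $O(N \log N)$ this gives the $O(N^2 \log N)$ total quoted above for $N^2$ output entries.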

dkarrasch · 2022-06-09T19:49:07Z

For higher-dimensional arrays, I guess it is not completely uncommon to put colons in two dimensions. For "matrices", who would construct a (deep) copy via A[:,:], when you have Matrix(A)?

BTW, Matrix(A) was never meant to be "efficient". It was just introduced so that you can provide a matrix if you really need to (for instance, for stubborn downstream packages 😝). One often repeated argument by @Jutho is that, for applications that require indexing of many different components, it may be beneficial to pay the full (matrix construction) price once rather than to potentially incur an unbounded number of allocations etc. For such cases, we have the Matrix(A) and sparse(A) constructors. So, not "efficient", but perhaps "better than". For that reason, Matrix(A) was never viewed in terms of indexing/slicing. Now, with this PR, these two things (matrix construction and slicing) touch each other via A[:,:], and one may take two perspectives: (a) construction of a matrix, and (b) double-slicing. From perspective (a), "efficiency" doesn't play a role, it's just supposed to be "better than"; from perspective (b), this is an awful operation and should be warned about, at the very least. Obviously, it's just a matter of taste, but I tend to prefer path (b). Note that generic code could use convert(AbstractMatrix, A), which is a noop for AbstractMatrixes.
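
The no-op behaviour mentioned in the last sentence can be seen with Base and LinearAlgebra types alone. This is only a sketch of the Base behaviour for objects that already are matrices; what `convert(AbstractMatrix, A)` does for a LinearMap is not shown here.

```julia
using LinearAlgebra

M = [1.0 2.0; 3.0 4.0]
D = Diagonal([1.0, 2.0])

# For something that already is an AbstractMatrix, convert returns the very
# same object: no copy is made and the (possibly structured) type is kept.
@show convert(AbstractMatrix, M) === M
@show convert(AbstractMatrix, D) === D
```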

oschulz · 2022-06-10T01:14:25Z

> For "matrices", who would construct a (deep) copy via A[:,:], when you have Matrix(A)

It's not quite the same in general, because for normal arrays A[:,:] would typically preserve the array type. For LinearMaps it won't matter much which is used, I guess.

My argument for allowing A[:,:] is more that we can easily do it and that allowing A[:,i] (which we want) makes forbidding A[:,:] (which has the same relative efficiency) surprising for the user.

dkarrasch · 2022-06-10T08:40:14Z

Ok, the rationale of allowing "any complete slicing" (as opposed to partial slicing, which requires more allocations than just the unavoidable unit vector and the result) sounds convincing to me.

As I said, it's borderline and a matter of taste: from the point of view of matrix construction it's clear that this is expensive (and could deserve a warning), from the point of view of slicing it is "expectedly expensive" (and could therefore be allowed). I'll see if I need to adjust the announcement. Perhaps, there should also be a section in the types.md docs, that repeats considerations and lays out the rationale of what is allowed and what not.

dkarrasch · 2022-06-10T09:23:51Z

Uff, I think I finally understood the macro stuff! 🎉

What shall we do then? Allow slicing in general without interference of macros and allow scalar indexing/partial slicing upon a macro signal, or allow any slicing/indexing only upon macro signal? Having two macros, one for slicing and one for indexing and partial slicing is perhaps overkill.

oschulz · 2022-06-10T10:07:09Z

> What shall we do then? Allow slicing in general without interference of macros

From a user perspective, I'd be happy with A[:,j] and A[:,:] allowed, A[i,:] allowed if adjoint exists, and everything else resulting in an error with a helpful "don't do this, it's slow" error message. Anything beyond (macro magic to allow stuff explicitly, etc.) is fine too, of course.

dkarrasch · 2022-06-10T10:41:17Z

Ok, then you should be happy with the current state, I hope. We could release this as is and then take some time to work out how to allow things that are currently forbidden. That would be "a new feature" and non-breaking, as opposed to first allowing something and later require a macro to do that. I'll wait for @JeffFessler's thumb.

oschulz · 2022-06-10T10:59:55Z

Sounds great - again, thanks a lot!

JeffFessler · 2022-06-10T12:54:18Z

src/getindex.jl

+Base.lastindex(A::LinearMap) = last(eachindex(IndexLinear(), A))
+Base.firstindex(A::LinearMap) = first(eachindex(IndexLinear(), A))


I noticed that codecov complains about these two lines. I wonder if they should be commented out because we are not supporting A[1] and A[end] at this point in time, right?

JeffFessler · 2022-06-10T12:59:02Z

src/getindex.jl

+    return x
+end
+
+function _fillbyrows!(dest, A, I, J)


should these be commented out since A[I,J] is not supported?
the codecov warning is what caught my eye here.

dkarrasch · 2022-06-10

The tests were not variable enough. This method is needed in general.

JeffFessler · 2022-06-10T13:04:26Z

> I'd be happy with A[:,j] and A[:,:] allowed, A[i,:] allowed if adjoint exists

this is also fine with me. thumbs up other than a couple questions i asked about commenting out possibly unused code which i leave to you to decide.

oschulz · 2022-06-13T07:38:07Z

Thanks again!

dkarrasch force-pushed the dk/getindex branch 3 times, most recently from 7e4d6e2 to a3b8c34 Compare December 2, 2021 17:00

dkarrasch force-pushed the dk/getindex branch from 176ef4f to 7b70fc8 Compare December 3, 2021 16:30

dkarrasch force-pushed the dk/getindex branch from abc5e38 to ef2867f Compare February 28, 2022 14:40

dkarrasch force-pushed the dk/getindex branch from ef2867f to 4fe0a61 Compare March 21, 2022 11:13

dkarrasch force-pushed the dk/getindex branch from 02a964c to d946e3a Compare May 23, 2022 17:29

dkarrasch marked this pull request as ready for review May 25, 2022 20:45

JeffFessler reviewed Jun 3, 2022

JeffFessler reviewed Jun 3, 2022

dkarrasch and others added 12 commits June 3, 2022 17:14

Add getindex functionality

1a52dea

increase code coverage

97c1fb3

simplify compiler macros

d56060f

reduce code repetition

0c9529c

fix & test rectangular case, further optimizations

743e456

minor cleanup

f61f2a6

update Aqua.jl badge

090d928

fix test

f546fe1

add logical and diagonal indexing

4e87153

revert README change

6582b3e

revert unrelated test changes

04b86fc

add history entry

a205d15

overhaul after code review

84a95bb

dkarrasch added 2 commits June 9, 2022 16:23

throw more

46865b8

improve coverage

8f17ed0

allow double-colon

c5c849a

more (robust) tests

31bfd43

JeffFessler reviewed Jun 10, 2022

improve coverage

d378b04

dkarrasch changed the title ~~Add getindex functionality~~ Add slicing functionality Jun 12, 2022

dkarrasch merged commit 0a42082 into master Jun 12, 2022

dkarrasch deleted the dk/getindex branch June 12, 2022 09:54
