[Feature suggestion] einops for array indexing #194

shoyer · 2022-06-23T20:59:49Z

As suggested on Twitter by @colah:
https://twitter.com/ch402/status/1539774943214178304

Is there an "einsum version" of gather()? Somehow gather is the most painful function to use when I need it (at least since I started using einsum).

e.g. gather(array, "a b inds[a b] c -> a b c", inds=inds) would represent indexing into the third dim with inds, or something like that

There are a few more syntax ideas in the Twitter thread. I'm not entirely sure what this could look like, but I agree that array indexing syntax is one of the hardest parts of NumPy that isn't already served by Einops.
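For context, here is a minimal sketch of what the pattern in the quoted tweet would compute, written with plain NumPy. The shapes, the random data, and the variable names are assumptions made up for illustration; only the intended semantics (`out[a, b, c] = array[a, b, inds[a, b], c]`) come from the tweet.

```python
import numpy as np

rng = np.random.default_rng(0)
a, b, n, c = 2, 3, 5, 4                       # hypothetical sizes
array = rng.normal(size=(a, b, n, c))
inds = rng.integers(0, n, size=(a, b))        # one index into the third dim per (a, b)

# Option 1: take_along_axis wants indices with the same rank as the array,
# so inds is padded with singleton axes and the gathered axis is squeezed out.
out_take = np.take_along_axis(array, inds[:, :, None, None], axis=2)[:, :, 0, :]

# Option 2: advanced indexing with explicit open grids for the batch axes.
ai, bi = np.ogrid[:a, :b]                     # shapes (a, 1) and (1, b)
out_fancy = array[ai, bi, inds]               # shape (a, b, c)
```

Both options give the same result, but neither makes the roles of the axes visible at the call site, which is roughly the pain point being described.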


arogozhnikov · 2022-06-27T09:39:52Z

@shoyer, I've thought about this quite extensively in the past, and I think I've settled on how it would look.
Not expecting everyone to like it, though.

I've put a relatively simple implementation and some examples here; it would be nice to get your thoughts:
https://github.com/arogozhnikov/einops/blob/master/einops/experimental/indexing.py

I wanted this to be my first python-array-api based function, but found out that indexing isn't really supported by the standard.

MilesCranmer · 2022-07-04T21:45:02Z

Some syntax ideas here: https://github.com/mcabbott/Tullio.jl

From their README:

Tullio is a very flexible einsum macro. It understands many array operations written in index notation -- not just matrix multiplication and permutations, but also convolutions, stencils, scatter/gather, and broadcasting

@tullio M[x,y,c] := N[x+i, y+j,c] * K[i,j]     # sum over i,j, and create M

@tullio S[x] = P[x,y] * log(Q[x,y] / R[y])     # sum over y, and write into S

@tullio A[i,j] += B[i,k,l] * C[l,j] * D[k,j]   # sum over k,l, and add to values in A

@tullio (*) Z[j] := X[ind[k],j] * exp(-Y[k])   # product over k
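For readers more used to NumPy than Julia, the last Tullio line (a gather combined with a product reduction over k) can be read roughly as the sketch below. The shapes, names, and random data are assumptions for illustration, and note that NumPy indices are 0-based while Julia's are 1-based.

```python
import numpy as np

rng = np.random.default_rng(0)
m, n_j, n_k = 6, 4, 3                      # hypothetical sizes
X = rng.normal(size=(m, n_j))
Y = rng.normal(size=(n_k,))
ind = rng.integers(0, m, size=(n_k,))      # row of X picked for each k

# Z[j] = prod over k of X[ind[k], j] * exp(-Y[k])
terms_kj = X[ind] * np.exp(-Y)[:, None]    # gather rows, then scale each row; shape (k, j)
Z = terms_kj.prod(axis=0)                  # shape (j,)
```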

arogozhnikov · 2022-07-05T09:15:25Z

Julia's line-level macros really shine for this kind of stuff

@tullio M[x,y,c] := N[x+i, y+j,c] * K[i,j]     # sum over i,j, and create M

Here is the problem with arbitrary expressions in indexers (aside from implementation complexity): they would immediately be used for conv-style operations (which would run slower and with a much larger memory footprint than cuDNN), and they would immediately run into out-of-bounds or negative indices. I don't see a way to meet users' varying expectations for 'reasonable' out-of-bounds handling.
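To make the out-of-bounds question concrete, here is a small NumPy sketch of the conv-style line above under one possible policy: keep only the positions where every `x+i, y+j` is in bounds (a "valid" output). That policy, the shapes, and the names are assumptions for illustration; padding or wrapping would be equally defensible choices and give different output shapes or values.

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view

rng = np.random.default_rng(0)
N = rng.normal(size=(6, 7, 3))     # axes: x, y, c
K = rng.normal(size=(2, 3))        # axes: i, j

# windows[x, y, c, i, j] == N[x + i, y + j, c], only for in-bounds (x, y)
windows = sliding_window_view(N, K.shape, axis=(0, 1))

# M[x, y, c] = sum over i, j of N[x + i, y + j, c] * K[i, j]
M = np.einsum('xycij,ij->xyc', windows, K)

# The output is smaller than the input along x and y under this policy.
print(N.shape, '->', M.shape)
```

Note also that a negative index in NumPy does not raise but silently wraps around to the end of the axis, which is yet another behaviour a user might or might not expect.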

A path with indexing, or maybe indexing + reduction, looks feasible.

I realize that the indexing proposal above looks a bit extraterrestrial at first, but only until you couple indexing with the second part of the proposal (how those indices should be computed):

# for every timeframe in a video, find the token with the highest norm (across h and w), and compose a new stack of them
norm_bthw = x_bthwc.norm(dim=-1)
# here you explicitly say which axes argmax should be taken on, and the shape of output is readable - 2 x b x t
indices_2bt = argmax(norm_bthw, 'b t h w -> [h, w] b t')
# note that '[h, w] b t' part just migrated from the previous operation
selected_embeddings_btc = einindex('b t c <- b t h w c, [h, w] b t', x_bthwc, indices_2bt)

AFAIK, multidimensional argmax / topk + indexing are not solved in NumPy or the existing frameworks, and the above looks like a quite consistent solution to me.
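For comparison, here is a rough sketch of the same computation in plain NumPy. This is not the proposed API and not the linked implementation; it is only one way to get the result by hand today, with made-up shapes and data. The variable names follow the snippet above.

```python
import numpy as np

rng = np.random.default_rng(0)
b, t, h, w, c = 2, 3, 4, 5, 6                  # hypothetical sizes
x_bthwc = rng.normal(size=(b, t, h, w, c))

norm_bthw = np.linalg.norm(x_bthwc, axis=-1)

# argmax over (h, w) jointly: flatten the two axes, take argmax, then unravel
flat_bt = norm_bthw.reshape(b, t, h * w).argmax(axis=-1)
indices_2bt = np.stack(np.unravel_index(flat_bt, (h, w)))   # shape (2, b, t)

# gather: selected[b, t, c] = x[b, t, h_idx[b, t], w_idx[b, t], c]
h_idx_bt, w_idx_bt = indices_2bt
bi, ti = np.ogrid[:b, :t]
selected_embeddings_btc = x_bthwc[bi, ti, h_idx_bt, w_idx_bt]
```

The flatten/unravel step and the open grids for the batch axes are exactly the kind of bookkeeping that the pattern strings in the proposal are meant to absorb.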

lucidrains · 2022-07-15T16:10:58Z

this would be huge! you have no idea the needless complexity i have written up in the past https://github.com/lucidrains/point-transformer-pytorch/blob/main/point_transformer_pytorch/point_transformer_pytorch.py#L13 lol

lucidrains · 2023-02-21T17:34:28Z

@arogozhnikov what would it take for you to build this out to your heart's content?

you are the only one in the world who can do this justice, imo

jakubMitura14 · 2023-03-11T15:10:11Z

This would be fantastic. I love Julia, but Python is still pretty much required in production at most companies, so getting this in einops would be stellar!

lucidrains · 2023-03-11T15:59:03Z

i'm convinced that works of art like einops can't be extrinsically motivated into existence, but if Alex wants to put up a Patreon, would be glad to become a patron in the short term, with no obligation on his end. "greatness cannot be planned"

lucidrains · 2023-05-18T15:57:33Z

oh. my. god. https://github.com/arogozhnikov/eindex it's happening

shoyer added the feature suggestion label Jun 23, 2022

arogozhnikov mentioned this issue Jun 27, 2022

Proposal: add APIs for getting and setting elements via a list of indices (i.e., take, put, etc) data-apis/array-api#177

Open
