
Create einsum operation #197

Merged: 39 commits, Jul 15, 2022

Conversation

@MilesCranmer (Contributor) commented Jul 4, 2022

This adds the functional einsum function, as requested in #73. CC @arogozhnikov @cgarciae

The current implementation simply parses the string and converts it to standard einsum notation by mapping each axis name to a single character (I use string.ascii_letters, starting from a, b, c, etc.).
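For illustration, here is a minimal sketch of that mapping (a hypothetical helper, not the PR's actual code; the name to_einsum_pattern is made up):

import string

def to_einsum_pattern(pattern: str) -> str:
    # Map each named axis to a single letter, preserving commas and "->",
    # e.g. "batch h w, h w channel -> batch channel" becomes "abc,bcd->ad".
    letters = iter(string.ascii_letters)
    mapping = {}
    out = []
    for token in pattern.replace(",", " , ").replace("->", " -> ").split():
        if token in (",", "->"):
            out.append(token)
        else:
            if token not in mapping:
                mapping[token] = next(letters)
            out.append(mapping[token])
    return "".join(out)

print(to_einsum_pattern("batch h w, h w channel -> batch channel"))  # abc,bcd->ad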

Currently, it has the following features:

  • Supports the backends: tensorflow, numpy, jax, pytorch, chainer, oneflow, keras, cupy.
  • Allows for an arbitrary number of tensors passed.
  • Allows ellipsis specification, including for multiple tensors, so long as it is provided on both the left and the right of the ->.

It does not currently support:

  • Reshape operations, such as "(batch channel) feature, feature -> batch channel".
  • Custom reduction operations.

These could be added later if desired. Some backends do not support custom reductions in their einsum implementations, so that would be a bit more work.

I also added a docstring and some unittests (in tests/test_einsum.py).

Here are some examples of use, with the numpy backend:

>>> import numpy as np
>>> from einops import einsum

# Filter a set of images:
>>> batched_images = np.random.randn(128, 16, 16)
>>> filters = np.random.randn(16, 16, 30)
>>> result = einsum(batched_images, filters,
...                 "batch h w, h w channel -> batch channel")

>>> result.shape
(128, 30)

# Matrix multiplication, with an unknown input shape:
>>> batch_shape = (50, 30)
>>> data = np.random.randn(*batch_shape, 20)
>>> weights = np.random.randn(10, 20)
>>> result = einsum(weights, data, 
...                 "out_dim in_dim, ... in_dim -> ... out_dim")
>>> result.shape
(50, 30, 10)

Note that the number of spaces around the comma above is arbitrary: you could write either "in_dim, ..." or "in_dim , ...", and both will work.

Eager to hear feedback on this!

Cheers,
Miles


Edit 1: Got it working for repeated indices on one side (as used in, e.g., trace).
Edit 2: Added support for chainer, oneflow, cupy, tensorflow.keras.
Edit 3: Added many more tests, some mirroring those used in the np.einsum tests.
Edit 4: More and more unit tests.
Edit 5: Tweaked the syntax to have tensors first, pattern second. Adapted tests, and added new validation for order of arguments.

@MilesCranmer mentioned this pull request on Jul 4, 2022
@MilesCranmer (Contributor, Author) commented Jul 4, 2022

I think implementing the rearrange operations inside einsum shouldn't be too bad: since einsum doesn't require any specific order of indices, you could call

for (tensor, left_expression) in zip(tensors, left_expressions):
    axis_names = ...  # flat list of the axis names appearing in left_expression
    tensor = rearrange(tensor, left_expression + "->" + " ".join(axis_names))

for every left_expression. Then, you would pass " ".join(axis_names) back to einops.einsum.

Then, you could do a similar rearrange on the output expression.
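For example, a composite output axis like "i j, j k -> (i k)" could be handled by a plain einsum followed by a rearrange of the result. A rough sketch, assuming the einsum from this PR together with the existing rearrange (not code from the PR):

import numpy as np
from einops import rearrange, einsum

x = np.random.randn(3, 4)
y = np.random.randn(4, 5)

# Desired (not yet supported) pattern: "i j, j k -> (i k)"
out = einsum(x, y, "i j, j k -> i k")   # regular einsum without grouping
out = rearrange(out, "i k -> (i k)")    # fold the composite output axis afterwards
print(out.shape)  # (15,)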

What do you think @arogozhnikov? (for a future PR, of course)

@MilesCranmer (Contributor, Author) commented Jul 4, 2022

Actually, I realized this doesn't work with repeated variable names, e.g., if I want to compute the trace of a tensor:

einsum("index index -> ", np.ones((5, 5)))

ParsedExpression doesn't allow duplicate dimensions, so this doesn't work. I guess I could modify it to allow this.

Edit: fixed with allow_duplicates argument to ParsedExpression.
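With that fix, something like the following should work for the trace (a quick sketch using the final tensors-first syntax and the numpy backend, not a snippet from the PR):

import numpy as np
from einops import einsum

# Repeated index on the left, empty output: the trace of a square matrix.
trace = einsum(np.ones((5, 5)), "index index ->")
print(trace)  # 5.0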

@arogozhnikov (Owner) left a comment

Cool, thanks Miles for the very clean PR and for collecting a test suite.

I've left some thoughts and requests for two kinds of tests:

  1. fail on parsing of features that aren't yet supported
  2. add tests for symbolic backends (probably the latter would only include tf, but still)

Review threads (resolved): einops/einops.py, tests/test_einsum.py
@MilesCranmer (Contributor, Author)

Okay, all suggestions implemented. Let me know what you think.

@MilesCranmer (Contributor, Author)

Okay everything is implemented for the new syntax:

y = einsum(x, x, "i j, i k -> j k")

I also added new validation checks for the argument order, and corresponding unit tests.
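As a hedged illustration of that validation (not a snippet from the PR; the exact error type and message may differ), passing the pattern before the tensors should now be rejected:

import numpy as np
from einops import einsum

x = np.random.randn(2, 3)
try:
    einsum("i j, i k -> j k", x, x)   # wrong order: pattern first
except Exception as err:
    print(type(err).__name__)         # the argument-order validation fires here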

In the unit tests, I now check the specific message of each error rather than just the error type.

Let me know what you think.

@arogozhnikov (Owner) commented Jul 10, 2022

> for every left_expression. Then, you would pass " ".join(axis_names) back to einops.einsum.
> Then, you could do a similar rearrange on the output expression.

It's trickier, since you want some of the axes to be derived from the input shapes.
In the example you previously posted it could be something like (i j) k, i -> j k, so the shape of the second argument would have to be parsed first. For the output that's actually straightforward, and just applying a pattern would always work.
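A hypothetical sketch of that inference (not PR code): for "(i j) k, i -> j k", the size of i has to be read from the second tensor before the first one can be un-flattened.

import numpy as np
from einops import rearrange, einsum

x = np.random.randn(6, 4)   # axes "(i j) k", with i * j == 6
v = np.random.randn(2)      # axes "i", so i == 2 and hence j == 3

i = v.shape[0]                                     # derive i from the second input's shape
x_unflat = rearrange(x, "(i j) k -> i j k", i=i)   # un-flatten the composite axis
result = einsum(x_unflat, v, "i j k, i -> j k")
print(result.shape)  # (3, 4)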

Not relevant for the PR; just commenting since you asked.

@@ -560,6 +578,12 @@ def layers(self):
        from .layers import keras
        return keras

    def einsum(self, pattern, *x):
        return self.tf.vectorized_map(
@arogozhnikov (Owner):

I want to understand why it looks so strange in tf.keras.

@MilesCranmer (Contributor, Author):

I'm not sure if I was interpreting the symbolic (layer=True) backends correctly or not.

Basically, this einsum assumes the x tensors have a leading batch axis, which is assumed not to be specified in the pattern. I assumed this because the create_symbol method specifies the shape as a batch shape rather than an absolute shape. Is that correct, or should it assume the pattern also specifies the batch axis?

@MilesCranmer (Contributor, Author):

Actually, I think my implementation has a potential issue if one symbol is batched and another is not (like a weight matrix).

@MilesCranmer (Contributor, Author):

What do you think the correct strategy is here? Should I avoid adding einsum for keras, since it is technically a layer=True backend?

@arogozhnikov (Owner):

layer=True just refers to providing layers; it should not be related to any batch variables, and patterns should include batch variables. Anyway, I forgot that keras is now just a redirection to the TF layers, so I just excluded this part.

@arogozhnikov merged commit e168125 into arogozhnikov:master on Jul 15, 2022
@arogozhnikov (Owner)

PR is merged; I made very minor changes.
Thank you for paying attention to details and for keeping pushing this!

@MilesCranmer (Contributor, Author)

Awesome!! Great to hear.

@gerdm commented Oct 4, 2022

This is a great PR!
+1 for “Custom reduction operations”. Is there anyone already working on this?

@alok commented May 9, 2023

I'm interested in adding rearrange support. Any pointers?
