
Add einsum, closes #124 #363

Merged: 36 commits merged into mratsim:master on Jun 30, 2019

Conversation

Vindaar (Collaborator) commented Jun 17, 2019

This PR adds Einstein summation to Arraymancer via the einsum macro.

It's still a WIP, but all features described here:
https://rockt.github.io/2018/04/30/einsum
are implemented and covered by tests (as a bonus, taking the diagonal of a tensor also works, which according to the article does not work in PyTorch ;) ).
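
(For illustration only, not one of the actual test cases: with the explicit form described below, the diagonal would presumably be obtained by keeping the repeated index on the left-hand side, roughly

let diag = einsum(a):
  diag[i] = a[i, i]

for a square matrix a, with diag an arbitrarily chosen result name.)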

The major thing still missing is actually taking the type of the input tensor into account. At the moment I just map to float tensors.

The code still has to be cleaned up and refactored in parts. Documentation also has to be added.

In principle the macro works as follows. Assuming we have two tensors a, b (a single tensor, or more than two, also works in theory), we can use Einstein summation as follows:

let res = einsum(a, b):
  a[i,j] * b[j,k]

to express a matrix multiplication implicitly or:

let res = einsum(a, b):
  res[i,k] = a[i,j] * b[j,k]

to assign the resulting indices explicitly. In the former case we automatically determine the indices to be contracted and lay out the result along the axes given by the order in which the non-contracting indices appear. The explicit usage also allows transposing the result directly (e.g. exchanging i, k -> k, i on the res LHS above).

In some cases the explicit form is strictly necessary, e.g. if one wants to only iterate over (not contract) an axis, but keep it in the result. All tests are done in both implicit and explicit form. If no implicit form is tested, it means the test case does not make sense without the constraints from the explicit LHS (or would yield a different result). Take the Hadamard product as an example; in a hypothetical implicit form:

let res = einsum(a, b):
  a[i,j] * b[i,j]

normal Einstein summation demands a full contraction of the tensors. Only by explicitly stating that both axes are to be kept:

let res = einsum(a, b):
  res[i,j] = a[i,j] * b[i,j]

can it be expressed. This means that the implicit form favors actual Einstein summation, i.e. an index appearing more than once is summed over.

Note that the identifier used on the LHS of the statement in the explicit case does not actually matter. It can be the same as what you assign to, but it can be something else entirely too.
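
For example, at this stage of the PR the following (with an arbitrarily chosen LHS identifier foo) is equivalent to the explicit matrix multiplication above:

let res = einsum(a, b):
  foo[i, k] = a[i, j] * b[j, k]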

The implementation simply writes nested for loops over the non-contracting axes and, within those, over the contracting axes, and assigns the accumulated result.
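
For the matrix multiplication example above, a simplified sketch of what that generated code amounts to (assuming float tensors, as per the current float-only mapping; the actual expansion is shown further down in this thread):

# simplified sketch of the loops einsum emits for a[i,j] * b[j,k]
var tmp = newTensor[float](@[a.shape[0], b.shape[1]])
for i in 0 ..< a.shape[0]:       # non-contracting axis of a
  for k in 0 ..< b.shape[1]:     # non-contracting axis of b
    var res: float               # accumulator for the contraction
    for j in 0 ..< a.shape[1]:   # contracting axis j
      res += a[i, j] * b[j, k]
    tmp[i, k] = res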

I'm opening the PR already because I'm unsure about the file structure for this:

  • Where should the test go? Is the tensors subdirectory correct?
  • How do I run the test from the nimble file? Is every test_ file in the tests directory run automatically?

mratsim (Owner) commented Jun 18, 2019

Excellent.

The major thing still missing is actually taking the type of the input tensor into account. At the moment I just map to float tensors.

Feel free to use getSubType from:

macro getSubType*(TT: typedesc): untyped =
  # Get the subtype T of an AnyTensor[T] input
  getTypeInst(TT)[1][1]
I don't mind supporting float only at first; working with types in macros is tricky (nim-lang/RFCs#44), but I usually manage to find a way to make it work, so if you struggle just use floats at first.
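
For reference, in the generated code this is used simply as a type expression, e.g. (mirroring the expansion quoted later in this thread, with illustrative names):

# given some Tensor `a`, alias its element type inside the macro output
type T = getSubType(type(a))   # e.g. float for a Tensor[float]
var accum: T                   # accumulator with the matching element type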

If no implicit form is tested, it means the test case does not make sense without the constraints from the explicit LHS (or would yield a different result, e.g. look at the Hadamard product) in a hypothetical implicit form

We should probably have an assert, a compile-time error, or at the very least a documentation paragraph that says "Please use the explicit form of Einstein summation, for example C[i, j] = A[i, j] * B[i, j] instead of just A[i, j] * B[i, j]" where we can.
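
(A sketch of what such a check could look like, using the error proc from Nim's macros module; the proc name and the exact condition here are hypothetical, not from the PR:)

import macros

# Hypothetical helper: reject a bare implicit expression where the
# explicit form `res[...] = ...` is required.
proc checkExplicit(stmt: NimNode) =
  if stmt.kind != nnkAsgn:
    error("Please use the explicit form of Einstein summation, for example " &
          "C[i, j] = A[i, j] * B[i, j] instead of just A[i, j] * B[i, j]", stmt)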

Note that the identifier used on the LHS of the statement in the explicit case does not actually matter. It can be the same as what you assign to, but it can be something else entirely too.

No problem, we can also consider this syntax:

einsum(res, a, b):
  let res[i,k] = a[i,j] * b[j,k]

instead of

let res = einsum(a, b):
  res[i,k] = a[i,j] * b[j,k]

Where should the test go? Is the tensors subdirectory correct?

Yes, it's correct.

How do I run the test from the nimble file? Is every test_ file run in the tests directory?

Add it to:

Vindaar (Collaborator, Author) commented Jun 23, 2019

Ok, finally did some refactoring.

Regarding the explicit / implicit syntax: unfortunately, the syntax you proposed:

einsum(res, a, b):
  let res[i,k] = a[i,j] * b[j,k]

is not valid Nim syntax (it throws Error: ':' or '=' expected, but got '[').

So right now I just introduced dual behavior: the implicit form maps to just the block and the user must assign it to a variable, while the explicit form creates a let variable with the identifier the user chose.
That way, unfortunately, we can't have the macro produce a var variable. :/

I'm not sure in general whether the rather subtle, yet profound, difference in how einsum behaves between these two cases is desirable. Maybe put them under different macro names?

edit: I also started adding a file that contains examples that are supposed to fail. I know that testament supports failing tests, but unittest does not, does it? Is it worth expanding on this?

The types of all tensors given as arguments to `einsum` must match. Thus we check whether they are all the same; if they are, we use the type of the first tensor. In both cases `einsum` again just returns a tensor.

Instead of working with the given tensors, we now create local copies, which are made contiguous and (if required) converted to row major order. This way our iteration should be more efficient in case column major tensors are present.

This way the rightmost indices of the accessor will be the innermost loops. Since we force row major ordering, those indices will be closest together in memory.
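
A minimal sketch of that idea, using asContiguous as it appears in the expansion quoted further down (float element type assumed here):

# force a contiguous row-major copy, so that the rightmost index of aCont
# varies fastest in memory and can sit in the innermost loop
let aCont = asContiguous[float](a, layout = rowMajor, force = true)
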
Vindaar (Collaborator, Author) commented Jun 26, 2019

There's still some stuff I would consider changing (I'm not happy with the main macro code, to be honest, but refactoring further seemed to amount to writing procs with tons of arguments :/) and I'm not sure if the documentation will come out fine (I have almost no RST experience).

Aside from that I consider this PR to be mostly done for now.

We now create local tensors with asContiguous, forcing row major order.

edit:
Oh, one thing I just remembered: I didn't do any proper name mangling yet, because I had in mind that you had a way to do that in the library, but I couldn't find it before.
If I'm mistaken, I'd just replace T0Mangle, tmp, and res by calls to genSym.
edit_end

For a complex example, the generated code currently looks like this:

let b = einsum(m, n):
  b[p,s,t,u,v] = m[p,q,r,s] * n[t,u,q,v,r]
# will expand to
let b = block:
  type
    T0Mangle = getSubType(type(m))
  when T0Mangle isnot getSubType(type(m)):
    {.error: "All tensors must be of the same type! " & $"m" & " is of " &
        "type " & $typeName(getSubType(type(m))) & " while " & $"m" &
        " is of type " & $typeName(getSubType(type(m))) & "!".}
  elif T0Mangle isnot getSubType(type(n)):
    {.error: "All tensors must be of the same type! " & $"m" & " is of " &
        "type " & $typeName(getSubType(type(m))) & " while " & $"n" &
        " is of type " & $typeName(getSubType(type(n))) & "!".}
  let mCont = asContiguous[getSubType(type(m))](m, layout = rowMajor, force = true)
  let nCont = asContiguous[getSubType(type(n))](n, layout = rowMajor, force = true)
  doAssert nCont.rank == 5
  var shapes = newSeq[int](5)
  []=(shapes, 0, mCont.shape[0])
  []=(shapes, 1, mCont.shape[3])
  []=(shapes, 2, nCont.shape[0])
  []=(shapes, 3, nCont.shape[1])
  []=(shapes, 4, nCont.shape[3])
  var shapesContr = newSeq[int](2)
  []=(shapesContr, 0, nCont.shape[2])
  []=(shapesContr, 1, nCont.shape[4])
  var tmp = newTensor[getSubType(type(m))](shapes)
  for p in 0 ..< shapes[0]:
    for s in 0 ..< shapes[1]:
      for t in 0 ..< shapes[2]:
        for u in 0 ..< shapes[3]:
          for v in 0 ..< shapes[4]:
            var res: getSubType(type(m))
            for q in 0 ..< shapesContr[0]:
              for r in 0 ..< shapesContr[1]:
                res += mCont[p, q, r, s] * nCont[t, u, q, v, r]
            tmp[p, s, t, u, v] = res
  tmp

Vindaar marked this pull request as ready for review on June 26, 2019, 22:20
mratsim (Owner) commented Jun 27, 2019

Regarding name mangling, you are probably mixing that up with laser: https://github.com/numforge/laser/blob/bf751f4bbec3d178cd3a80da73e446658d0f8dff/laser/openmp.nim#L13-L25

var mangling_rng {.compileTime.} = initRand(0x1337DEADBEEF)
var current_suffix {.compileTime.} = ""

proc omp_suffix*(genNew: static bool = false): string {.compileTime.} =
  ## genNew:
  ##   if false, return the last suffix
  ##   else return a fresh one
  # This is exported because you cannot bind the symbol early enough
  # for exportc

  if genNew:
    current_suffix = mangling_rng.rand(high(uint32)).toHex
  result = current_suffix

I don't even mangle the layers in the neural network DSL:

proc hash*(x: NimNode): Hash =
  assert x.kind == nnkIdent
  result = hash($x)

Vindaar (Collaborator, Author) commented Jun 27, 2019

Oh, then I saw it in laser.

I now use genSym for all variables. I added a few more comments to the documentation and cleaned up a few things here and there.
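
For illustration, the kind of call this boils down to (a minimal standalone sketch using genSym from Nim's standard macros module, not the PR's actual code):

import macros

macro sketch(): untyped =
  # request a unique symbol for a generated temporary instead of
  # hard-coding an identifier like `tmp` that could collide with user code
  let tmpId = genSym(nskVar, "tmp")
  result = quote do:
    block:
      var `tmpId` = 0.0
      `tmpId` += 1.0
      `tmpId`

echo sketch()   # prints 1.0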

@mratsim mratsim merged commit 686836e into mratsim:master Jun 30, 2019