Laser tensor PR rebase #477

Vindaar · 2020-11-27T15:54:34Z

Here it is finally. Sorry for taking so long.

This is a rebase of the laser tensor PR #420 onto the current state of arraymancer. The major complication was the restructure of the whole arraymancer library to support importing individual parts as submodules.

I had to fix a few things that came up:

there was a self assignment line in pca.nim, which caused the tensor to be destroyed maybe? I removed it and now it works with arc. Relevant commit
: 63b4705
Maybe my =destroy implementation is at fault? @Clyybber maybe?
I had to add a CpuStorageObj for the =destroy proc signature. That's the same as would happen implicitly I assume?
there's still the issue with the supportsCopyMem not supported in type sections and the workaround:
https://github.com/mratsim/Arraymancer/blob/laser-tensor-rebase/src/arraymancer/laser/tensor/datatypes.nim#L31-L36

As discussed on #420 this can cause issues where certain types (variant objects) match different branches in the type definition and in the code using when T.supportsCopyMem.
For that reason I will define a type class of types for which we know that they support copyMem which will then be used in the code as well as in the type definition to make sure laser based tensors are used for the vast majority of types used in practice.

I'll finish the latter maybe today and at the latest over the weekend. Wanted to open this PR first though, because I don't know how much more time I have today and to see what the CI says...

Vindaar · 2020-11-30T09:39:31Z

CI failure is because I forgot to add Complex to KnownSupportsCopyMem from here:
https://github.com/mratsim/Arraymancer/blob/laser-tensor-rebase/tests/tensor/test_filling_data.nim#L33

However, it's a bit worrying that this failed in the first place, indicating that non mem copyable types may have hidden bugs. It could be a good idea to run all basic tests in a mode where all types are treated as mem copyable? Maybe have a

when not defined(noPtrLenBackend):
type
  KnownSupportsCopyMem* = SomeNumber | char | Complex[float64] | Complex[float32]
else:
  KnownSupportsCopyMem* = distinct object

and then run tests with -d:noPtrLenBackend in addition?

edit:
I'm not sure if the test failure really is a failure or if the test is broken. copyFrom states that the two tensors should have the same shape:

  ## Copy the source tensor into the destination tensor.
  ## Both should have the same shape. If destination tensor is a view
  ## only the data exposed by the view is modified.

yet in the test they don't:

        let a = [[1,2],[3,4]].toTensor.reshape(2,2).astype(Complex[float64])
        var b = ones[Complex[float64]](4,1)

one is (2, 2), the other is (4, 1).

The thing is that the copyMem branch for contiguous tensors only performs a check on the size, but not the shape:

    if src.is_C_contiguous:
      assert dst.size == src.size
      omp_parallel_chunks(
            src.size, chunk_offset, chunk_size,
            OMP_MEMORY_BOUND_GRAIN_SIZE * 4):
        copyMem(
          dst.unsafe_raw_offset[chunk_offset].addr,
          src.unsafe_raw_offset[chunk_offset].unsafeAddr,
          chunk_size * sizeof(T)
        )

@mratsim: should copyMem perform an assert on the shape instead?
I'm going to fix the test case to use the same shape.

Vindaar · 2020-12-02T14:51:00Z

FYI: nim-lang/Nim#16185 is still an open issue, which makes the code here unsafe if used with ARC.

And a personal blocker: ggplotnim does not pass its dataframe tests (independent of ARC!). It throws an out of memory error. I'll try to debug that in the coming days. Once I figure out if it's a problem on ggplotnim's or arraymancer's side, I'd consider merging this PR. @mratsim still has to review it at that point though.

…build/tests_tensor_part01 -r tests/_split_tests/tests_tensor_part01.nim"

…assing all tests with gc:markandsweep

later state: - test_display - test_higherorder - test_ufunc Commonalities: - all rely on higher order templates map/apply - all have at least a test that involve mapping a string Issue can be reliably detected in its own module by wrapping tests into a proc and avoid globals and then calling GC_fullcollect or GC_collectZct

…leted without deprecation

This is needed, due to the changes that happened to the Arraymancer directory layout in the meantime.

This was already removed due to possibly being bugged in 56de8c9

`reset` sets the length of the underlying sequence to 0! This way we actually reset it to empty and resize to tensor size, which should reallocate a new block that will be zeroed for us. Since `newSeq` already zeroes in the first place, we don't have to call `setZero` when constructing a seq based tensor.

This at least _seems_ to work fine as far as I can test, both using ARC and normal GC. `export_tensor` still used `data` as well. For the time being we can use `toRawSeq` until a better solution is in place.

As long as this test case passes, self assignment of objects containing Tensors using ARC is broken in arraymancer!

…yMem

- `io_hdf5` is not imported automatically anymore if the module is installed. The reason for this is that the HDF5 library runs code in global scope to initialize the HDF5 library. This means dead code elimination does not work and a binary will always depend on the HDF5 shared library if the `nimhdf5` is installed, even if not used.

…uda for Nim 1.0.x

Vindaar mentioned this pull request Nov 29, 2020

[ARC] Memory zeroed after =sink even for self assignment nim-lang/Nim#16185

Closed

Vindaar force-pushed the laser-tensor-rebase branch from 5a46224 to d5339cc Compare December 7, 2020 17:16

Vindaar mentioned this pull request Dec 8, 2020

Tutorial about efficiently passing python list or array yglukhov/nimpy#114

Closed

mratsim and others added 25 commits December 9, 2020 21:07

Lay out the base data structure and iteration from laser

da653d3

Replace the core tensor data structure by Laser and see what breaks

a5cd959

Workaround + progress. Now compiler quits in the middle of "nim c -o:…

3f68ac6

…build/tests_tensor_part01 -r tests/_split_tests/tests_tensor_part01.nim"

remove usage of dataArray

7438c5b

rebase merge leftover

f3a1b45

Handle Nim distinctBase move to typetraits in devel

46c9add

supportsCopyMem in 1.0.6

c24bba9

Small fixes on deprecated "data" proc

43409a8

Make the test suite compile (returning var seq is broken)

811029a

Fix offset bug when iterating

c050ddc

deprecated cleanup

a1f0f70

fix out-of-bounds test

5d46580

typetraits is always needed

7b39341

deprecate toRawSeq (cannot be made backward compatible)

d46a724

The default LASER_MAX_RANK is 6

b17711e

Fix sparse softmax cross-entropy test. Nim random is inclusive :/

6d52b88

Fix the new parallel copyFrom (impacts index_select/embedding/GRU). P…

518eede

…assing all tests with gc:markandsweep

Update changelog + unsafe_raw_data leftovers

58b8a04

Make laser-tensor-iterator compile with latest Nim

85f47c1

update selectors to work with laser

742b15d

Fix the bug left (nim-lang/Nim#13598) and reactivate full test suite

a86be78

Don't use deprecated .data accessor + the mutable one is broken so de…

4fecf7c

…leted without deprecation

use unsafe_raw_offset in example instead of get_offset_ptr

11cf774

move laser files to tensor subdirectory

98e10a7

This is needed, due to the changes that happened to the Arraymancer directory layout in the meantime.

Vindaar added 22 commits December 9, 2020 21:08

remove data proc again after readded by mistake

02c1597

This was already removed due to possibly being bugged in 56de8c9

fix when condition in setZero

b1cf625

set memalloc pointer to nil after freeing in finalizer / destroy

5ed3253

define = as {.error.} for CpuStorageObj for ARC/ORC

d07362a

change toRawSeq to not use removed data, copy manually

02d4224

This at least _seems_ to work fine as far as I can test, both using ARC and normal GC. `export_tensor` still used `data` as well. For the time being we can use `toRawSeq` until a better solution is in place.

add a test case checking for self assignment bug

42d8b37

As long as this test case passes, self assignment of objects containing Tensors using ARC is broken in arraymancer!

add nimble tasks to test with ARC, use for travis / appveyor

d6fff49

make ARC specific test only run under ARC

755d13f

remove ARC test on appveyor since test runs Nim 1.0.6

9616415

add bool to KnownSupportsCopyMem

fac4ee9

make unsafe_raw_* procs only valid for KnownSupportsCopyMem

0c86636

make atIndex(Mut) only use unsafe_raw procs if not KnownSupportsCop…

5e66874

…yMem

make *stridedIteratior templates work for not KnownSupportsCopyMem

4b09f15

make atContiguousIndex not use unsafe_raw procs for non mem copyable

fae6d4f

avoid forEach macro for non KnownSupportsCopyMem in newTensorWith

ca3fc37

make *_inline templates work for non KnownSupportsMemCopy

6024a1e

add more safety checks for map/apply reg KnownSupportsCopyMem

f45b7fc

error for get_*_ptr, dataArray for not KnownSupportsCopyMem

7ecfb37

only allow argmax_max for SomeNumber since it uses unsafe_raw

5df60b4

add fromBuffer to create a tensor w/o copy from a raw pointer

f2f6c8c

mratsim force-pushed the laser-tensor-rebase branch from e7dd185 to f2f6c8c Compare December 9, 2020 20:18

Remove MetadataArray ambiguous call, pass OpenCL tests, can compile C…

fed700b

…uda for Nim 1.0.x

mratsim approved these changes Dec 10, 2020

View reviewed changes

mratsim merged commit ccb60d5 into master Dec 10, 2020

mratsim mentioned this pull request Dec 10, 2020

[Blocked by upstream bug] Change Tensor backend to pointer+length #420

Closed

mratsim deleted the laser-tensor-rebase branch December 12, 2020 10:45

Clonkk mentioned this pull request Dec 14, 2020

Arraymancer is bugged with gc:arc #423

Closed

Vindaar mentioned this pull request Jan 4, 2021

Future direction for memory management? #489

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Laser tensor PR rebase #477

Laser tensor PR rebase #477

Vindaar commented Nov 27, 2020

Vindaar commented Nov 30, 2020 •

edited

Vindaar commented Dec 2, 2020 •

edited

Laser tensor PR rebase #477

Laser tensor PR rebase #477

Conversation

Vindaar commented Nov 27, 2020

Vindaar commented Nov 30, 2020 • edited

Vindaar commented Dec 2, 2020 • edited

Vindaar commented Nov 30, 2020 •

edited

Vindaar commented Dec 2, 2020 •

edited