
Features/174 concatenate #319

Merged
merged 29 commits into from Jul 24, 2019

Conversation

coquelin77
Member

Closes issue #174.

@codecov-io

codecov-io commented Jul 1, 2019

Codecov Report

Merging #319 into master will increase coverage by 0.08%.
The diff coverage is 98.76%.

Impacted file tree graph

@@            Coverage Diff             @@
##           master     #319      +/-   ##
==========================================
+ Coverage    96.8%   96.88%   +0.08%     
==========================================
  Files          53       53              
  Lines        8347     8744     +397     
==========================================
+ Hits         8080     8472     +392     
- Misses        267      272       +5
Impacted Files Coverage Δ
heat/core/factories.py 100% <100%> (ø) ⬆️
heat/core/tests/test_manipulations.py 100% <100%> (ø) ⬆️
heat/core/types.py 90.86% <100%> (+0.3%) ⬆️
heat/core/dndarray.py 95.05% <100%> (-0.01%) ⬇️
heat/core/manipulations.py 97.67% <96.66%> (-1.01%) ⬇️

Continue to review full report at Codecov.

Legend
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update bd28e9d...ac3d0b7.

coquelin77 added 8 commits July 16, 2019 08:04
Previously there was a bug where it would crash in __len__ of DNDarray when generating a new one.
This will also handle the generation of a new array with the same split.
A reduction buffer is used to determine whether the array can be formed.
The new code generates the proper array gshape (previously the reduction buffer would sometimes be off by one).
promote_types is used to determine whether the arrays have the proper dtype; if they do not, they are cast to the proper type.
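The promotion step described in the last commit message can be sketched with NumPy's promotion rules; heat's actual implementation uses its own types module, so `cast_to_common_type` here is a hypothetical helper for illustration only:

```python
import numpy as np

def cast_to_common_type(a, b):
    """Cast two arrays to their common promoted dtype before concatenation.

    Illustrative sketch using np.promote_types; heat resolves the common
    dtype through its own types module rather than NumPy.
    """
    common = np.promote_types(a.dtype, b.dtype)
    if a.dtype != common:
        a = a.astype(common)
    if b.dtype != common:
        b = b.astype(common)
    return a, b

a, b = cast_to_common_type(np.arange(3, dtype=np.int32),
                           np.ones(3, dtype=np.float64))
print(a.dtype, b.dtype)  # both promoted to float64
```

Casting before the concatenation keeps the result dtype deterministic regardless of argument order.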
@coquelin77
Member Author

@Markus-Goetz I made some changes to types and factories.array; these might interest you. I couldn't debug that reduction buffer anymore, so I left it and call another allreduce to get the proper shape after the raise statement.

ClaudiaComito
ClaudiaComito previously approved these changes Jul 23, 2019
Contributor

@ClaudiaComito ClaudiaComito left a comment


Hi Daniel @coquelin77,

I ran a few tests and as far as I can tell it works fine (except for the negative axis case).

I see that the concatenated tensor inherits the split value from the input tensors and stays distributed. That makes sense, but it's not what e.g. maximum() and minimum() do (and probably other reduction operations as well; I still have to check). If we agree that's what we want, I'll make the relevant changes.

if not isinstance(arr0, dndarray.DNDarray) or not isinstance(arr1, dndarray.DNDarray):
    raise TypeError('Both arrays must be DNDarrays')
if not isinstance(axis, int):
    raise TypeError('axis must be an integer, currently: {}'.format(type(axis)))
Contributor


I suppose you have a good reason not to call sanitize_axis? At the moment ht.concatenate fails if axis<0.

Example with axis > 0, which works fine:

a = ht.array(ht.random.randn(2, 4), split=1)
b = ht.array(ht.random.randn(2, 7), split=1)
c = ht.concatenate((a, b), axis=1)

The equivalent with axis < 0 doesn't run; see below:

a = ht.array(ht.random.randn(2, 4), split=1)
b = ht.array(ht.random.randn(2, 7), split=1)
c = ht.concatenate((a, b), axis=-1)

mpirun --use-hwthread-cpus -n 2 -tag-output python local_test.py
[1,0]:Traceback (most recent call last):
[1,0]: File "local_test.py", line 8, in
[1,0]: c = ht.concatenate((a, b), axis=-1)
[1,0]: File "/Users/c.comito/HAF/heat/heat/core/manipulations.py", line 108, in concatenate
[1,0]: ' {}, {}'.format(arr0.gshape, arr1.gshape))
[1,0]:ValueError: Arrays cannot be concatenated, gshapes must be the same in every axis except the selected axis: (2, 4), (2, 7)
[1,1]:Traceback (most recent call last):
[1,1]: File "local_test.py", line 8, in
[1,1]: c = ht.concatenate((a, b), axis=-1)
[1,1]: File "/Users/c.comito/HAF/heat/heat/core/manipulations.py", line 108, in concatenate
[1,1]: ' {}, {}'.format(arr0.gshape, arr1.gshape))
[1,1]:ValueError: Arrays cannot be concatenated, gshapes must be the same in every axis except the selected axis: (2, 4), (2, 7)
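The failure above happens because axis=-1 is never mapped to its non-negative equivalent before the per-dimension gshape comparison. A minimal sketch of that normalization, mirroring what a sanitize_axis-style check would do (this helper is illustrative, not heat's actual code):

```python
def sanitize_axis(shape, axis):
    """Map a possibly negative axis to its non-negative equivalent.

    Hypothetical helper for illustration; heat's sanitize_axis lives in
    the library itself and may differ in detail.
    """
    if not isinstance(axis, int):
        raise TypeError('axis must be an integer, currently: {}'.format(type(axis)))
    if axis < 0:
        axis += len(shape)
    if not 0 <= axis < len(shape):
        raise ValueError('axis {} out of bounds for shape {}'.format(axis, shape))
    return axis

# axis=-1 on a 2D shape resolves to axis=1, so the gshape comparison
# would then correctly skip the concatenation dimension
print(sanitize_axis((2, 4), -1))  # 1
```

With this normalization applied up front, the (2, 4) / (2, 7) example concatenates along the last dimension instead of raising.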

Member Author


The reason I did this was to minimize communication. I feel that it makes intuitive sense to inherit the split axis in this case. I do not handle the separate-split case here, as it would require more overhead; instead, I leave it to the user to decide whether they want to resplit the data, and if so, they can do it themselves.
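The shape logic behind this concatenation (matching global shapes in every dimension except the concatenation axis, which is summed) can be sketched independently of the split handling; `concat_gshape` below is an illustrative name, not heat's internal function:

```python
def concat_gshape(shape0, shape1, axis):
    """Compute the global shape of concatenating two arrays along `axis`.

    Sketch of the shape check and result computation; heat additionally
    handles the distributed (split) bookkeeping, which is omitted here.
    """
    if len(shape0) != len(shape1):
        raise ValueError('arrays must have the same number of dimensions')
    for dim, (s0, s1) in enumerate(zip(shape0, shape1)):
        if dim != axis and s0 != s1:
            raise ValueError(
                'gshapes must be the same in every axis except the selected '
                'axis: {}, {}'.format(shape0, shape1))
    out = list(shape0)
    out[axis] = shape0[axis] + shape1[axis]
    return tuple(out)

# the working example from the review: (2, 4) and (2, 7) along axis 1
print(concat_gshape((2, 4), (2, 7), axis=1))  # (2, 11)
```

Keeping the result split along the same axis as the inputs means each process only holds its own slice of the summed dimension, which is why no redistribution (and hence no extra communication) is needed.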

@coquelin77 coquelin77 merged commit 474d74e into master Jul 24, 2019
@coquelin77 coquelin77 deleted the features/174-concatenate branch July 24, 2019 13:54