Add split shape utility #3062

jantonguirao · 2021-06-16T15:42:09Z

Signed-off-by: Joaquin Anton janton@nvidia.com

Why we need this PR?

Pick one, remove the rest

It adds a new internal utility needed to split a big shape into blocks for parallelization.

What happened in this PR?

Fill relevant points, put NA otherwise. Replace anything inside []

What solution was applied:
Introduced a utility function that provides per-dimension split factor given a shape, a desired number of blocks and a minimum practical block size.
Affected modules and functionalities:
NA
Key points relevant for the review:
split_shape implementation, tests (did I miss any meaningful testcase?)
Validation and testing:
Unit tested
Documentation (including examples):
Doxygen

JIRA TASK: [DALI-2128]

mzient · 2021-06-17T09:40:09Z

dali/kernels/common/split_shape.h

+/**
+ * @brief Utility to divide a bigger shape into smaller blocks, given a desired number of blocks
+ *        and a minimum practical block size.
+ *        The algorithm start splitting from the outtermost dimension until the number of blocks


Suggested change

* The algorithm start splitting from the outtermost dimension until the number of blocks

* The algorithm starts splitting from the outermost dimension until the number of blocks

mzient · 2021-06-17T09:48:44Z

dali/kernels/common/split_shape.h

+template <typename SplitFactor, typename Shape>
+void split_shape(SplitFactor& split_factor, const Shape& in_shape, int min_nblocks = 8,
+                 int min_sz = (16 << 10)) {


If this is intended for threading, then there are several issues:

If we can divide the shape into a number of threads that's a multiple of thread count, then we need no more partitions.

We should avoid uneven splits unless there's a good chance that it will be masked by the large number of blocks (much more than threads).

My idea is that the caller will specify a suitable value for min_nblocks (a multiple of the number of threads)

Signed-off-by: Joaquin Anton <janton@nvidia.com>

mzient · 2021-06-17T13:09:17Z

dali/kernels/common/split_shape.h

+  int ndim = in_shape.size();
+  assert(static_cast<int>(split_factor.size()) == ndim);


Suggested change

int ndim = in_shape.size();

assert(static_cast<int>(split_factor.size()) == ndim);

int ndim = dali::size(in_shape);

assert(static_cast<int>(dali::size(size) == ndim);

mzient · 2021-06-17T13:09:57Z

dali/kernels/common/split_shape.h

+  for (int d = 0; d < ndim; d++)
+    split_factor[d] = 1;
+
+  int64_t vol = volume(in_shape.begin(), in_shape.end());


If it's iterable, volume should know what to do.

Suggested change

int64_t vol = volume(in_shape.begin(), in_shape.end());

int64_t vol = volume(in_shape);

mzient · 2021-06-17T14:23:59Z

dali/kernels/common/split_shape_test.cc

+  TensorShape<> sh(10, 10, 10);
+  std::vector<int> split_factor = {1, 1, 1};
+
+  split_shape(split_factor, sh, 3, 1000);  // minimum volume is bigger than the input volume


Nitpick ;)

Suggested change

split_shape(split_factor, sh, 3, 1000); // minimum volume is bigger than the input volume

split_shape(split_factor, sh, 3, 1000); // minimum volume is equal than the input volume

Signed-off-by: Joaquin Anton <janton@nvidia.com>

jantonguirao · 2021-06-17T16:08:42Z

!build

dali-automaton · 2021-06-17T16:11:59Z

CI MESSAGE: [2484608]: BUILD STARTED

dali-automaton · 2021-06-18T00:34:35Z

CI MESSAGE: [2484608]: BUILD PASSED

jantonguirao marked this pull request as ready for review June 17, 2021 07:02

jantonguirao assigned mzient Jun 17, 2021

jantonguirao force-pushed the split_shape branch from 17f7922 to 294ee75 Compare June 17, 2021 07:05

banasraf self-assigned this Jun 17, 2021

mzient reviewed Jun 17, 2021

View reviewed changes

jantonguirao force-pushed the split_shape branch from 294ee75 to 9676f7c Compare June 17, 2021 10:57

Add split shape utility

b57f9e4

Signed-off-by: Joaquin Anton <janton@nvidia.com>

jantonguirao force-pushed the split_shape branch from 9676f7c to b57f9e4 Compare June 17, 2021 11:45

mzient reviewed Jun 17, 2021

View reviewed changes

jantonguirao force-pushed the split_shape branch from aa6041b to 648d04d Compare June 17, 2021 13:35

mzient reviewed Jun 17, 2021

View reviewed changes

Code review fixes

66717ff

Signed-off-by: Joaquin Anton <janton@nvidia.com>

jantonguirao force-pushed the split_shape branch from 648d04d to 66717ff Compare June 17, 2021 14:31

mzient approved these changes Jun 17, 2021

View reviewed changes

banasraf approved these changes Jun 17, 2021

View reviewed changes

jantonguirao merged commit ae2b242 into NVIDIA:main Jun 18, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add split shape utility #3062

Add split shape utility #3062

jantonguirao commented Jun 16, 2021 •

edited

Loading

mzient Jun 17, 2021 •

edited

Loading

mzient Jun 17, 2021

jantonguirao Jun 17, 2021

mzient Jun 17, 2021

mzient Jun 17, 2021

mzient Jun 17, 2021

jantonguirao commented Jun 17, 2021

dali-automaton commented Jun 17, 2021

dali-automaton commented Jun 18, 2021

	* The algorithm start splitting from the outtermost dimension until the number of blocks
	* The algorithm starts splitting from the outermost dimension until the number of blocks

		int ndim = in_shape.size();
		assert(static_cast<int>(split_factor.size()) == ndim);

	int64_t vol = volume(in_shape.begin(), in_shape.end());
	int64_t vol = volume(in_shape);

	split_shape(split_factor, sh, 3, 1000); // minimum volume is bigger than the input volume
	split_shape(split_factor, sh, 3, 1000); // minimum volume is equal than the input volume

Add split shape utility #3062

Add split shape utility #3062

Conversation

jantonguirao commented Jun 16, 2021 • edited Loading

Why we need this PR?

What happened in this PR?

mzient Jun 17, 2021 • edited Loading

Choose a reason for hiding this comment

mzient Jun 17, 2021

Choose a reason for hiding this comment

jantonguirao Jun 17, 2021

Choose a reason for hiding this comment

mzient Jun 17, 2021

Choose a reason for hiding this comment

mzient Jun 17, 2021

Choose a reason for hiding this comment

mzient Jun 17, 2021

Choose a reason for hiding this comment

jantonguirao commented Jun 17, 2021

dali-automaton commented Jun 17, 2021

dali-automaton commented Jun 18, 2021

jantonguirao commented Jun 16, 2021 •

edited

Loading

mzient Jun 17, 2021 •

edited

Loading