Add DLPack input support to the ExternalSource operator #2023

Merged

JanuszL merged 4 commits into NVIDIA:master from dlpack_input on Jun 26, 2020

Conversation

@JanuszL
Contributor

JanuszL commented Jun 15, 2020

  • adds the ability to pass DLPack objects to the ExternalSource operator

Signed-off-by: Janusz Lisiecki <jlisiecki@nvidia.com>

Why do we need this PR?

  • It adds DLPack input support to the ExternalSource operator

What happened in this PR?

  • What solution was applied:
    adds DLPack input support to the ExternalSource operator
  • Affected modules and functionalities:
    backend, pipeline, external source
  • Key points relevant for the review:
    NA
  • Validation and testing:
    new tests added
  • Documentation (including examples):
    docs updated

JIRA TASK: [DALI-1465]
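
A minimal usage sketch of what this PR enables, based on the test snippets further down in this conversation. The module path, the pipe object, and the name-based feed_input call are assumptions and may differ in your DALI version:

import torch
from torch.utils.dlpack import to_dlpack
from nvidia.dali.backend import TensorListCPU

arr = torch.rand(4, 100, 100, 3)                     # a batch of 4 HWC images
tensor_list = TensorListCPU(to_dlpack(arr), "NHWC")  # wrap the DLPack capsule, no copy
pipe.feed_input("data", tensor_list)                 # "data" names an ExternalSource node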

@review-notebook-app

Check out this pull request on ReviewNB.

@JanuszL
Contributor Author

JanuszL commented Jun 15, 2020

!build

@JanuszL
Contributor Author

JanuszL commented Jun 15, 2020

It goes after #1997

@dali-automaton
Collaborator

CI MESSAGE: [1395708]: BUILD STARTED

@dali-automaton
Collaborator

CI MESSAGE: [1395708]: BUILD FAILED

@JanuszL force-pushed the dlpack_input branch 2 times, most recently from 184b06e to 7461691 on June 17, 2020 19:10
@JanuszL
Contributor Author

JanuszL commented Jun 17, 2020

!build

@dali-automaton
Collaborator

CI MESSAGE: [1403482]: BUILD STARTED

@dali-automaton
Collaborator

CI MESSAGE: [1403482]: BUILD FAILED

@JanuszL
Contributor Author

JanuszL commented Jun 18, 2020

!build

@dali-automaton
Collaborator

CI MESSAGE: [1405230]: BUILD STARTED

@dali-automaton
Collaborator

CI MESSAGE: [1405230]: BUILD FAILED

@JanuszL
Contributor Author

JanuszL commented Jun 18, 2020

!build

@dali-automaton
Collaborator

CI MESSAGE: [1405462]: BUILD STARTED

@dali-automaton
Collaborator

CI MESSAGE: [1405462]: BUILD PASSED

 template <typename TensorType>
-TensorShape<> FillTensorData(const py::object object, TensorType *t, int device_id, string layout) {
+void FillTensorCudaArrayInterfaceData(const py::object object, TensorType *batch,
+int device_id, string layout) {
Contributor

nitpick: align

Contributor Author

Done

auto dlm_tensor_ptr = DLMTensorRawPtrFromCapsule(capsule, false);
const auto &dl_tensor = dlm_tensor_ptr->dl_tensor;
list.append(dl_tensor.ctx.device_type == kDLGPU);
if (dl_tensor.ctx.device_type != kDLGPU && dl_tensor.ctx.device_type != kDLCPU) {
Contributor

How about:

list.append(dl_tensor.ctx.device_type == kDLGPU || dl_tensor.ctx.device_type == kDLCPU);
list.append(dl_tensor.ctx.device_type == kDLGPU);

Contributor Author

Done

@@ -292,10 +399,26 @@ void ExposeTensor(py::module &m) {
)code");

py::class_<Tensor<GPUBackend>>(m, "TensorGPU")
.def(py::init([](py::capsule &capsule, string layout = "") {
auto t = new Tensor<GPUBackend>;
FillTensorDlpackInterfaceData(capsule, t, layout);
Contributor

I suggest we make a shorter name FillTensorFromDlPack

Contributor Author

Done

 .def(py::init([](const py::object object, string layout = "", int device_id = -1) {
 auto t = new Tensor<GPUBackend>;
-auto shape = FillTensorData(object, t, device_id, layout);
-t->Resize(shape);
+FillTensorCudaArrayInterfaceData(object, t, device_id, layout);
Contributor

FillTensorFromCudaArray?

Contributor Author

Done
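
For illustration, the binding above lets TensorGPU wrap any object that exposes the CUDA array interface. A hedged sketch; the import path and constructor signature are assumptions based on this PR's bindings:

import cupy as cp
from nvidia.dali.backend import TensorGPU

arr = cp.random.rand(100, 100, 3).astype(cp.float32)  # object with __cuda_array_interface__
t = TensorGPU(arr, "HWC")                             # zero-copy wrap of GPU memory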

tensor_list = TensorListCPU(to_dlpack(arr), "NHWC")
dali_torch_tensor = convert_to_torch(tensor_list, device=arr.device, dtype=arr.dtype)
assert(torch.all(arr.eq(dali_torch_tensor)))
test_dlpack_tensor_list_cpu_direct_creation()
Contributor

remove?

Contributor Author

Done

* persist while it is in use by the Tensor.
*/
DLL_PUBLIC inline void ShareData(void *ptr, size_t bytes) {
ShareData(shared_ptr<void>(ptr, [](void *) {}), bytes, uniform_list_shape(1, {0}));
Contributor

Why this particular shape? What's wrong with TensorListShape<>{}?

Contributor Author

Done

@@ -219,14 +217,14 @@ class DLL_PUBLIC TensorList : public Buffer<Backend> {
* the user to manage the lifetime of the allocation such that it
* persist while it is in use by the Tensor.
*/
-DLL_PUBLIC inline void ShareData(void *ptr, size_t bytes) {
+inline void ShareData(const shared_ptr<void> &ptr, size_t bytes, const TensorListShape<> &shape) {
Contributor

For completeness - maybe there should also be a type?

Suggested change
inline void ShareData(const shared_ptr<void> &ptr, size_t bytes, const TensorListShape<> &shape) {
inline void ShareData(const shared_ptr<void> &ptr, size_t bytes, const TensorListShape<> &shape, const TypeInfo &type = {}) {

Contributor

...this would be in line with the new Resize.

Contributor Author

I wanted to align the API with the Tensor API. It doesn't accept type in the ShareData function.

Contributor Author

Done

@JanuszL
Contributor Author

JanuszL commented Jun 19, 2020

!build

@dali-automaton
Collaborator

CI MESSAGE: [1409136]: BUILD STARTED

@dali-automaton
Collaborator

CI MESSAGE: [1409136]: BUILD PASSED

@JanuszL
Contributor Author

JanuszL commented Jun 25, 2020

!build

@dali-automaton
Collaborator

CI MESSAGE: [1423002]: BUILD STARTED

@dali-automaton
Collaborator

CI MESSAGE: [1423002]: BUILD FAILED

Comment on lines +203 to +204
sample_dim, layout_str, pipe_handle->copy_stream, true,
is_pinned);
Contributor

Nitpick: indentation.

@JanuszL
Contributor Author

JanuszL commented Jun 25, 2020

!build

@dali-automaton
Collaborator

CI MESSAGE: [1423041]: BUILD STARTED

- adds an ability to pass DLPack object in the ExternalSource operator
- sorts CPU work on ExternalSource by size

Signed-off-by: Janusz Lisiecki <jlisiecki@nvidia.com>
Signed-off-by: Janusz Lisiecki <jlisiecki@nvidia.com>
@dali-automaton
Collaborator

CI MESSAGE: [1423041]: BUILD FAILED

@@ -219,14 +217,15 @@ class DLL_PUBLIC TensorList : public Buffer<Backend> {
* the user to manage the lifetime of the allocation such that it
* persist while it is in use by the Tensor.
*/
-DLL_PUBLIC inline void ShareData(void *ptr, size_t bytes) {
+inline void ShareData(const shared_ptr<void> &ptr, size_t bytes, const TensorListShape<> &shape,
+                      const TypeInfo &type = TypeInfo::Create<NoType>()) {
Contributor

Isn't that sufficient?

Suggested change
const TypeInfo &type = TypeInfo::Create<NoType>()) {
const TypeInfo &type = {}) {

Contributor Author

Done

@@ -57,6 +57,10 @@ class CachingList {
return full_data_.empty();
}

T &PeakFront() {
Contributor

Suggested change
T &PeakFront() {
T &PeekFront() {

Contributor Author

Done

Comment on lines 229 to 230
output_desc[0].shape = tl_data_.PeakFront()->shape();
output_desc[0].type = tl_data_.PeakFront()->type();
Contributor

Suggested change
output_desc[0].shape = tl_data_.PeakFront()->shape();
output_desc[0].type = tl_data_.PeakFront()->type();
output_desc[0].shape = tl_data_.PeekFront()->shape();
output_desc[0].type = tl_data_.PeekFront()->type();

Contributor Author

Done

Comment on lines 130 to 131
void CheckStrides(TStrides &strides, TShape &shape, size_t type_size,
size_t strides_size, size_t shape_size) {
Contributor

Suggested change
void CheckStrides(TStrides &strides, TShape &shape, size_t type_size,
size_t strides_size, size_t shape_size) {
void CheckContiguousTensor(const TStrides &strides, size_t num_strides, const TShape &shape,
size_t num_extents, size_t element_size) {

...and add an overload:

void CheckContiguousTensor(const TStrides &strides, const TShape &shape, size_t element_size) {
  CheckContiguousTensor(strides, dali::size(strides), shape, dali::size(shape), element_size);
}

Contributor Author

Done
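
A Python analogue of the check being renamed here, as a sketch of the intent rather than DALI's actual code: the byte strides must describe a densely packed, row-major tensor.

def check_contiguous_tensor(strides, shape, element_size):
    # Walk from the innermost dimension outward, accumulating the stride a
    # densely packed layout would have, and compare with the actual stride.
    expected = element_size
    for extent, stride in zip(reversed(shape), reversed(strides)):
        assert stride == expected, "non-contiguous: stride %d, expected %d" % (stride, expected)
        expected *= extent

check_contiguous_tensor((300, 3, 1), (100, 100, 3), 1)  # a contiguous uint8 HWC image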

Comment on lines 117 to 121
std::vector<Index> tensor_shape(shape.size()-1);
for (int i = 1; i < shape.size(); ++i) {
tensor_shape[i-1] = shape[i];
}
return uniform_list_shape(shape[0], tensor_shape);
Contributor

Suggested change
std::vector<Index> tensor_shape(shape.size()-1);
for (int i = 1; i < shape.size(); ++i) {
tensor_shape[i-1] = shape[i];
}
return uniform_list_shape(shape[0], tensor_shape);
return uniform_list_shape(shape[0], shape.last(shape.size()-1));

Contributor Author

Done
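
The conversion under discussion, sketched in Python: a dense array of shape [N, d1, ..., dk] is reinterpreted as N samples of shape [d1, ..., dk], which is what the one-line uniform_list_shape call produces.

def batch_to_sample_shapes(shape):
    # The outermost extent becomes the number of samples; the remaining
    # extents form the per-sample shape, uniform across the batch.
    return [list(shape[1:])] * shape[0]

assert batch_to_sample_shapes([4, 100, 100, 3]) == [[100, 100, 3]] * 4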

}

template<typename SrcBackend>
TensorShape<> CreateShape(TensorShape<> &shape, Tensor<SrcBackend>*) {
Contributor

Suggested change
TensorShape<> CreateShape(TensorShape<> &shape, Tensor<SrcBackend>*) {
const TensorShape<> &ConvertShape(const TensorShape<> &shape, Tensor<SrcBackend>*) {

Contributor Author

Done

Comment on lines 587 to 588
CheckStrides(info.strides, info.shape, info.itemsize, info.strides.size(),
info.shape.size());
Contributor

Suggested change
CheckStrides(info.strides, info.shape, info.itemsize, info.strides.size(),
info.shape.size());
CheckStrides(info.strides, info.shape, info.itemsize);

The function should take care of obtaining the sizes.

Contributor Author

Done

Comment on lines 347 to 348
CheckStrides(info.strides, info.shape, info.itemsize, info.strides.size(),
info.shape.size());
Contributor

Suggested change
CheckStrides(info.strides, info.shape, info.itemsize, info.strides.size(),
info.shape.size());
CheckStrides(info.strides, info.shape, info.itemsize);

Contributor Author

Done

" whereas densely packed data of this shape would have a stride ", stride_from_shape));
stride_from_shape *= shape[i];
}
CheckStrides(strides, shape, type.size(), strides.size(), shape.size());
Contributor

Suggested change
CheckStrides(strides, shape, type.size(), strides.size(), shape.size());
CheckStrides(strides, shape, type.size());

Contributor Author

Done

Comment on lines 283 to 284
It returns a two element tuple, if this is a valid DLPack object, and if data
resides on the GPU.
Contributor

Suggested change
It returns a two element tuple, if this is a valid DLPack object, and if data
resides on the GPU.
It returns a tuple of two boolean values: one indicating if this is a valid DLPack object, and the other if the data
resides on the GPU.

Contributor Author

Done
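
Hypothetical usage of the check documented above. The helper name dlpack_check is illustrative, not the PR's actual API, and the constructor signatures are assumptions based on the bindings in this PR:

is_dlpack, is_gpu = dlpack_check(capsule)
if is_dlpack:
    # Route the capsule to the matching backend without copying.
    tl = TensorListGPU(capsule, "NHWC") if is_gpu else TensorListCPU(capsule, "NHWC")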

@JanuszL
Contributor Author

JanuszL commented Jun 25, 2020

!build

@@ -504,17 +523,13 @@ def define_graph(self):

def iter_setup(self):
if use_list:
-batch_data = [random_array([100, 100, 3]) for _ in range(self.batch_size)]
+batch_data = [cast_to(random_array([100, 100, 3]), datapy.uint8) for _ in range(self.batch_size)]
Contributor

Suggested change
batch_data = [cast_to(random_array([100, 100, 3]), datapy.uint8) for _ in range(self.batch_size)]
batch_data = [cast_to(random_array([100, 100, 3])*255, datapy.uint8) for _ in range(self.batch_size)]

 else:
-batch_data = random_array([self.batch_size, 100, 100, 3])
+batch_data = cast_to(random_array([self.batch_size, 100, 100, 3]), datapy.uint8)
 self.feed_input(self.batch, batch_data)
Contributor

@mzient commented Jun 25, 2020

Suggested change
batch_data = cast_to(random_array([self.batch_size, 100, 100, 3]), datapy.uint8)
batch_data = cast_to(random_array([self.batch_size, 100, 100, 3])*256, datapy.uint8)
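
The reason for the scaling in both suggestions, assuming random_array returns floats in [0, 1): a direct uint8 cast truncates nearly every value to 0, so the test data would be degenerate. For example:

import numpy as np

arr = np.random.random([100, 100, 3])    # values in [0, 1)
assert arr.astype(np.uint8).max() == 0   # truncation zeroes everything
scaled = (arr * 255).astype(np.uint8)    # spans the uint8 range as intended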

@dali-automaton
Collaborator

CI MESSAGE: [1423418]: BUILD STARTED

@dali-automaton
Collaborator

CI MESSAGE: [1423418]: BUILD FAILED

Signed-off-by: Janusz Lisiecki <jlisiecki@nvidia.com>
@dali-automaton
Collaborator

CI MESSAGE: [1424173]: BUILD STARTED

@dali-automaton
Collaborator

CI MESSAGE: [1424173]: BUILD PASSED

Signed-off-by: Janusz Lisiecki <jlisiecki@nvidia.com>
@dali-automaton
Collaborator

CI MESSAGE: [1425810]: BUILD STARTED

@dali-automaton
Collaborator

CI MESSAGE: [1425810]: BUILD PASSED

@JanuszL merged commit 933636d into NVIDIA:master on Jun 26, 2020
@JanuszL deleted the dlpack_input branch on June 26, 2020 13:48