Add GPU implementation of SparseReshape #47251
Conversation
@@ -94,7 +94,7 @@ def testPropagatesFullyKnownDenseShapeWhenShapePartiallyKnown(self):
       self.assertAllEqual((2, 3 * 4), sp_output.shape)

   def testSameShape(self):
-    with self.session(use_gpu=False) as sess:
+    with self.session(use_gpu=True) as sess:
You no longer need to set use_gpu=True explicitly; it defaults to True.
GPU_1D_KERNEL_LOOP(sparse_index, nnz) {
  const Tindex* input_index = &input_indices[sparse_index * input_rank];
  Tindex* output_index = &output_indices[sparse_index * output_rank];
  Tindex dense_index = 0;
  // Flatten input index from slowest- to fastest-changing dimension.
  for (int i = 0; i < input_rank; ++i) {
    dense_index = dense_index * input_shape[i] + input_index[i];
  }
  // Compute output index from fastest- to slowest-changing dimension.
  for (int i = output_rank; i-- > 0;) {
    Tindex output_size = output_shape[i];
    output_index[i] = dense_index % output_size;
    dense_index /= output_size;
  }
Do you need to care about integer overflow when the indices are int32? Maybe always use int64 for dense_index?
    dense_index = dense_index * input_shape[i] + input_index[i];
  }
  // Compute output index from fastest- to slowest-changing dimension.
  for (int i = output_rank; i-- > 0;) {
Please use the more idiomatic i >= 0; i--. :)
auto config = GetGpuLaunchConfig(nnz, device);
return GpuLaunchKernel(ReshapeSparseTensorKernel<int64>, config.block_count,
                       config.thread_per_block, 0, device.stream(), nnz,
                       /* input_rank = */ input_rank,
Please spell these as /*input_rank=*/; our internal tooling will then check that the param names match up.
- Remove use_gpu=True because it is already the default.
- Use int64 for dense_index inside the kernel to avoid integer overflow.
- Change reverse for-loop style.
- Reformat inline comments so internal tooling picks them up.
This follows #46275.
cc @nluehr