[XLA] Add fast path cases for common scatter and gather operations #15185

Merged

caisq merged 2 commits into tensorflow:master from DavidNorman:add-ta-scatter-gather-fast-paths

Dec 14, 2017

Contributor

DavidNorman commented Dec 7, 2017

This change checks whether the indices vector passed to a scatter or gather operation is a constant, and takes a fast path when that vector is a zero-based incrementing sequence (0, 1, 2, ...).

This is quite a common case because of tensor-array stack and unstack.
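As a rough illustration of the check described above, here is a minimal standalone sketch. The helper name `IsDenseZeroBasedRange` and its parameters are hypothetical and do not come from the PR; the actual change obtains the indices through `ctx->ConstantInputAsIntVector` and compares the index count against `value_shape.dim_size(0)`.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical helper (not part of the PR): returns true when `indices` is
// exactly 0, 1, ..., expected_count - 1, meaning the scatter or gather touches
// every row of the leading dimension, in order.
bool IsDenseZeroBasedRange(const std::vector<int64_t>& indices,
                           int64_t expected_count) {
  assert(expected_count >= 0);
  // The index count must match the size of the leading dimension.
  if (static_cast<int64_t>(indices.size()) != expected_count) return false;
  // Each index must equal its own position.
  for (int64_t i = 0; i < expected_count; ++i) {
    if (indices[i] != i) return false;
  }
  return true;
}
```

When this predicate holds, the per-index loop can in principle be replaced by a single whole-array operation, which is the idea behind the fast path.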


          Add fast path cases for common scatter and gather operations

aad9733

Collaborator

tensorflow-jenkins commented Dec 7, 2017

Can one of the admins verify this patch?

googlebot added the cla: yes label

Contributor Author

DavidNorman commented Dec 7, 2017

Although existing unit tests cover this change, do you think I should add an explicit device-targeted test that demonstrates it working? It is hard to check that the fast path has been used without checking the form of the XLA HLO graph, but this could be done in the XLA CC tests.

caisq requested a review from hawkinsp

December 7, 2017 16:13

caisq self-assigned this

caisq added the awaiting review label

hawkinsp reviewed

Contributor

hawkinsp left a comment

Thanks for the PR!

Sorry for the slow response: I missed the notification email about this change.

Could you also please verify that the new cases are tested by tensorflow/compiler/tests/tensor_array_ops_test.py, and extend the tests to cover them if not?

tensorflow/compiler/tf2xla/kernels/tensor_array_ops.cc Outdated

+                  std::vector<int64> const_indices;
+                  Status status = ctx->ConstantInputAsIntVector(1, &const_indices);
+                  if (status.ok()) {
+                    bool is_simple_gather = true;

Contributor

hawkinsp Dec 12, 2017

This would be more readable if you added a comment defining what a "simple" gather is.

Perhaps use a more descriptive name, maybe "gather_is_dense_slice"?

tensorflow/compiler/tf2xla/kernels/tensor_array_ops.cc Outdated

+                  std::vector<int64> const_indices;
+                  Status status = ctx->ConstantInputAsIntVector(1, &const_indices);
+                  if (status.ok() && num_indices==value_shape.dim_size(0)) {

Contributor

hawkinsp Dec 12, 2017

Nit: Add a space before and after "==" for consistency.

tensorflow/compiler/tf2xla/kernels/tensor_array_ops.cc Outdated

@@ -352,30 +376,50 @@ class TensorArrayScatterOp : public XlaOpKernel {
                   const xla::ComputationDataHandle value = ctx->Input(2);
                   const xla::ComputationDataHandle flow = ctx->Input(3);
-                  auto slice_dims = value_shape.dim_sizes();
-                  slice_dims[0] = 1LL;
+                  bool is_simple = false;

Contributor

hawkinsp Dec 12, 2017

Same here. Add a comment describing what "simple" means here.

tensorflow/compiler/tf2xla/kernels/tensor_array_ops.cc Outdated


		if (is_simple) {
		ta = b->Add(ta, value);

Contributor

hawkinsp Dec 12, 2017

Nit: Remove blank line.
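To show why a single `Add` can stand in for a scatter in the "simple" case, here is a small standalone sketch using plain vectors rather than XLA handles. It assumes the general path adds row `i` of `value` into row `indices[i]` of the tensor array; that assumption is not confirmed by the excerpts quoted in this review, and the names `Matrix`, `ScatterAddRows`, and `AddAll` are hypothetical.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

using Matrix = std::vector<std::vector<float>>;

// Assumed general path: add row i of `value` into row indices[i] of `ta`.
Matrix ScatterAddRows(Matrix ta, const std::vector<int64_t>& indices,
                      const Matrix& value) {
  assert(indices.size() == value.size());
  for (std::size_t i = 0; i < indices.size(); ++i) {
    assert(indices[i] >= 0 &&
           static_cast<std::size_t>(indices[i]) < ta.size());
    std::vector<float>& dst = ta[static_cast<std::size_t>(indices[i])];
    assert(dst.size() == value[i].size());
    for (std::size_t j = 0; j < dst.size(); ++j) dst[j] += value[i][j];
  }
  return ta;
}

// Fast path: one elementwise add over the whole array, analogous to the
// `ta = b->Add(ta, value)` line in the diff.
Matrix AddAll(Matrix ta, const Matrix& value) {
  assert(ta.size() == value.size());
  for (std::size_t i = 0; i < ta.size(); ++i) {
    assert(ta[i].size() == value[i].size());
    for (std::size_t j = 0; j < ta[i].size(); ++j) ta[i][j] += value[i][j];
  }
  return ta;
}
```

With indices `{0, 1, 2}` and three rows, both functions produce the same result; with any other ordering or a partial index list they can differ, which is why the fast path is guarded by the constant-indices check.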

tensorflow/compiler/tf2xla/kernels/tensor_array_ops.cc Outdated

		}

Contributor

hawkinsp Dec 12, 2017

Remove extra blank line.

Contributor Author

DavidNorman commented Dec 12, 2017

Thanks for the comments. I will check the coverage.

I'm not sure that tensor_array_ops_test actually tests much of the XLA device-side code. Last time I looked, it was mostly being compiled by the CPU constant removal code. This change certainly passes the tests in that file, but maybe only because the HLO graphs are optimized down to constants before they are compiled.

Let me get back to you tomorrow.


          Updates following code review

Contributor Author

DavidNorman commented Dec 13, 2017

Good news: both the scatter and gather changes are hit by the tensor_array_ops_test set of tests.

Contributor Author

DavidNorman commented Dec 13, 2017

I think I'll ignore those last 2 comments 😆

hawkinsp approved these changes

Contributor

hawkinsp left a comment

Thanks for the PR!

hawkinsp added awaiting testing (then merge) and removed awaiting review labels

Contributor

caisq commented Dec 14, 2017

@tensorflow-jenkins test this please

caisq added the kokoro:force-run label

kokoro-team removed the kokoro:force-run label

caisq merged commit 2bb302e into tensorflow:master
