Integrating cuBLASLt into XLA #55518
Conversation
```cpp
      BlasPlansCompatibleType(element_type)) {
    TF_RETURN_IF_ERROR(
        DoBlasPlansAutotune(stream, instr, allocator, gemm_config));
    return {se::blas::kNoAlgorithm};
```
What is the significance of returning `kNoAlgorithm` here?
The cuBLASLt autotuner operates on `se::blas::AlgorithmConfig` instead of the `se::blas::AlgorithmType` used in non-cuBLASLt autotuning. As such, the result of cuBLASLt autotuning is incompatible with the return value of `DoGemmAutotune`, and the `se::blas::kNoAlgorithm` dummy value is returned instead. The outcome of cuBLASLt autotuning is stored in the instance of `BlasPlansAutotuneCacheSingleton`.
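Below is a minimal, self-contained C++ sketch of the pattern described above, not the actual XLA code: `AlgorithmType`, `AlgorithmConfig`, `PlanCache`, `Autotune`, and the cache key are hypothetical stand-ins for `se::blas::AlgorithmType`, `se::blas::AlgorithmConfig`, `BlasPlansAutotuneCacheSingleton`, and `DoBlasPlansAutotune`. It illustrates why a sentinel like `kNoAlgorithm` is returned when the real result lives in a side cache.

```cpp
#include <iostream>
#include <map>
#include <string>

// Hypothetical stand-ins for the se::blas types named above.
using AlgorithmType = int;                     // plain algorithm id
constexpr AlgorithmType kNoAlgorithm = -1;     // sentinel: "no id returned"
struct AlgorithmConfig { std::string plan; };  // richer cuBLASLt-style result

// Hypothetical stand-in for BlasPlansAutotuneCacheSingleton.
std::map<std::string, AlgorithmConfig>& PlanCache() {
  static std::map<std::string, AlgorithmConfig> cache;
  return cache;
}

// Sketch of an autotune entry point whose return type can only carry an
// AlgorithmType. The cuBLASLt-style result is an AlgorithmConfig, so it is
// stored in the side cache and the sentinel is returned instead.
AlgorithmType Autotune(const std::string& key, bool use_cublaslt) {
  if (use_cublaslt) {
    PlanCache()[key] = AlgorithmConfig{"best plan for " + key};
    return kNoAlgorithm;
  }
  return 7;  // legacy path: the tuned algorithm id itself
}

int main() {
  const std::string key = "gemm_f32_128x128";
  if (Autotune(key, /*use_cublaslt=*/true) == kNoAlgorithm) {
    std::cout << "cached: " << PlanCache().at(key).plan << "\n";
  }
}
```

The sentinel keeps the legacy return type intact while the richer cuBLASLt result travels out of band through the cache.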
```cpp
  GemmCacheKey key =
      std::make_tuple(stream->parent(), lhs->shape(), rhs->shape(),
                      instr->shape(), gemm_config.SerializeAsString());
  if (stream->parent()->SupportsBlasPlans() && config.use_cublaslt &&
```
Maybe slightly pedantic, but we ought to check the flag first, lest `SupportsBlasPlans` have some kind of side effect.
`SupportsBlasPlans()` is introduced in this PR and has no side effects. It returns true if the CUDA version is greater than or equal to 11, and false otherwise.
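A hedged sketch of what the reply describes, assuming `SupportsBlasPlans()` is a pure predicate on the CUDA major version (the constant below is an assumption for illustration; the real code would query the runtime). It also shows the reviewer's suggested operand order, which short-circuits on the flag before calling the predicate:

```cpp
#include <iostream>

// Assumed fixed here for illustration; queried at runtime in reality.
constexpr int kCudaMajorVersion = 11;

// Pure predicate, no side effects: true iff the CUDA version is >= 11.
bool SupportsBlasPlans() { return kCudaMajorVersion >= 11; }

int main() {
  const bool use_cublaslt = false;  // e.g. from the xla_gpu_enable_cublaslt flag
  // With the flag first, SupportsBlasPlans() is never evaluated when the
  // feature is off. Since the predicate is pure, both operand orders behave
  // identically, so the reordering is purely defensive style.
  if (use_cublaslt && SupportsBlasPlans()) {
    std::cout << "cuBLASLt GEMM path\n";
  } else {
    std::cout << "legacy cuBLAS GEMM path\n";
  }
}
```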
@philipphack Can you please resolve conflicts? Thank you!
PiperOrigin-RevId: 441800149
FYI @philipphack, this was merged. I'm not sure why this PR hasn't been updated.
It seems auto-merge is not happening, but the changes are now merged into master, so we can close this. Thank you for the PR.
Adds support for the cuBLASLt library for GEMM operations to XLA. The library can be activated by setting the XLA flag `xla_gpu_enable_cublaslt=true`.
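(Usage note: XLA flags of this form are typically passed via the `XLA_FLAGS` environment variable, e.g. `XLA_FLAGS=--xla_gpu_enable_cublaslt=true`; the exact plumbing may vary by TensorFlow version.)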
@SandSnip3r can you run the test before merging?