[CUDA] Correctly set CUDA default architecture #84017

jhuber6 · 2024-03-05T13:46:36Z

Summary:
We already had a special CUDA default that better tracked the state as
of modern CUDA installations. Recently this was bumped up to sm_52,
but there was a location that wasn't respecting this. Fix that.

llvmbot · 2024-03-05T13:48:13Z

@llvm/pr-subscribers-clang

@llvm/pr-subscribers-clang-driver

Author: Joseph Huber (jhuber6)

Changes

Summary:
We already had a special CUDA default that better tracked the state as
of modern CUDA installations. Recently this was bumped up to sm_52,
but there was a location that wasn't respecting this. Fix that.

Full diff: https://github.com/llvm/llvm-project/pull/84017.diff

1 Files Affected:

(modified) clang/lib/Driver/Driver.cpp (+1-1)

diff --git a/clang/lib/Driver/Driver.cpp b/clang/lib/Driver/Driver.cpp
index de8ceb2f0898bb..cecd34acbc92c0 100644
--- a/clang/lib/Driver/Driver.cpp
+++ b/clang/lib/Driver/Driver.cpp
@@ -3234,7 +3234,7 @@ class OffloadingActionBuilder final {
     CudaActionBuilder(Compilation &C, DerivedArgList &Args,
                       const Driver::InputList &Inputs)
         : CudaActionBuilderBase(C, Args, Inputs, Action::OFK_Cuda) {
-      DefaultCudaArch = CudaArch::SM_35;
+      DefaultCudaArch = CudaArch::CudaDefault;
     }
 
     StringRef getCanonicalOffloadArch(StringRef ArchStr) override {
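
To illustrate why the one-line change matters, here is a minimal, self-contained sketch of the pattern. It is a toy model, not clang's code: the enum below only mirrors the names that appear in the diff, `archName`, `BuilderBefore`, and `BuilderAfter` are hypothetical, and it assumes `CudaDefault` is an alias for `SM_52`, as the summary describes.

```cpp
#include <string_view>

// Toy stand-in for clang's CudaArch enum. Only the names from the diff are
// modelled; CudaDefault is assumed to alias SM_52, per the PR summary.
enum class CudaArch { SM_35, SM_52, CudaDefault = SM_52 };

// Hypothetical helper that maps the toy enum to an architecture string.
constexpr std::string_view archName(CudaArch A) {
  switch (A) {
  case CudaArch::SM_35:
    return "sm_35";
  case CudaArch::SM_52:
    return "sm_52";
  }
  return "unknown";
}

// Before the patch: the builder hard-codes its own default, so it is left
// behind when the shared default is bumped.
struct BuilderBefore {
  CudaArch DefaultCudaArch = CudaArch::SM_35;
};

// After the patch: the builder refers to the shared alias and follows it.
struct BuilderAfter {
  CudaArch DefaultCudaArch = CudaArch::CudaDefault;
};

static_assert(archName(BuilderBefore{}.DefaultCudaArch) == "sm_35");
static_assert(archName(BuilderAfter{}.DefaultCudaArch) == "sm_52");
```

With the alias in place, a future bump of the default only needs to touch the enum, and every user of `CudaDefault` picks it up.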

hahnjo · 2024-03-06T07:37:17Z

clang/test/Driver/cuda-omp-unsupported-debug-options.cu

-// RUN: not %clang -### --target=x86_64-linux-gnu -fopenmp=libomp -fopenmp-targets=nvptx64-nvidia-cuda -c %s \
+// RUN: %clang -### --target=x86_64-linux-gnu --offload-arch=sm_52 -nogpulib -nogpuinc -fopenmp=libomp -c %s \


Please note that this completely dropped -fopenmp-targets=nvptx64-nvidia-cuda, so the test likely lost coverage...

jhuber6 · 2024-03-06

$ grep -R ../clang/test/ -e '-fopenmp-targets=nvptx64' -l | sort | uniq | wc -l
94

hahnjo · 2024-03-06

Sure, but this one is explicitly testing the (in)compatibility of debug options and these run lines are supposed to test them together with OpenMP offloading. Now they don't anymore...

jhuber6 · 2024-03-06

--offload-arch and -fopenmp-targets= are semantically equivalent in this context. They both enable the OpenMP offloading toolchain targeting NVPTX. The only difference is that --offload-arch sets the architecture manually, while -fopenmp-targets= looks up the architecture through the nvptx-arch tool, which is extra work that I don't think is necessary for a test like this.
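
The difference described above can be sketched as a small toy model. This is not the driver's implementation: `resolveOffloadArch` and its parameters are hypothetical, the detection callback merely stands in for a tool such as nvptx-arch, and the fallback to a default when detection finds nothing is an assumption of the sketch.

```cpp
#include <functional>
#include <optional>
#include <string>

// Hypothetical helper, not a clang API. It models the two paths discussed:
// an explicitly supplied architecture is used as-is, otherwise a detection
// step runs, and (by assumption) a default is used if detection finds nothing.
std::string resolveOffloadArch(
    const std::optional<std::string> &ExplicitArch,
    const std::function<std::optional<std::string>()> &DetectArch,
    const std::string &DefaultArch) {
  if (ExplicitArch)
    return *ExplicitArch; // Manual path: no detection work is done.
  if (std::optional<std::string> Detected = DetectArch())
    return *Detected; // Detection path: result depends on the machine.
  return DefaultArch; // Assumed fallback when nothing was detected.
}
```

In this model, passing the architecture explicitly skips the detection step entirely, which is the "extra work" the comment refers to, and makes the result independent of the machine running the test.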

jhuber6 requested review from Artem-B, hahnjo, jlebar and MaskRay March 5, 2024 13:46

jhuber6 mentioned this pull request Mar 5, 2024

[clang] Add cuda-path arguments to failing test #84008

Closed

llvmbot added the clang and clang:driver labels Mar 5, 2024

Artem-B approved these changes Mar 5, 2024

[CUDA] Correctly set CUDA default architecture

d1bdd2a

Summary: We already had a special CUDA default that better tracked the state as of modern CUDA installations. Recently this was bumped up to `sm_52`, but there was a location that wasn't respecting this. Fix that. Fix tests

jhuber6 force-pushed the SetCUDADefault branch from 6c71951 to d1bdd2a March 6, 2024 00:10

jhuber6 merged commit 433b711 into llvm:main Mar 6, 2024
4 checks passed

hahnjo reviewed Mar 6, 2024
