Add option to build with CUDAToolkit and enable_language(CUDA) #336

peterbell10 · 2022-08-15T19:01:46Z

find_package(CUDA) is deprecated in newer versions of cmake. This adds the GLOO_USE_CUDA_TOOLKIT option to build with enable_language(CUDA) and find_package(CUDAToolkit) which are the modern cmake replacements.

cc @malfet

malfet

Sounds good to me, but I wonder how do you plan to test it

facebook-github-bot · 2022-08-15T19:04:02Z

@malfet has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

peterbell10 · 2022-08-15T19:06:09Z

I have pytorch/pytorch#83199 setup to build with my fork of gloo and with GLOO_USE_CUDA_TOOLKIT set.

peterbell10 · 2022-08-15T19:18:35Z

I notice there's no build with CUDA 11 here. Maybe A new CI job for CUDA 11 with GLOO_USE_CUDA_TOOLKIT defined would be useful?

peterbell10 · 2022-08-15T21:18:13Z

@malfet I've got the CI job running. The build log shows:

-- Found CUDAToolkit: /usr/local/cuda/include (found suitable version "11.7.99", minimum required is "7.0")

and also files show Building CUDA object, whereas when using cuda_add_library they show up as Building NVCC (Device) object.

facebook-github-bot · 2022-09-12T15:54:20Z

@malfet has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

malfet · 2022-09-20T18:15:01Z

Got internal approval, landing...

With CUDA-10.2 gone we can finally do it! This PR mostly contains build system related changes, invasive functional ones are to be followed. Among many expected tweaks to the build system, here are few unexpected ones: - Force onnx_proto project to be updated to C++17 to avoid `duplicate symbols` error when compiled by gcc-7.5.0, as storage rule for `constexpr` changed in C++17, but gcc does not seem to follow it - Do not use `std::apply` on CUDA but rely on the built-in variant, as it results in test failures when CUDA runtime picks host rather than device function when `std::apply` is invoked from CUDA code. - `std::decay_t` -> `::std::decay_t` and `std::move`->`::std::move` as VC++ for some reason claims that `std` symbol is ambigious - Disable use of `std::aligned_alloc` on Android, as its `libc++` does not implement it. Some prerequisites: - #89297 - #89605 - #90228 - #90389 - #90379 - #89570 - pytorch/gloo#336 - pytorch/gloo#343 - pytorch/builder@919676f Fixes #56055 Pull Request resolved: #85969 Approved by: https://github.com/ezyang, https://github.com/kulinseth

With CUDA-10.2 gone we can finally do it! This PR mostly contains build system related changes, invasive functional ones are to be followed. Among many expected tweaks to the build system, here are few unexpected ones: - Force onnx_proto project to be updated to C++17 to avoid `duplicate symbols` error when compiled by gcc-7.5.0, as storage rule for `constexpr` changed in C++17, but gcc does not seem to follow it - Do not use `std::apply` on CUDA but rely on the built-in variant, as it results in test failures when CUDA runtime picks host rather than device function when `std::apply` is invoked from CUDA code. - `std::decay_t` -> `::std::decay_t` and `std::move`->`::std::move` as VC++ for some reason claims that `std` symbol is ambigious - Disable use of `std::aligned_alloc` on Android, as its `libc++` does not implement it. Some prerequisites: - pytorch#89297 - pytorch#89605 - pytorch#90228 - pytorch#90389 - pytorch#90379 - pytorch#89570 - pytorch/gloo#336 - pytorch/gloo#343 - pytorch/builder@919676f Fixes pytorch#56055 Pull Request resolved: pytorch#85969 Approved by: https://github.com/ezyang, https://github.com/kulinseth

Add option to build with CUDAToolkit and enable_language(CUDA)

f8ab7a8

facebook-github-bot added the CLA Signed label Aug 15, 2022

malfet approved these changes Aug 15, 2022

View reviewed changes

Fix building with FindCUDA.cmake

ab376af

peterbell10 force-pushed the cudatoolkit branch from 90de735 to 5bc9db0 Compare August 15, 2022 19:50

Add cuda11.7 GLOO_USE_CUDA_TOOLKIT=ON CI job

f355214

peterbell10 force-pushed the cudatoolkit branch from 5bc9db0 to f355214 Compare August 15, 2022 19:54

peterbell10 added 2 commits August 15, 2022 21:19

Update gloo_known_gpu_archs for newer CUDA versions

09b5c8a

Install correct openssl version

5f69b55

peterbell10 force-pushed the cudatoolkit branch from 0374dca to 5f69b55 Compare August 15, 2022 20:44

Merge branch 'main' into cudatoolkit

34267e0

facebook-github-bot closed this in e6d509b Sep 20, 2022

malfet mentioned this pull request Dec 8, 2022

Migrate PyTorch to C++17 pytorch/pytorch#85969

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add option to build with CUDAToolkit and enable_language(CUDA) #336

Add option to build with CUDAToolkit and enable_language(CUDA) #336

Uh oh!

peterbell10 commented Aug 15, 2022

Uh oh!

malfet left a comment

Uh oh!

facebook-github-bot commented Aug 15, 2022

Uh oh!

peterbell10 commented Aug 15, 2022

Uh oh!

peterbell10 commented Aug 15, 2022

Uh oh!

peterbell10 commented Aug 15, 2022

Uh oh!

facebook-github-bot commented Sep 12, 2022

Uh oh!

malfet commented Sep 20, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Add option to build with CUDAToolkit and enable_language(CUDA) #336

Add option to build with CUDAToolkit and enable_language(CUDA) #336

Uh oh!

Conversation

peterbell10 commented Aug 15, 2022

Uh oh!

malfet left a comment

Choose a reason for hiding this comment

Uh oh!

facebook-github-bot commented Aug 15, 2022

Uh oh!

peterbell10 commented Aug 15, 2022

Uh oh!

peterbell10 commented Aug 15, 2022

Uh oh!

peterbell10 commented Aug 15, 2022

Uh oh!

facebook-github-bot commented Sep 12, 2022

Uh oh!

malfet commented Sep 20, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants