Add 9.0a to cpp_extension supported compute archs #110587

dzhulgakov · 2023-10-05T06:21:58Z

There's an extended compute capability 9.0a for Hopper that was introduced in Cuda 12.0: https://docs.nvidia.com/cuda/archive/12.0.0/cuda-compiler-driver-nvcc/index.html#gpu-feature-list

E.g. Cutlass leverages it: https://github.com/NVIDIA/cutlass/blob/5f13dcad781284678edafa3b8d108120cfc6a6e4/python/cutlass/emit/pytorch.py#L684

This adds it to the list of permitted architectures to use in cpp_extension directly.

pytorch-bot · 2023-10-05T06:22:01Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/110587

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (2 Unrelated Failures)

As of commit 4e6b137 with merge base c36b31d ():

UNSTABLE - The following jobs failed but were likely due to flakiness present on trunk and has been marked as unstable:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

ezyang · 2023-10-05T14:49:33Z

@pytorchbot merge

pytorchmergebot · 2023-10-05T14:51:36Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

…123243) When people build pytorch extensions with cmake, and the GPU supports 9.0a arch as introduced in #110587 , the cmake regex is not updated to recognize the change, leading to cmake breaks like #113948 and #119946 . This PR should fix them. Pull Request resolved: #123243 Approved by: https://github.com/malfet

…ytorch#123243) When people build pytorch extensions with cmake, and the GPU supports 9.0a arch as introduced in pytorch#110587 , the cmake regex is not updated to recognize the change, leading to cmake breaks like pytorch#113948 and pytorch#119946 . This PR should fix them. Pull Request resolved: pytorch#123243 Approved by: https://github.com/malfet

Add 9.0a into cpp_extension support archs

4e6b137

dzhulgakov requested a review from malfet October 5, 2023 06:21

dzhulgakov requested review from fmassa, soumith and ezyang as code owners October 5, 2023 06:21

pytorchbot added the open source label Oct 5, 2023

dzhulgakov added the release notes: build release notes category label Oct 5, 2023

ezyang approved these changes Oct 5, 2023

View reviewed changes

pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Oct 5, 2023

pytorchmergebot added the merging label Oct 5, 2023

Skylion007 mentioned this pull request Oct 5, 2023

Investigate SM90a gencode for H100s facebookresearch/xformers#871

Open

pytorchmergebot added Merged and removed merging labels Oct 5, 2023

pytorchmergebot closed this in a0cea51 Oct 5, 2023

youkaichao mentioned this pull request Apr 3, 2024

[CMake] fix cmake regex to match newly introduced 9.0a architecture #123243

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add 9.0a to cpp_extension supported compute archs #110587

Add 9.0a to cpp_extension supported compute archs #110587

dzhulgakov commented Oct 5, 2023

pytorch-bot bot commented Oct 5, 2023 •

edited

ezyang commented Oct 5, 2023

pytorchmergebot commented Oct 5, 2023

Add 9.0a to cpp_extension supported compute archs #110587

Add 9.0a to cpp_extension supported compute archs #110587

Conversation

dzhulgakov commented Oct 5, 2023

pytorch-bot bot commented Oct 5, 2023 • edited

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/110587

✅ You can merge normally! (2 Unrelated Failures)

ezyang commented Oct 5, 2023

pytorchmergebot commented Oct 5, 2023

Merge started

pytorch-bot bot commented Oct 5, 2023 •

edited