Apply cuda::barrier and elect_one feedback by bernhardmgruber · Pull Request #6344 · NVIDIA/cccl

bernhardmgruber · 2025-10-27T12:40:03Z

This is a follow-up to #6329 after feedback from @ahendriksen

copy-pr-bot · 2025-10-27T12:40:07Z

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

ahendriksen · 2025-10-27T13:05:30Z

libcudacxx/include/cuda/__memcpy_async/elect_one.h

     const auto uniform_warp_id = __shfl_sync(~0, warp_id, 0); // broadcast from lane 0
     return uniform_warp_id == 0 && cuda::ptx::elect_sync(~0); // elect a leader thread among warp 0
-     ),
-    (::cuda::device::__cuda_elect_sync_is_not_supported_before_SM_90__(); _CCCL_UNREACHABLE();));


Shouldn't the

return threadIdx.x == 0;

go in the else here?

AFAIK PTX ISA is CTK 12.0. CCCL support 12.0 and up, so there is no need for the __cccl_ptx_isa ifdef.

Then I would need to add it twice, as else branch of the NV_IF_TARGET and as else branch of the _CCCL_CUDA_COMPILATION() && __cccl_ptx_isa >= 800

Regarding PTX ISA. I don't know whether the clang CUDA versions we test already support __cccl_ptx_isa >= 800

ok, let's just try

This is a follow-up to NVIDIA#6329 after feedback from ahendriksen

github-actions · 2025-10-30T19:10:16Z

🥳 CI Workflow Results

🟩 Finished in 7h 16m: Pass: 100%/134 | Total: 6d 15h | Max: 5h 10m | Hits: 51%/265638

See results here.

github-project-automation bot added this to CCCL Oct 27, 2025

github-project-automation bot moved this to Todo in CCCL Oct 27, 2025

cccl-authenticator-app bot moved this from Todo to In Progress in CCCL Oct 27, 2025

bernhardmgruber changed the title ~~Small cuda::barrier and elect_one fixes~~ Apply cuda::barrier and elect_one feedback Oct 27, 2025

bernhardmgruber marked this pull request as ready for review October 27, 2025 12:55

bernhardmgruber requested review from a team as code owners October 27, 2025 12:55

bernhardmgruber requested review from gevtushenko, gonidelis and pciolkosz October 27, 2025 12:55

cccl-authenticator-app bot moved this from In Progress to In Review in CCCL Oct 27, 2025

bernhardmgruber mentioned this pull request Oct 27, 2025

Improve cuda::barrier TMA examples and elect_one in DeviceTransform #6329

Merged

ahendriksen reviewed Oct 27, 2025

View reviewed changes

miscco approved these changes Oct 27, 2025

View reviewed changes

This comment has been minimized.

Sign in to view

bernhardmgruber added 2 commits October 30, 2025 12:50

Small cuda::barrier and elect_one fixes

d43a92e

This is a follow-up to NVIDIA#6329 after feedback from ahendriksen

Rework branches

869abbe

bernhardmgruber force-pushed the barrier_fixes branch from 973ef6c to 869abbe Compare October 30, 2025 11:50

This comment has been minimized.

Sign in to view

bernhardmgruber enabled auto-merge (squash) October 30, 2025 18:06

bernhardmgruber merged commit c8cd7bc into NVIDIA:main Oct 30, 2025
288 of 291 checks passed

github-project-automation bot moved this from In Review to Done in CCCL Oct 30, 2025

bernhardmgruber deleted the barrier_fixes branch October 30, 2025 20:57

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Apply cuda::barrier and elect_one feedback#6344

Apply cuda::barrier and elect_one feedback#6344
bernhardmgruber merged 2 commits intoNVIDIA:mainfrom
bernhardmgruber:barrier_fixes

bernhardmgruber commented Oct 27, 2025

Uh oh!

copy-pr-bot bot commented Oct 27, 2025

Uh oh!

ahendriksen Oct 27, 2025

Uh oh!

bernhardmgruber Oct 27, 2025

Uh oh!

bernhardmgruber Oct 27, 2025

Uh oh!

This comment has been minimized.

This comment has been minimized.

github-actions bot commented Oct 30, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

bernhardmgruber commented Oct 27, 2025

Uh oh!

copy-pr-bot bot commented Oct 27, 2025

Uh oh!

ahendriksen Oct 27, 2025

Choose a reason for hiding this comment

Uh oh!

bernhardmgruber Oct 27, 2025

Choose a reason for hiding this comment

Uh oh!

bernhardmgruber Oct 27, 2025

Choose a reason for hiding this comment

Uh oh!

This comment has been minimized.

This comment has been minimized.

github-actions bot commented Oct 30, 2025

🥳 CI Workflow Results

🟩 Finished in 7h 16m: Pass: 100%/134 | Total: 6d 15h | Max: 5h 10m | Hits: 51%/265638

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants