Skip to content
This repository has been archived by the owner on Mar 21, 2024. It is now read-only.

Fix reduce by key tile state for Pascal #715

Conversation

gevtushenko
Copy link
Collaborator

Last week's PR introduced a regression for direct users of ReduceByKeyScanTileState on pre-Volta architectures. Delay should contain thread fence in pre-Volta cases in order to prevent hoisting.

gevtushenko added a commit to gevtushenko/thrust that referenced this pull request Jun 13, 2023
@gevtushenko gevtushenko added the testing: gpuCI in progress Started gpuCI testing. label Jun 13, 2023
@gevtushenko gevtushenko added testing: gpuCI passed Passed gpuCI testing. and removed testing: gpuCI in progress Started gpuCI testing. labels Jun 13, 2023
@gevtushenko gevtushenko merged commit f76fbda into NVIDIA:main Jun 13, 2023
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
testing: gpuCI passed Passed gpuCI testing.
Projects
Archived in project
Development

Successfully merging this pull request may close these issues.

3 participants