Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fixed hang in test_hipcub_grid on gfx1030 #183

Merged
merged 1 commit into from
Feb 8, 2022

Conversation

mfep
Copy link
Contributor

@mfep mfep commented Feb 1, 2022

  • grid_barrier uses thread_load for loading values from global memory.
  • thread_load was previously implemented with two inline assembly statements. This gave opportunity to the compiler to inject instructions between the two, in our case it injected the comparison before the s_waitcnt.
  • Changed it to a single assembly statement, thereby no instruction can be inserted between the load and the wait.

@mfep mfep force-pushed the fix-grid-barrier-hang-gfx1030 branch from 4472b87 to 553855d Compare February 1, 2022 16:26
@stanleytsang-amd stanleytsang-amd merged commit 15398a6 into develop Feb 8, 2022
stanleytsang-amd added a commit that referenced this pull request Mar 4, 2022
Co-authored-by: Lőrinc Serfőző <mfep@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants