We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
memcpy_async
Refactor memcpy_async to take advantage of 1D TMA instructions when feasible.
The content you are editing has changed. Please copy your edits and refresh the page.
The text was updated successfully, but these errors were encountered:
cp.async.bulk
cuda::memcpy_async
Why was this marked completed? This still has open tasks.
Sorry, something went wrong.
Most likely because github still does not understand issues with more than one PR
griwes
Successfully merging a pull request may close this issue.
Refactor
memcpy_async
to take advantage of 1D TMA instructions when feasible.Tasks
memcpy_async
implementation to support more general src/dest dispatch #57The text was updated successfully, but these errors were encountered: