[ROCm] enable nvfuser #82498
✅ No Failures (0 Pending) as of commit 6bbe7a7. There are no failures yet. This comment was automatically generated by Dr. CI.
@davidberard98 added the ciflow/periodic label since this is where we run the slow tests for ROCm. It revealed an unexpected success.
Should be fixed by 6bbe7a7. I unskipped the few NVFuser tests you found in the logs (see 47da32a), and they passed. Now just waiting on another round of CI.
@davidberard98 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.
@davidberard98 I hope the Phabricator build is going well. All CI is green, including ROCm slow.
Looks like mostly mechanical changes. I went through most files except the ROCm-specific runtime headers.
A nitpick on the cmake file: guard the ROCm header under USE_ROCM. Not sure if there's a similar thing we can do with build_variables.bzl.
@jeffdaily looks good, let me know if you want to wait on anything before landing. Otherwise I'll land later today (or feel free to land it yourself).
@jjsjann123 are there more instances of nv-specific asm you guys are using? What about bfloat16/fp16 conversions that also use inline asm?
@pytorchbot merge
@pytorchbot successfully started a merge job.
Addresses comment in #82498 as a follow-up PR. #82498 (comment)
Pull Request resolved: #84312
Approved by: https://github.com/jjsjann123

Summary:
### Description
The nvfuser is enabled for ROCm.
### Testing
CI label ciflow/trunk covers the newly enabled ROCm functionality as well as any CUDA regressions caused by these changes.
Pull Request resolved: #82498
Approved by: https://github.com/jjsjann123, https://github.com/davidberard98
Test Plan: contbuild & OSS CI, see https://hud.pytorch.org/commit/pytorch/pytorch/d09486ab233284e9f298e45a43977fed8f075fe4
Original Phabricator Test Plan: Imported from GitHub, without a `Test Plan:` line.
Reviewed By: mehtanirav
Differential Revision: D39073112
Pulled By: mehtanirav
fbshipit-source-id: c12e759b77589c7191ed2fe4cc16bbcdc1ae86bd

Summary:
Addresses comment in #82498 as a follow-up PR. #82498 (comment)
Pull Request resolved: #84312
Approved by: https://github.com/jjsjann123
Test Plan: contbuild & OSS CI, see https://hud.pytorch.org/commit/pytorch/pytorch/6efadf7e7e6655b543b5a9819b6e2eac2d76f09c
Reviewed By: mehtanirav
Differential Revision: D39213490
fbshipit-source-id: c6158d4880c0614289c23f5af4823773a9545dd1

### Description
The nvfuser is enabled for ROCm.
### Testing
CI label ciflow/trunk covers the newly enabled ROCm functionality as well as any CUDA regressions caused by these changes.
Pull Request resolved: pytorch/pytorch#82498
Approved by: https://github.com/jjsjann123, https://github.com/davidberard98

Addresses comment in #82498 as a follow-up PR. pytorch/pytorch#82498 (comment)
Pull Request resolved: pytorch/pytorch#84312
Approved by: https://github.com/jjsjann123
In preparation for #89621. Partial reverts of #82498 and #86369. Pull Request resolved: #92182 Approved by: https://github.com/davidberard98
Description
The nvfuser is enabled for ROCm.
Testing
CI label ciflow/trunk covers the newly enabled ROCm functionality as well as any CUDA regressions caused by these changes.