Skip to content
This repository was archived by the owner on Aug 15, 2025. It is now read-only.

Conversation

amathews-amd
Copy link
Contributor

Fixes pytorch/pytorch#55552
Main PyTorch PR : pytorch/pytorch#56448

@amathews-amd
Copy link
Contributor Author

@jithunnair-amd @pruthvistony @jeffdaily @sunway513 - please review

@amathews-amd amathews-amd marked this pull request as draft April 21, 2021 19:11
Copy link
Contributor

@jeffdaily jeffdaily left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can confirm that the ./manywheel/deploy.sh is able to successfully build the pytorch/manylinux-rocm:4.1 image with these changes.

@amathews-amd amathews-amd marked this pull request as ready for review April 22, 2021 18:06
facebook-github-bot pushed a commit to pytorch/pytorch that referenced this pull request Apr 23, 2021
Summary:
Fixes #55552

* Root-caused issue to MAGMA kernels
* Issue is fixed on master of MAGMA
MAGMA issue: https://bitbucket.org/icl/magma/issues/43/zgetrf_batched-shfl-kernel-failure-seen-on
* Changing PyTorch to use particular commit sha from master of MAGMA project
* ~~Reactivating skipped ROCm tests~~ : We will reactivate tests in a different PR

Corresponding PyTorch builder PR: pytorch/builder#695

Pull Request resolved: #56448

Reviewed By: seemethere

Differential Revision: D27974563

Pulled By: janeyx99

fbshipit-source-id: 25e6f95a20a06d27a5199a623dd7c5db7ca8d6ea
@jithunnair-amd
Copy link
Contributor

@seemethere We have the exact same changes in PR pytorch/pytorch#56448 merged upstream. This PR is to make sure the wheels also have the correct version of Magma built. The only difference from the upstream PR change is the -lhipblas -lhipsparse, which is needed here because we reset the LIB variable in https://github.com/pytorch/builder/pull/695/files#diff-96c476b6de698c152d602bebd47741db50e37bea51d3ce19e61f7d89c37dd371R38

Please merge this PR if it looks okay.

@seemethere seemethere merged commit 0b73152 into pytorch:master Apr 27, 2021
krshrimali pushed a commit to krshrimali/pytorch that referenced this pull request May 19, 2021
Summary:
Fixes pytorch#55552

* Root-caused issue to MAGMA kernels
* Issue is fixed on master of MAGMA
MAGMA issue: https://bitbucket.org/icl/magma/issues/43/zgetrf_batched-shfl-kernel-failure-seen-on
* Changing PyTorch to use particular commit sha from master of MAGMA project
* ~~Reactivating skipped ROCm tests~~ : We will reactivate tests in a different PR

Corresponding PyTorch builder PR: pytorch/builder#695

Pull Request resolved: pytorch#56448

Reviewed By: seemethere

Differential Revision: D27974563

Pulled By: janeyx99

fbshipit-source-id: 25e6f95a20a06d27a5199a623dd7c5db7ca8d6ea
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

ROCM: After enabling Magma, 4 tests are failing in ROCM CI

5 participants