@visualxu visualxu commented Apr 20, 2022

Fixes #76095, memory cross-border access on the ROCM platform.

@pytorch-bot pytorch-bot bot added the module: rocm AMD GPU support for Pytorch label Apr 20, 2022
@facebook-github-bot
Contributor

Hi @visualxu!

Thank you for your pull request and welcome to our community.

Action Required

In order to merge any pull request (code, docs, etc.), we require contributors to sign our Contributor License Agreement, and we don't seem to have one on file for you.

Process

In order for us to review and merge your suggested changes, please sign at https://code.facebook.com/cla. If you are contributing on behalf of someone else (eg your employer), the individual CLA may not be sufficient and your employer may need to sign the corporate CLA.

Once the CLA is signed, our tooling will perform checks and validations. Afterwards, the pull request will be tagged with CLA signed. The tagging process may take up to 1 hour after signing. Please give it that time before contacting us about it.

If you have received this in error or have any questions, please contact us at cla@fb.com. Thanks!

@facebook-github-bot
Contributor

facebook-github-bot commented Apr 20, 2022

✅ No Failures (0 Pending)

As of commit 5f36deb (more details on the Dr. CI page):

💚 💚 Looks good so far! There are no failures yet. 💚 💚


This comment was automatically generated by Dr. CI (expand for details).

@facebook-github-bot
Contributor

Thank you for signing our Contributor License Agreement. We can now accept your code for this (and any) Meta Open Source project. Thanks!

Collaborator

@jeffdaily jeffdaily left a comment

LGTM. CI passing, as well.

@kit1980
Contributor

kit1980 commented Jun 17, 2022

@pytorchbot rebase

@kit1980
Contributor

kit1980 commented Jun 17, 2022

@visualxu @jeffdaily is this something we still want to merge?

@pytorchmergebot
Collaborator

@pytorchbot successfully started a rebase job. Check the current status here

@pytorchmergebot
Collaborator

Successfully rebased rocm_iluvatar onto refs/remotes/origin/master, please pull locally before adding more changes (for example, via git checkout rocm_iluvatar && git pull --rebase)

@jeffdaily jeffdaily added the ciflow/trunk Trigger trunk jobs on your pull request label Jun 20, 2022
@jeffdaily
Collaborator

> @visualxu @jeffdaily is this something we still want to merge?

Yes. The explanation and fix seem reasonable to me. It wasn't clear to me how this issue was discovered, but shouldn't we fix things that are proven to be broken, even if they are perhaps rare cases?

CI was passing previously. For sanity, I kicked off rocm CI again (because rocm jobs moved to trunk-only, needed manual start for this PR).

@jeffdaily
Collaborator

CI has passed for ROCm. 2 failures for ios-12-5-1 are not related.

@jeffdaily
Collaborator

@pytorchbot rebase

@pytorch-bot

pytorch-bot bot commented Jul 25, 2022

You don't have permissions to rebase this PR, only the PR author and pytorch organization members may rebase this PR.

@jeffdaily
Collaborator

After the latest rebase can we get this merged?

@pytorchmergebot
Collaborator

@pytorchbot successfully started a merge job. Check the current status here

@pytorchmergebot
Collaborator

Merge failed due to This PR is too stale; the last push date was more than 3 days ago. Please rebase and try again.
Raised by https://github.com/pytorch/pytorch/actions/runs/2754541563

@visualxu
Author

@pytorchbot merge

@pytorchmergebot
Collaborator

@pytorchbot successfully started a merge job. Check the current status here

@pytorchmergebot
Collaborator

Merge failed due to This PR is too stale; the last push date was more than 3 days ago. Please rebase and try again.
Raised by https://github.com/pytorch/pytorch/actions/runs/2754570892

@visualxu
Author

@pytorchbot rebase

@pytorchmergebot
Collaborator

@pytorchbot successfully started a rebase job. Check the current status here

@pytorchmergebot
Collaborator

Successfully rebased rocm_iluvatar onto refs/remotes/origin/master, please pull locally before adding more changes (for example, via git checkout rocm_iluvatar && git pull --rebase)

@visualxu
Author

@pytorchbot merge

@pytorchmergebot
Collaborator

@pytorchbot successfully started a merge job. Check the current status here

@pytorchmergebot
Collaborator

Merge failed due to Matched rule superuser, but PR #76100 was not reviewed yet by any of: suo, Nitrokitty, hanton, bochko, alexanderguzhva, ...
Raised by https://github.com/pytorch/pytorch/actions/runs/2754624499

@jithunnair-amd
Collaborator

@kit1980 We need someone from PyTorch team to approve this PR

@jeffdaily
Collaborator

@pytorchbot merge

@pytorchmergebot
Collaborator

@pytorchbot successfully started a merge job. Check the current status here

@pytorchmergebot
Collaborator

Merge failed due to This PR is too stale; the last push date was more than 3 days ago. Please rebase and try again.
Raised by https://github.com/pytorch/pytorch/actions/runs/2778534216

@jeffdaily
Collaborator

@pytorchbot rebase

@pytorch-bot

pytorch-bot bot commented Aug 1, 2022

You don't have permissions to rebase this PR, only the PR author and pytorch organization members may rebase this PR.

@jeffdaily
Collaborator

@kit1980 thanks for the approval, can you also trigger a rebase? I only have permission to merge.

@kit1980
Contributor

kit1980 commented Aug 1, 2022

@pytorchbot rebase

@pytorchmergebot
Collaborator

@pytorchbot successfully started a rebase job. Check the current status here

@pytorchmergebot
Collaborator

Successfully rebased rocm_iluvatar onto refs/remotes/origin/master, please pull locally before adding more changes (for example, via git checkout rocm_iluvatar && git pull --rebase)

@jeffdaily
Collaborator

@pytorchbot merge

@pytorchmergebot
Collaborator

@pytorchbot successfully started a merge job. Check the current status here

@github-actions
Contributor

github-actions bot commented Aug 2, 2022

Hey @visualxu.
You've committed this PR, but it does not have both a 'release notes: ...' and 'topics: ...' label. Please add one of each to the PR. The 'release notes: ...' label should represent the part of PyTorch that this PR changes (fx, autograd, distributed, etc) and the 'topics: ...' label should represent the kind of PR it is (not user facing, new feature, bug fix, perf improvement, etc). The list of valid labels can be found here for the 'release notes: ...' and here for the 'topics: ...'.
For changes that are 'topic: not user facing' there is no need for a release notes label.

facebook-github-bot pushed a commit that referenced this pull request Aug 3, 2022
Summary:
Fixes #76095, memory cross-border access on the ROCM platform.

Pull Request resolved: #76100
Approved by: https://github.com/jeffdaily, https://github.com/kit1980

Test Plan: contbuild & OSS CI, see https://hud.pytorch.org/commit/pytorch/pytorch/62de55ed6b58275f9e7899aab89910818254e0b6

Reviewed By: kit1980

Differential Revision: D38359417

fbshipit-source-id: 134f261e18cb766ad1e5cb6edf175ba3c6731c50
Labels
ciflow/trunk (Trigger trunk jobs on your pull request), cla signed, Merged, module: rocm (AMD GPU support for Pytorch), open source
Development

Successfully merging this pull request may close these issues.

memory cross-border access on the ROCM platform
7 participants