Skip to content

Conversation

[ghstack-poisoned]
Copy link

pytorch-bot bot commented Sep 20, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/163422

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 Cancelled Job

As of commit ca28793 with merge base 51152ef (image):

CANCELLED JOB - The following job was cancelled. Please retry:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

budget = torch.cuda.max_memory_allocated() - torch.cuda.memory_allocated()
try:
budget = torch.cuda.max_memory_allocated() - torch.cuda.memory_allocated()
except RuntimeError:
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Don't even log what the runtime error was? Dangerous...

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Look at the issue for context. I don't think spamming the user with warnings in that use case would be helpful.

[ghstack-poisoned]
[ghstack-poisoned]
[ghstack-poisoned]
[ghstack-poisoned]
@jansel jansel requested a review from eellison September 22, 2025 15:12
Copy link
Contributor

@eellison eellison left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Cudagraphs won't be useable with custom allocator, but yea, this should at least work outside of that case.

[ghstack-poisoned]
@jansel jansel added the ciflow/trunk Trigger trunk jobs on your pull request label Sep 22, 2025
@jansel
Copy link
Contributor Author

jansel commented Sep 23, 2025

@pytorchbot merge

@pytorchmergebot
Copy link
Collaborator

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status
here

@pytorchmergebot
Copy link
Collaborator

The merge job was canceled or timed out. This most often happen if two merge requests were issued for the same PR, or if merge job was waiting for more than 6 hours for tests to finish. In later case, please do not hesitate to reissue the merge command
For more information see pytorch-bot wiki.

@jansel
Copy link
Contributor Author

jansel commented Sep 23, 2025

@pytorchbot merge -i

@pytorchmergebot
Copy link
Collaborator

Merge started

Your change will be merged while ignoring the following 1 checks: trunk / linux-jammy-rocm-py3.10 / test (distributed, 1, 1, linux.rocm.gpu.gfx942.4)

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status
here

pytorchmergebot pushed a commit that referenced this pull request Sep 24, 2025
pytorchmergebot pushed a commit that referenced this pull request Sep 24, 2025
pytorchmergebot pushed a commit that referenced this pull request Sep 24, 2025
pytorchmergebot pushed a commit that referenced this pull request Sep 24, 2025
This reverts commit a8cd437.

See #163481 (comment)

This PR might also cause issues with cudagraphs.

Pull Request resolved: #163737
Approved by: https://github.com/ezyang
ghstack dependencies: #163386, #163398, #163387, #163414, #163415, #163419, #163434, #163393, #163412, #163422, #163481, #163520, #163482
dsashidh pushed a commit to dsashidh/pytorch that referenced this pull request Sep 26, 2025
jainapurva pushed a commit that referenced this pull request Sep 29, 2025
jainapurva pushed a commit that referenced this pull request Sep 29, 2025
jainapurva pushed a commit that referenced this pull request Sep 29, 2025
jainapurva pushed a commit that referenced this pull request Sep 29, 2025
This reverts commit a8cd437.

See #163481 (comment)

This PR might also cause issues with cudagraphs.

Pull Request resolved: #163737
Approved by: https://github.com/ezyang
ghstack dependencies: #163386, #163398, #163387, #163414, #163415, #163419, #163434, #163393, #163412, #163422, #163481, #163520, #163482
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants