Skip to content

Conversation

@huydhn
Copy link
Contributor

@huydhn huydhn commented Nov 3, 2022

Try to reset the NVIDIA devices if they get stuck in failed mode per comment in #88388

@huydhn huydhn requested a review from malfet November 3, 2022 22:16
@huydhn huydhn self-assigned this Nov 3, 2022
@huydhn huydhn requested a review from a team as a code owner November 3, 2022 22:16
@pytorch-bot pytorch-bot bot added the topic: not user facing topic category label Nov 3, 2022
@pytorch-bot
Copy link

pytorch-bot bot commented Nov 3, 2022

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/88459

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 Failures

As of commit 8e06dc9:

The following jobs have failed:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@huydhn huydhn added the ciflow/trunk Trigger trunk jobs on your pull request label Nov 3, 2022
@huydhn
Copy link
Contributor Author

huydhn commented Nov 3, 2022

@pytorchbot merge -f 'All cuda jobs have passed NVIDIA installation step. Force merge to avoid some a recent revert today that increase TTS significantly'

@pytorchmergebot
Copy link
Collaborator

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status
here

kulinseth pushed a commit to kulinseth/pytorch that referenced this pull request Nov 5, 2022
Try to reset the NVIDIA devices if they get stuck in failed mode per comment in pytorch#88388

Pull Request resolved: pytorch#88459
Approved by: https://github.com/malfet
@huydhn huydhn deleted the reset-nvidia-a10g branch November 7, 2022 17:28
kulinseth pushed a commit to kulinseth/pytorch that referenced this pull request Dec 10, 2022
Try to reset the NVIDIA devices if they get stuck in failed mode per comment in pytorch#88388

Pull Request resolved: pytorch#88459
Approved by: https://github.com/malfet
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ciflow/inductor ciflow/trunk Trigger trunk jobs on your pull request Merged topic: not user facing topic category

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants