Contributor

@atalman atalman commented Sep 8, 2023

Pinning Docker images to try to address SEV #108862.

@atalman atalman requested a review from a team as a code owner September 8, 2023 15:31
@pytorch-bot

pytorch-bot bot commented Sep 8, 2023

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/108871

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEV

There is 1 currently active SEV. If your PR is affected, please view it below:

❌ 1 New Failure, 7 Unrelated Failures

As of commit 88c8006 with merge base cdf7f3e (image):

NEW FAILURE - The following job has failed:

FLAKY - The following jobs failed but were likely due to flakiness present on trunk:

BROKEN TRUNK - The following job failed but was also failing on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

UNSTABLE - The following jobs failed, likely due to flakiness present on trunk, and have been marked as unstable:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@pytorch-bot pytorch-bot bot added the topic: not user facing topic category label Sep 8, 2023
@atalman atalman changed the title from "A100 CI Sev - pin docker images" to "CI Sev - pin docker images for A100 workers" Sep 8, 2023
Contributor

@huydhn huydhn left a comment

This looks good, but you probably want to cancel the nightly job and force merge in this case, because that job takes something like 8 hours to finish.

@ezyang
Contributor

ezyang commented Sep 10, 2023

It seems like... it didn't work?

@atalman
Contributor Author

atalman commented Sep 10, 2023

> It seems like... it didn't work?

I think I found the reason it did not work. A manual remediation step is required. Worth giving it a shot. Will merge this now and then run the manual remediation.

@atalman
Contributor Author

atalman commented Sep 10, 2023

@pytorchbot merge -f "working on CI sev"

@pytorchmergebot
Collaborator

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Please use -f only as a last resort; consider -i/--ignore-current instead, which continues the merge while ignoring current failures. This allows currently pending tests to finish and report signal before the merge.

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging: Check the merge workflow status here

atalman added a commit to atalman/pytorch that referenced this pull request Sep 11, 2023
pytorchmergebot pushed a commit that referenced this pull request Sep 12, 2023
This reverts commit 89eb7a7.

Not required anymore since issue addressed by pytorch/test-infra#4563
But deploying normally. Want to get proper green signal for deployment

Pull Request resolved: #109071
Approved by: https://github.com/huydhn
michiboo pushed a commit to michiboo/pytorch that referenced this pull request Sep 17, 2023
Pinning docker images, trying to address SEV : pytorch#108862
Pull Request resolved: pytorch#108871
Approved by: https://github.com/huydhn
