Apply the same fix to cleanup process on Windows CPU build job #101460
Conversation
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/101460
Note: Links to docs will display an error until the docs builds have been completed.
✅ No Failures as of commit 6ccf7c4.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
@pytorchbot merge -f 'Windows CI build job has passed. Merge to fix trunk'

Merge started. Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
This goes together with pytorch/test-infra#4169. To be replaced by the main branch once pytorch/test-infra#4169 merges.

Pull Request resolved: #101460
Approved by: https://github.com/clee2000, https://github.com/PaliC
Windows flakiness strikes again. A new flaky issue has started appearing on HUD in which tearing down the Windows workspace fails with a `Device or resource busy` error when trying to `rm -rf ./*` the workspace, for example https://github.com/pytorch/pytorch/actions/runs/5051845102/jobs/9064107717. It happens on both build and test jobs. I have looked into all commits since last weekend, but nothing stands out as Windows-related.

The error means that a process still holds the directory, but it's unclear which one, as all CI processes should have been stopped by then (#101460), with the only exception of the runner daemon itself. On the other hand, the issue is flaky: the next job running on the same failed runner can clean up the workspace fine when checking out PyTorch (https://github.com/pytorch/pytorch/blob/main/.github/actions/checkout-pytorch/action.yml#L21-L35). For example, `i-0ec1767a38ec93b4e` failed at https://github.com/pytorch/pytorch/actions/runs/5051845102/jobs/9064107717 and its immediate next job succeeded at https://github.com/pytorch/pytorch/actions/runs/5052147504/jobs/9064717085. So, I think that adding retries should help mitigate this.

Related to pytorch/test-infra#4206 (not the same root cause; I figured out pytorch/test-infra#4206 while working on this PR).

Pull Request resolved: #102051
Approved by: https://github.com/kit1980
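A minimal sketch of the retry idea described above, assuming a plain bash loop; the actual fix lives in the reusable CI setup/teardown steps and the pytorch/test-infra retry action, so the function name, attempt count, and delay here are illustrative assumptions only:

```bash
#!/usr/bin/env bash
# Illustrative retry loop for cleaning up a CI workspace on Windows runners.
# The real cleanup lives in the CI teardown scripts; names and limits here are assumptions.
set -uo pipefail

retry_cleanup() {
  local attempts=5
  local delay=30
  for i in $(seq 1 "${attempts}"); do
    # rm -rf can fail with "Device or resource busy" if a process still holds the directory.
    if rm -rf ./* 2>/dev/null; then
      echo "Workspace cleaned up on attempt ${i}"
      return 0
    fi
    echo "Cleanup attempt ${i} failed, retrying in ${delay}s..."
    sleep "${delay}"
  done
  echo "Failed to clean up workspace after ${attempts} attempts" >&2
  return 1
}

retry_cleanup
```

Since the failure is transient (the very next job on the same runner can usually delete the directory), a short backoff between attempts is typically enough to let whatever process holds the handle exit before the next `rm -rf`.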