Skip to content

Conversation

@huydhn
Copy link
Contributor

@huydhn huydhn commented Mar 21, 2023

Per title, I suspect that having a leftover PyTorch built from CUDA 11.7 installed in non-ephemeral Windows runners could cause some flakiness on Windows CUDA 11.8 jobs also running on the same type of runners, for example win-vs2019-cuda11.8-py3 in https://hud.pytorch.org/pytorch/pytorch/commit/5d3c347bf6f0b86c96a1fe541db5d4f9586c8840 failed with a PATH error:

nvrtc: error: failed to open nvrtc-builtins64_117.dll.
Make sure that nvrtc-builtins64_117.dll is installed correctly.

This also cleans up the dead code about pytorch_env_restore.bat under ci_scripts temp directory. This directory is cleaned up always by teardown-win. So the bat script will never be there for the next job anyway. As Windows test jobs are doing fine, proving that we don't need this adhoc script anymore.

Testing

https://github.com/pytorch/pytorch/actions/runs/4485931686/jobs/7888513795

@huydhn huydhn added ciflow/trunk Trigger trunk jobs on your pull request test-config/default labels Mar 21, 2023
@pytorch-bot pytorch-bot bot added the release notes: releng release notes category label Mar 21, 2023
@pytorch-bot
Copy link

pytorch-bot bot commented Mar 21, 2023

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/97285

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 67a287d:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@huydhn huydhn changed the title Correctly reset CI env on non-ephemeral Windows runners Uninstall PyTorch after testing on non-ephemeral Windows runners Mar 22, 2023
@huydhn huydhn requested review from clee2000 and seemethere March 22, 2023 16:31
@huydhn huydhn marked this pull request as ready for review March 22, 2023 16:31
@huydhn huydhn requested a review from a team as a code owner March 22, 2023 16:31
@huydhn
Copy link
Contributor Author

huydhn commented Mar 23, 2023

@pytorchbot merge

@pytorchmergebot
Copy link
Collaborator

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status
here

cyyever pushed a commit to cyyever/pytorch_private that referenced this pull request Mar 23, 2023
…285)

Per title, I suspect that having a leftover PyTorch built from CUDA 11.7 installed in non-ephemeral Windows runners could cause some flakiness on Windows CUDA 11.8 jobs also running on the same type of runners, for example `win-vs2019-cuda11.8-py3` in https://hud.pytorch.org/pytorch/pytorch/commit/5d3c347bf6f0b86c96a1fe541db5d4f9586c8840 failed with a PATH error:

```
nvrtc: error: failed to open nvrtc-builtins64_117.dll.
Make sure that nvrtc-builtins64_117.dll is installed correctly.
```

This also cleans up the dead code about `pytorch_env_restore.bat` under `ci_scripts` temp directory.  This directory is cleaned up always by [teardown-win](https://github.com/pytorch/pytorch/blob/master/.github/actions/teardown-win/action.yml#L33).  So the bat script will never be there for the next job anyway.  As Windows test jobs are doing fine, proving that we don't need this adhoc script anymore.

### Testing
https://github.com/pytorch/pytorch/actions/runs/4485931686/jobs/7888513795
Pull Request resolved: pytorch/pytorch#97285
Approved by: https://github.com/seemethere
cyyever pushed a commit to cyyever/pytorch_private that referenced this pull request Mar 27, 2023
…285)

Per title, I suspect that having a leftover PyTorch built from CUDA 11.7 installed in non-ephemeral Windows runners could cause some flakiness on Windows CUDA 11.8 jobs also running on the same type of runners, for example `win-vs2019-cuda11.8-py3` in https://hud.pytorch.org/pytorch/pytorch/commit/5d3c347bf6f0b86c96a1fe541db5d4f9586c8840 failed with a PATH error:

```
nvrtc: error: failed to open nvrtc-builtins64_117.dll.
Make sure that nvrtc-builtins64_117.dll is installed correctly.
```

This also cleans up the dead code about `pytorch_env_restore.bat` under `ci_scripts` temp directory.  This directory is cleaned up always by [teardown-win](https://github.com/pytorch/pytorch/blob/master/.github/actions/teardown-win/action.yml#L33).  So the bat script will never be there for the next job anyway.  As Windows test jobs are doing fine, proving that we don't need this adhoc script anymore.

### Testing
https://github.com/pytorch/pytorch/actions/runs/4485931686/jobs/7888513795
Pull Request resolved: pytorch/pytorch#97285
Approved by: https://github.com/seemethere
@huydhn huydhn deleted the windows-118-cleanup branch April 11, 2023 00:18
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ciflow/trunk Trigger trunk jobs on your pull request Merged release notes: releng release notes category test-config/default

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants