Swap CUDA 10.1 and CPU CI for windows #57493

janeyx99 · 2021-05-03T21:06:47Z

This change temporarily disables CUDA testing on PRs, but keeps it on master.
This is likely to increase the number of reverts, but this is necessary as a stop-gap measure to cap the CI costs growth.

facebook-github-bot · 2021-05-03T21:06:53Z

💊 CI failures summary and remediations

As of commit bf65dd8 (more details on the Dr. CI page):

💚 💚 Looks good so far! There are no failures yet. 💚 💚

This comment was automatically generated by Dr. CI (expand for details).

Follow this link to opt-out of these comments for your Pull Requests.

Please report bugs/suggestions to the (internal) Dr. CI Users group.

Click here to manually regenerate this comment.

.circleci/cimodel/data/windows_build_definitions.py

facebook-github-bot · 2021-05-03T21:42:29Z

@janeyx99 has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

facebook-github-bot · 2021-05-03T21:44:36Z

@janeyx99 has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

ngimel · 2021-05-03T23:28:44Z

Hm, given how many cuda-related PRs and especially builds are broken on windows only, I'm afraid it will increase churn a lot.

zasdfgbnm · 2021-05-03T23:40:07Z

Does this mean no CI will be running for CUDA+Windows for PR? I feel worried about it as in the past Windows CI has been useful. (I saw Windows only failures ~twice last month)

ptrblck · 2021-05-03T23:58:02Z

This change temporarily disables CUDA testing on PRs, but keeps it on master.

If I understand it correctly, Windows CI would only be triggered after the PR was merged into the master branch?
How temporary would it be, as this could indeed increase the churn of reverting faulty PRs a lot, as already mentioned, and could additionally make fixes to PRs hard (they won't be tested before the re-merge of the "fixed PR" in CI).

seemethere · 2021-05-04T00:01:33Z

I think based on our current cost for running Windows gpu executors it's just not tenable to run Windows CUDA tests on every PR.

After this PR lands I'll do a follow up PR to add a CI label that we can use to trigger windows CUDA testing on PRs if requested.

It'll be similar to #54018

…sharding

facebook-github-bot · 2021-05-04T01:52:35Z

@janeyx99 has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

facebook-github-bot · 2021-05-04T02:22:30Z

@janeyx99 merged this pull request in 1d3a9bf.

imaginary-person · 2021-05-07T01:28:14Z

If failures on Windows CUDA checks do not usually result from the fact that Windows CUDA CI checks use a Tesla T4 GPU (SM_75), whereas Linux CUDA CI checks use an SM_52 GPU, how about also having a (regular) Windows CI check with an SM_52 GPU, if an executor with an SM_52 GPU is significantly cheaper than an executor with an SM_75 GPU?

In that case, @seemethere can make two labels for Windows CUDA CI checks - one each for executors with an SM_52 & an SM_75 GPU respectively, so that one can choose the SM_52 one (cheaper), if the difference in CUDA Compute Capability would be deemed to not be a source of potential failures for an in-progress PR.

Summary: This change temporarily disables CUDA testing on PRs, but keeps it on master. This is likely to increase the number of reverts, but this is necessary as a stop-gap measure to cap the CI costs growth. Pull Request resolved: pytorch#57493 Reviewed By: seemethere Differential Revision: D28162697 Pulled By: janeyx99 fbshipit-source-id: 1bc529a405f7d63c07f4bd9f8ceca8da450743fc

Swap CUDA 10.1 and CPU CI for windows

f2c3e9a

facebook-github-bot added the cla signed label May 3, 2021

malfet reviewed May 3, 2021

View reviewed changes

.circleci/cimodel/data/windows_build_definitions.py Outdated Show resolved Hide resolved

malfet approved these changes May 3, 2021

View reviewed changes

seemethere approved these changes May 3, 2021

View reviewed changes

seemethere added module: ci Related to continuous integration module: windows Windows support for PyTorch labels May 3, 2021

Keep shard 1 on master only

d10b701

janeyx99 force-pushed the swap-windows-ci branch from 1b53f42 to d10b701 Compare May 3, 2021 21:44

remove quotes

499ef10

swap force_on_cpu test with normal cpu test to maintain integrity of …

bf65dd8

…sharding

facebook-github-bot closed this in 1d3a9bf May 4, 2021

facebook-github-bot added the Merged label May 4, 2021

janeyx99 mentioned this pull request May 6, 2021

Improve Windows testing on CI while keeping down costs #57782

Closed

janeyx99 mentioned this pull request Oct 12, 2021

No win cuda tests are running on PRs? #66471

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Swap CUDA 10.1 and CPU CI for windows #57493

Swap CUDA 10.1 and CPU CI for windows #57493

Uh oh!

janeyx99 commented May 3, 2021 •

edited

Loading

Uh oh!

facebook-github-bot commented May 3, 2021 •

edited

Loading

Uh oh!

Uh oh!

facebook-github-bot commented May 3, 2021

Uh oh!

facebook-github-bot commented May 3, 2021

Uh oh!

ngimel commented May 3, 2021

Uh oh!

zasdfgbnm commented May 3, 2021

Uh oh!

ptrblck commented May 3, 2021

Uh oh!

seemethere commented May 4, 2021

Uh oh!

facebook-github-bot commented May 4, 2021

Uh oh!

facebook-github-bot commented May 4, 2021

Uh oh!

imaginary-person commented May 7, 2021 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

8 participants

Swap CUDA 10.1 and CPU CI for windows #57493

Swap CUDA 10.1 and CPU CI for windows #57493

Uh oh!

Conversation

janeyx99 commented May 3, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

facebook-github-bot commented May 3, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

💊 CI failures summary and remediations

Uh oh!

Uh oh!

facebook-github-bot commented May 3, 2021

Uh oh!

facebook-github-bot commented May 3, 2021

Uh oh!

ngimel commented May 3, 2021

Uh oh!

zasdfgbnm commented May 3, 2021

Uh oh!

ptrblck commented May 3, 2021

Uh oh!

seemethere commented May 4, 2021

Uh oh!

facebook-github-bot commented May 4, 2021

Uh oh!

facebook-github-bot commented May 4, 2021

Uh oh!

imaginary-person commented May 7, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

8 participants

janeyx99 commented May 3, 2021 •

edited

Loading

facebook-github-bot commented May 3, 2021 •

edited

Loading

imaginary-person commented May 7, 2021 •

edited

Loading