CI: Move to self-hosted Windows GPU runners #958

cryos · 2025-09-10T14:25:17Z

Description

Migrate the Windows testing to use the new NV GHA runners.

Checklist

I am familiar with the Contributing Guidelines.
New or existing tests cover these changes.
The documentation is up to date with these changes.

copy-pr-bot · 2025-09-10T14:25:21Z

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

cryos · 2025-09-10T14:25:36Z

/ok to test

copy-pr-bot · 2025-09-10T15:38:55Z

Auto-sync is disabled for ready for review pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

leofang · 2025-09-10T15:47:10Z

All green! Merge?

leofang · 2025-09-10T15:48:17Z

btw we should backport this, but #955 needs to land first, after that I hope the backport bot should just work? (not sure)

github-actions · 2025-09-10T16:16:38Z

Doc Preview CI
Preview removed because the pull request was closed or merged.

Migrate the Windows testing to use the new NV GHA runners. Cherry-pick NVIDIA#958.

* bump all CI jobs to CUDA 12.9.1
* CI: Consolidate test matrix configurations into ci/test-matrix.json with hard-coded values, optimized checkout, and prepared Windows self-hosted runner migration (#889)
  * Initial plan
  * Consolidate test matrices from workflows into ci/test-matrix.json
    Co-authored-by: leofang <5534781+leofang@users.noreply.github.com>
  * Hard-code all GPU and ARCH values in test-matrix.json with 6 fields per entry
    Co-authored-by: leofang <5534781+leofang@users.noreply.github.com>
  * Update Windows test matrix with a100 GPU and latest-1 driver, configure self-hosted runners
    Co-authored-by: leofang <5534781+leofang@users.noreply.github.com>
  * fix
  * Revert eed0b71 and change Windows DRIVER from latest-1 to latest
    Co-authored-by: leofang <5534781+leofang@users.noreply.github.com>
  * Add proxy cache setup to Windows workflow for self-hosted runners
    Co-authored-by: leofang <5534781+leofang@users.noreply.github.com>
  * Remove Git for Windows and gh CLI installation steps, add T4 GPU support to Windows matrix
    Co-authored-by: leofang <5534781+leofang@users.noreply.github.com>
  * Set fetch-depth: 1 for checkout steps and favor L4/T4 over A100 GPUs for Windows testing
    Co-authored-by: leofang <5534781+leofang@users.noreply.github.com>
  * Revert Windows workflow to GitHub-hosted runners with TODO comments for future self-hosted migration
    Co-authored-by: leofang <5534781+leofang@users.noreply.github.com>
  * [pre-commit.ci] auto code formatting
  * Revert Win runner name change for now
  ---------
  Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>
  Co-authored-by: leofang <5534781+leofang@users.noreply.github.com>
  Co-authored-by: Leo Fang <leof@nvidia.com>
  Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* forgot to add windows
* rerun codegen with 12.9.1 and update result/error explanations
* First stab at the filter for CUDA < 13 in CI
* Get data from the top-level array
* Use the map function on select output
* CI: Move to self-hosted Windows GPU runners
  Migrate the Windows testing to use the new NV GHA runners. Cherry-pick #958.
---------
Co-authored-by: Copilot <198982749+Copilot@users.noreply.github.com>
Co-authored-by: leofang <5534781+leofang@users.noreply.github.com>
Co-authored-by: Leo Fang <leof@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Marcus D. Hanwell <mhanwell@nvidia.com>
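The commit message above mentions a filter that keeps only CUDA < 13 entries, reading them from a top-level array. The repository's actual filter is not shown in this thread (the "select" and "map" wording suggests a jq expression), so the following is only a rough Python analogue of the idea. The field names (`ARCH`, `CUDA_VER`, `GPU`, `DRIVER`), the sample entries, and the helper names are all hypothetical.

```python
import json

# Hypothetical matrix: a top-level JSON array of entries.
# Field names and values are assumptions for illustration only;
# they are not copied from ci/test-matrix.json.
MATRIX_JSON = """
[
  {"ARCH": "amd64", "CUDA_VER": "12.9.1", "GPU": "l4",   "DRIVER": "latest"},
  {"ARCH": "amd64", "CUDA_VER": "13.0.0", "GPU": "t4",   "DRIVER": "latest"},
  {"ARCH": "arm64", "CUDA_VER": "11.8.0", "GPU": "a100", "DRIVER": "latest"}
]
"""


def cuda_major(version: str) -> int:
    """Return the major component of a dotted version string."""
    return int(version.split(".")[0])


def filter_below_cuda_major(entries, major_limit=13):
    """Keep entries whose CUDA major version is below major_limit."""
    return [e for e in entries if cuda_major(e["CUDA_VER"]) < major_limit]


entries = json.loads(MATRIX_JSON)   # data comes from the top-level array
kept = filter_below_cuda_major(entries)
print(json.dumps(kept, indent=2))
```

Comparing on the parsed major version rather than on the raw string avoids lexicographic surprises such as "9.0" sorting after "13.0".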

leofang · 2025-09-11T14:28:46Z

> btw we should backport this, but #955 needs to land first, after that I hope the backport bot should just work? (not sure)

Backported in #995.

CI: Move to self-hosted Windows GPU runners

e4be2b9

leofang assigned cryos Sep 10, 2025

leofang added the enhancement (Any code-related improvements), P0 (High priority - Must do!), and CI/CD (CI/CD infrastructure) labels Sep 10, 2025

leofang modified the milestones: cuda-python parking lot, cuda.core beta 7 Sep 10, 2025

leofang approved these changes Sep 10, 2025

cryos marked this pull request as ready for review September 10, 2025 15:38

cryos merged commit 978154c into NVIDIA:main Sep 10, 2025
52 checks passed

cryos added a commit to kkraus14/cuda-python that referenced this pull request Sep 10, 2025

CI: Move to self-hosted Windows GPU runners

90b37d5

Migrate the Windows testing to use the new NV GHA runners. Cherry-pick NVIDIA#958.
