Skip to content

Conversation

cryos
Copy link
Collaborator

@cryos cryos commented Sep 10, 2025

Description

Migrate the Windows testing to use the new NV GHA runners.

Checklist

  • I am familiar with the Contributing Guidelines.
  • New or existing tests cover these changes.
  • The documentation is up to date with these changes.

@copy-pr-bot
Copy link
Contributor

copy-pr-bot bot commented Sep 10, 2025

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

@cryos
Copy link
Collaborator Author

cryos commented Sep 10, 2025

/ok to test

@github-actions

This comment has been minimized.

@leofang leofang added enhancement Any code-related improvements P0 High priority - Must do! CI/CD CI/CD infrastructure labels Sep 10, 2025
@cryos cryos marked this pull request as ready for review September 10, 2025 15:38
@copy-pr-bot
Copy link
Contributor

copy-pr-bot bot commented Sep 10, 2025

Auto-sync is disabled for ready for review pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

@leofang
Copy link
Member

leofang commented Sep 10, 2025

All green! Merge?

@leofang
Copy link
Member

leofang commented Sep 10, 2025

btw we should backport this, but #955 needs to land first, after that I hope the backport bot should just work? (not sure)

@cryos cryos merged commit 978154c into NVIDIA:main Sep 10, 2025
52 checks passed
@github-actions
Copy link

Doc Preview CI
Preview removed because the pull request was closed or merged.

cryos added a commit to kkraus14/cuda-python that referenced this pull request Sep 10, 2025
Migrate the Windows testing to use the new NV GHA runners.
Cherry-pick NVIDIA#958.
leofang added a commit that referenced this pull request Sep 11, 2025
* bump all CI jobs to CUDA 12.9.1

* CI: Consolidate test matrix configurations into ci/test-matrix.json with hard-coded values, optimized checkout, and prepared Windows self-hosted runner migration (#889)

* Initial plan

* Consolidate test matrices from workflows into ci/test-matrix.json

Co-authored-by: leofang <5534781+leofang@users.noreply.github.com>

* Hard-code all GPU and ARCH values in test-matrix.json with 6 fields per entry

Co-authored-by: leofang <5534781+leofang@users.noreply.github.com>

* Update Windows test matrix with a100 GPU and latest-1 driver, configure self-hosted runners

Co-authored-by: leofang <5534781+leofang@users.noreply.github.com>

* fix

* Revert eed0b71 and change Windows DRIVER from latest-1 to latest

Co-authored-by: leofang <5534781+leofang@users.noreply.github.com>

* Add proxy cache setup to Windows workflow for self-hosted runners

Co-authored-by: leofang <5534781+leofang@users.noreply.github.com>

* Remove Git for Windows and gh CLI installation steps, add T4 GPU support to Windows matrix

Co-authored-by: leofang <5534781+leofang@users.noreply.github.com>

* Set fetch-depth: 1 for checkout steps and favor L4/T4 over A100 GPUs for Windows testing

Co-authored-by: leofang <5534781+leofang@users.noreply.github.com>

* Revert Windows workflow to GitHub-hosted runners with TODO comments for future self-hosted migration

Co-authored-by: leofang <5534781+leofang@users.noreply.github.com>

* [pre-commit.ci] auto code formatting

* Revert Win runner name change for now

---------

Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>
Co-authored-by: leofang <5534781+leofang@users.noreply.github.com>
Co-authored-by: Leo Fang <leof@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

* forgot to add windows

* rerun codegen with 12.9.1 and update result/error explanations

* First stab at the filter for CUDA < 13 in CI

* Get data from the top-level array

* Use the map function on select output

* CI: Move to self-hosted Windows GPU runners

Migrate the Windows testing to use the new NV GHA runners.
Cherry-pick #958.

---------

Co-authored-by: Copilot <198982749+Copilot@users.noreply.github.com>
Co-authored-by: leofang <5534781+leofang@users.noreply.github.com>
Co-authored-by: Leo Fang <leof@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Marcus D. Hanwell <mhanwell@nvidia.com>
@leofang
Copy link
Member

leofang commented Sep 11, 2025

btw we should backport this, but #955 needs to land first, after that I hope the backport bot should just work? (not sure)

Backported in #995.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CI/CD CI/CD infrastructure enhancement Any code-related improvements P0 High priority - Must do!

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants