Skip to content

build: add extra cuda13, rocm7; remove cuda11, rocm4,5#2561

Merged
havogt merged 6 commits intoGridTools:mainfrom
havogt:add_cuda13_rocm7_cupy14_testing
Apr 1, 2026
Merged

build: add extra cuda13, rocm7; remove cuda11, rocm4,5#2561
havogt merged 6 commits intoGridTools:mainfrom
havogt:add_cuda13_rocm7_cupy14_testing

Conversation

@havogt
Copy link
Copy Markdown
Contributor

@havogt havogt commented Mar 31, 2026

No description provided.

@havogt havogt requested a review from egparedes March 31, 2026 13:54
Copy link
Copy Markdown
Contributor

@egparedes egparedes left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM. Just a question

Comment thread ci/cscs-ci.yml
image: ${CSCS_REGISTRY_PATH}/public/${ARCH}/base/gt4py-ci-${PY_VERSION}:${DOCKER_TAG}
variables:
TEST_VARIANTS: 'cpu' # Extended jobs should redefine which variants (cpu, cuda12, rocm6_0) to test
TEST_VARIANTS: 'cpu' # Extended jobs should redefine which variants (cpu, cuda12, rocm6) to test
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Question: is it clear for you what is this variable used for? I'm not longer sure if I understand why it is needed. Also, do you explicitly keep the cuda12 variant instead of cuda13 here?

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Trying to reverse engineer, I think we define all SUBVARIANTs in test_common, then TEST_VARIANTS should be used in the concrete test_cscs_gh200, test_cscs_amd_rocm to pick which to actually run.
We are still using cuda12 on santis by default, therefore I didn't bump the version (assuming you are not talking about the comment, but about what we run).

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We are still using cuda12 on santis by default, therefore I didn't bump the version (assuming you are not talking about the comment, but about what we run).

Understood, thanks.

Trying to reverse engineer, I think we define all SUBVARIANTs in test_common, then TEST_VARIANTS should be used in the concrete test_cscs_gh200, test_cscs_amd_rocm to pick which to actually run.

I see now, thanks. It's kind of weird how it's done: TEST_VARIANTS is used to exclude the variants which do not match its content in the exclude_variants_rules ...

@havogt havogt merged commit f6a4f80 into GridTools:main Apr 1, 2026
32 checks passed
@havogt havogt deleted the add_cuda13_rocm7_cupy14_testing branch April 1, 2026 11:18
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants