{2025.06} Zen5 accel easystack #1446

Merged: ocaisa merged 1 commit into EESSI:main from bedroge:zen5_accel on Mar 6, 2026

bedroge (Collaborator) commented Mar 6, 2026

The reprod script could also be used for this, but since we only have CUDA and cuDNN, I've just copied https://github.com/EESSI/software-layer/blob/main/easystacks/software.eessi.io/2025.06/accel/nvidia/eessi-2025.06-eb-5.2.0-001-system.yml.

bedroge added the accel:nvidia, 2025.06-software.eessi.io (2025.06 version of software.eessi.io), and zen5 labels Mar 6, 2026

bedroge (Collaborator, Author) commented Mar 6, 2026

bot: build repo:eessi.io-2025.06-software instance:eessi-bot-rug for:arch=x86_64/amd/zen5,accel=nvidia/cc120
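
The build command above is a series of space-separated `key:value` arguments after the `bot: build` prefix, and the `for:` argument itself carries comma-separated `name=value` pairs. A minimal Python sketch of splitting such a line, purely for illustration: the function name `parse_bot_command` is hypothetical and this is not the bot's actual parser.

```python
def parse_bot_command(line: str) -> dict:
    """Split a 'bot: build key:value ...' line into a dict (illustrative only)."""
    prefix = "bot: build"
    if not line.startswith(prefix):
        raise ValueError("not a bot build command")
    parsed = {}
    for token in line[len(prefix):].split():
        key, _, value = token.partition(":")
        if "=" in value:
            # e.g. for:arch=x86_64/amd/zen5,accel=nvidia/cc120
            parsed[key] = dict(pair.split("=", 1) for pair in value.split(","))
        else:
            parsed[key] = value
    return parsed


cmd = ("bot: build repo:eessi.io-2025.06-software instance:eessi-bot-rug "
       "for:arch=x86_64/amd/zen5,accel=nvidia/cc120")
print(parse_bot_command(cmd))
```

The same function also handles the optional `on:arch=...` argument used in the later commands, since it follows the same `key:name=value` shape.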

eessi-bot-rug (Bot) commented Mar 6, 2026

New job on instance eessi-bot-rug for repository eessi.io-2025.06-software
Building on: amd-zen5 and accelerator nvidia/cc120
Building for: x86_64/amd/zen5 and accelerator nvidia/cc120
Job dir: /scratch/hb-eessibot/SHARED/jobs/2026.03/pr_1446/27611401

date | job status | comment
Mar 06 08:30:15 UTC 2026 | submitted | job id 27611401 awaits release by job manager
Mar 06 08:31:23 UTC 2026 | released | job awaits launch by Slurm scheduler
Mar 06 08:33:27 UTC 2026 | running | job 27611401 is running
Mar 06 08:39:32 UTC 2026 | finished | 😁 SUCCESS
Details
✅ job output file slurm-27611401.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.* created!
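
The checklist above amounts to scanning the job output for patterns that must be absent and patterns that must be present. A rough Python sketch of that idea follows. It is not the bot's implementation: the function name `check_job_output` is hypothetical, treating the listed strings as regular expressions is an assumption, and the sample text is a made-up stand-in rather than real job output.

```python
import re

# Patterns copied from the checklist; interpreting them as regular
# expressions is an assumption of this sketch.
MUST_BE_ABSENT = ["FATAL:", "ERROR:", "FAILED:", "required modules missing:"]
MUST_BE_PRESENT = ["No missing installations", r".tar.* created!"]


def check_job_output(text: str) -> dict:
    """Return one pass/fail entry per pattern."""
    results = {}
    for pattern in MUST_BE_ABSENT:
        results[f"no message matching {pattern}"] = re.search(pattern, text) is None
    for pattern in MUST_BE_PRESENT:
        results[f"found message matching {pattern}"] = re.search(pattern, text) is not None
    return results


# Made-up stand-in for a slurm-<jobid>.out file, not real bot output.
sample = "No missing installations\nexample.tar.zst created!\n"
report = check_job_output(sample)
print(all(report.values()))
```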
Artefacts
eessi-2025.06-software-linux-x86_64-amd-zen5-accel-nvidia-cc120-17727861750.tar.zst
size: 3390 MiB (3554964413 bytes)
entries: 6476
modules under 2025.06/software/linux/x86_64/amd/zen5/accel/nvidia/cc120/modules/all
CUDA/12.6.0.lua
CUDA/12.8.0.lua
cuDNN/9.10.1.4-CUDA-12.8.0.lua
cuDNN/9.5.0.50-CUDA-12.6.0.lua
software under 2025.06/software/linux/x86_64/amd/zen5/accel/nvidia/cc120/software
CUDA/12.6.0
CUDA/12.8.0
cuDNN/9.10.1.4-CUDA-12.8.0
cuDNN/9.5.0.50-CUDA-12.6.0
reprod directories under 2025.06/software/linux/x86_64/amd/zen5/accel/nvidia/cc120/reprod
CUDA/12.6.0/20260306_083250UTC
CUDA/12.8.0/20260306_083512UTC
cuDNN/9.10.1.4-CUDA-12.8.0/20260306_083608UTC
cuDNN/9.5.0.50-CUDA-12.6.0/20260306_083514UTC
other under 2025.06/software/linux/x86_64/amd/zen5/accel/nvidia/cc120
no other files in tarball
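
The reprod directory names end in a timestamp such as `20260306_083250UTC`, which makes it possible to recover the order in which the four installations were recorded. A small Python sketch, assuming the suffix is a `YYYYMMDD_HHMMSS` time in UTC (inferred from the names, not confirmed by the bot); the helper `reprod_time` is hypothetical.

```python
from datetime import datetime, timezone

# Directory names copied from the cc120 report above.
reprod_dirs = [
    "CUDA/12.6.0/20260306_083250UTC",
    "CUDA/12.8.0/20260306_083512UTC",
    "cuDNN/9.10.1.4-CUDA-12.8.0/20260306_083608UTC",
    "cuDNN/9.5.0.50-CUDA-12.6.0/20260306_083514UTC",
]


def reprod_time(path: str) -> datetime:
    """Parse the trailing timestamp component of a reprod directory path."""
    stamp = path.rsplit("/", 1)[1]
    parsed = datetime.strptime(stamp, "%Y%m%d_%H%M%SUTC")
    return parsed.replace(tzinfo=timezone.utc)


build_order = sorted(reprod_dirs, key=reprod_time)
for path in build_order:
    print(reprod_time(path).isoformat(), path.rsplit("/", 1)[0])
```

Sorted this way, both CUDA versions come first, followed by cuDNN 9.5.0.50 and then cuDNN 9.10.1.4.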
Mar 06 08:39:32 UTC 2026 | test result | 😁 SUCCESS
ReFrame Summary
[ PASSED ] Ran 0/0 test case(s) from 0 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-27611401.out
✅ no message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case
Mar 06 09:41:41 UTC 2026 | uploaded | transfer of eessi-2025.06-software-linux-x86_64-amd-zen5-accel-nvidia-cc120-17727861750.tar.zst to S3 bucket succeeded
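
The status timestamps in this report can be turned into queue and run times. A short Python sketch using the submitted, running, and finished times of job 27611401; the format string is an assumption based on how the dates are printed, and `%b` relies on an English locale for month names.

```python
from datetime import datetime

# Assumed layout of the timestamps, e.g. "Mar 06 08:30:15 UTC 2026".
FMT = "%b %d %H:%M:%S UTC %Y"

submitted = datetime.strptime("Mar 06 08:30:15 UTC 2026", FMT)
running = datetime.strptime("Mar 06 08:33:27 UTC 2026", FMT)
finished = datetime.strptime("Mar 06 08:39:32 UTC 2026", FMT)

queue_wait = running - submitted   # time between submission and start
run_time = finished - running      # time the job spent running

print("queue wait:", queue_wait)
print("run time:  ", run_time)
```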

bedroge (Collaborator, Author) commented Mar 6, 2026

bot: build repo:eessi.io-2025.06-software instance:eessi-bot-aws-eu-south on:arch=x86_64/amd/zen5 for:arch=x86_64/amd/zen5,accel=nvidia/cc70

eessi-bot-aws-eu-south (Bot) commented Mar 6, 2026

New job on instance eessi-bot-aws-eu-south for repository eessi.io-2025.06-software
Building on: amd-zen5
Building for: x86_64/amd/zen5 and accelerator nvidia/cc70
Job dir: /project/def-users/SHARED/jobs/2026.03/pr_1446/78

date | job status | comment
Mar 06 08:33:29 UTC 2026 | submitted | job id 78 awaits release by job manager
Mar 06 08:33:45 UTC 2026 | released | job awaits launch by Slurm scheduler
Mar 06 08:38:58 UTC 2026 | running | job 78 is running
Mar 06 09:01:21 UTC 2026 | finished | 😁 SUCCESS
Details
✅ job output file slurm-78.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.* created!
Artefacts
eessi-2025.06-software-linux-x86_64-amd-zen5-accel-nvidia-cc70-17727872950.tar.zst
size: 5712 MiB (5990405429 bytes)
entries: 12678
modules under 2025.06/software/linux/x86_64/amd/zen5/accel/nvidia/cc70/modules/all
CUDA/12.6.0.lua
CUDA/12.8.0.lua
cuDNN/9.10.1.4-CUDA-12.8.0.lua
cuDNN/9.5.0.50-CUDA-12.6.0.lua
software under 2025.06/software/linux/x86_64/amd/zen5/accel/nvidia/cc70/software
CUDA/12.6.0
CUDA/12.8.0
cuDNN/9.10.1.4-CUDA-12.8.0
cuDNN/9.5.0.50-CUDA-12.6.0
reprod directories under 2025.06/software/linux/x86_64/amd/zen5/accel/nvidia/cc70/reprod
CUDA/12.6.0/20260306_084153UTC
CUDA/12.8.0/20260306_084459UTC
cuDNN/9.10.1.4-CUDA-12.8.0/20260306_085237UTC
cuDNN/9.5.0.50-CUDA-12.6.0/20260306_084732UTC
other under 2025.06/software/linux/x86_64/amd/zen5/accel/nvidia/cc70
no other files in tarball
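
The MiB figures in the artefact sizes can be checked against the byte counts. The reported values are consistent with integer division by 2^20 rather than rounding to the nearest MiB, since the cc70 tarball would round up to 5713. A quick Python check (the variable names are illustrative, and how the bot actually computes the figure is an assumption):

```python
MIB = 2**20  # bytes per MiB

# Byte counts copied from the cc120 and cc70 artefact lines.
sizes = {"cc120": 3554964413, "cc70": 5990405429}

for target, size_bytes in sizes.items():
    floor_mib = size_bytes // MIB           # truncating division
    nearest_mib = round(size_bytes / MIB)   # round to nearest
    print(target, floor_mib, nearest_mib)
# cc120 3390 3390
# cc70 5712 5713
```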
Mar 06 09:01:21 UTC 2026 | test result | 😁 SUCCESS
ReFrame Summary
[ OK ] (1/4) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_allreduce %module_name=OSU-Micro-Benchmarks/7.5-gompi-2025a %scale=1_node %device_type=cpu /e4bf9965 @BotBuildTests:x86-64-zen5+default
P: latency: 1.31 us (r:0, l:None, u:None)
[ OK ] (2/4) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_alltoall %module_name=OSU-Micro-Benchmarks/7.5-gompi-2025a %scale=1_node %device_type=cpu /3da4890b @BotBuildTests:x86-64-zen5+default
P: latency: 2.89 us (r:0, l:None, u:None)
[ OK ] (3/4) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_latency %module_name=OSU-Micro-Benchmarks/7.5-gompi-2025a %scale=1_node /3255009a @BotBuildTests:x86-64-zen5+default
P: latency: 0.2 us (r:0, l:None, u:None)
[ OK ] (4/4) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_bw %module_name=OSU-Micro-Benchmarks/7.5-gompi-2025a %scale=1_node /59f4b331 @BotBuildTests:x86-64-zen5+default
P: bandwidth: 39001.71 MB/s (r:0, l:None, u:None)
[ PASSED ] Ran 4/4 test case(s) from 4 check(s) (0 failure(s), 0 skipped, 0 aborted)
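
Each `P:` line in the ReFrame summary carries a metric name, a value, and a unit, followed by the reference settings in parentheses. A minimal Python sketch for pulling those numbers out of the pasted text so they can be compared between builds; the regular expression and the helper `parse_perf` are assumptions tailored to the lines shown here, not part of ReFrame.

```python
import re

# Matches lines such as "P: latency: 1.31 us (r:0, l:None, u:None)".
PERF_RE = re.compile(r"^P: (?P<name>\w+): (?P<value>[\d.]+) (?P<unit>\S+)")


def parse_perf(lines):
    """Return (metric, value, unit) tuples for every 'P:' line."""
    parsed = []
    for line in lines:
        match = PERF_RE.match(line.strip())
        if match:
            parsed.append((match["name"], float(match["value"]), match["unit"]))
    return parsed


# Performance lines copied from the cc70 report above.
summary = [
    "P: latency: 1.31 us (r:0, l:None, u:None)",
    "P: latency: 2.89 us (r:0, l:None, u:None)",
    "P: latency: 0.2 us (r:0, l:None, u:None)",
    "P: bandwidth: 39001.71 MB/s (r:0, l:None, u:None)",
]
for metric, value, unit in parse_perf(summary):
    print(f"{metric}: {value} {unit}")
```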
Details
✅ job output file slurm-78.out
✅ no message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case
Mar 06 09:43:22 UTC 2026 | uploaded | transfer of eessi-2025.06-software-linux-x86_64-amd-zen5-accel-nvidia-cc70-17727872950.tar.zst to S3 bucket succeeded

bedroge (Collaborator, Author) commented Mar 6, 2026

bot: build repo:eessi.io-2025.06-software instance:eessi-bot-aws-eu-south on:arch=x86_64/amd/zen5 for:arch=x86_64/amd/zen5,accel=nvidia/cc80
bot: build repo:eessi.io-2025.06-software instance:eessi-bot-aws-eu-south on:arch=x86_64/amd/zen5 for:arch=x86_64/amd/zen5,accel=nvidia/cc90
bot: build repo:eessi.io-2025.06-software instance:eessi-bot-aws-eu-south on:arch=x86_64/amd/zen5 for:arch=x86_64/amd/zen5,accel=nvidia/cc100
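
The three commands above differ only in the compute capability at the end of the `accel=` value, so they can be generated from a list. A small Python sketch that reproduces them; `build_command` is a hypothetical helper written for this illustration, not an EESSI tool.

```python
def build_command(repo, instance, arch, accel, on_arch=None):
    """Assemble a 'bot: build' line in the shape used in this PR."""
    parts = ["bot: build", f"repo:{repo}", f"instance:{instance}"]
    if on_arch:
        parts.append(f"on:arch={on_arch}")
    parts.append(f"for:arch={arch},accel={accel}")
    return " ".join(parts)


commands = [
    build_command(
        repo="eessi.io-2025.06-software",
        instance="eessi-bot-aws-eu-south",
        arch="x86_64/amd/zen5",
        accel=f"nvidia/{cc}",
        on_arch="x86_64/amd/zen5",
    )
    for cc in ("cc80", "cc90", "cc100")
]
print("\n".join(commands))
```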

eessi-bot-aws-eu-south (Bot) commented Mar 6, 2026

New job on instance eessi-bot-aws-eu-south for repository eessi.io-2025.06-software
Building on: amd-zen5
Building for: x86_64/amd/zen5 and accelerator nvidia/cc80
Job dir: /project/def-users/SHARED/jobs/2026.03/pr_1446/79

date | job status | comment
Mar 06 09:06:29 UTC 2026 | submitted | job id 79 awaits release by job manager
Mar 06 09:06:33 UTC 2026 | released | job awaits launch by Slurm scheduler
Mar 06 09:07:47 UTC 2026 | running | job 79 is running
Mar 06 09:21:09 UTC 2026 | finished | 😁 SUCCESS
Details
✅ job output file slurm-79.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.* created!
Artefacts
eessi-2025.06-software-linux-x86_64-amd-zen5-accel-nvidia-cc80-17727885100.tar.zst
size: 5712 MiB (5990399852 bytes)
entries: 12678
modules under 2025.06/software/linux/x86_64/amd/zen5/accel/nvidia/cc80/modules/all
CUDA/12.6.0.lua
CUDA/12.8.0.lua
cuDNN/9.10.1.4-CUDA-12.8.0.lua
cuDNN/9.5.0.50-CUDA-12.6.0.lua
software under 2025.06/software/linux/x86_64/amd/zen5/accel/nvidia/cc80/software
CUDA/12.6.0
CUDA/12.8.0
cuDNN/9.10.1.4-CUDA-12.8.0
cuDNN/9.5.0.50-CUDA-12.6.0
reprod directories under 2025.06/software/linux/x86_64/amd/zen5/accel/nvidia/cc80/reprod
CUDA/12.6.0/20260306_090922UTC
CUDA/12.8.0/20260306_091131UTC
cuDNN/9.10.1.4-CUDA-12.8.0/20260306_091342UTC
cuDNN/9.5.0.50-CUDA-12.6.0/20260306_091230UTC
other under 2025.06/software/linux/x86_64/amd/zen5/accel/nvidia/cc80
no other files in tarball
Mar 06 09:21:09 UTC 2026 | test result | 😁 SUCCESS
ReFrame Summary
[ OK ] (1/4) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_allreduce %module_name=OSU-Micro-Benchmarks/7.5-gompi-2025a %scale=1_node %device_type=cpu /e4bf9965 @BotBuildTests:x86-64-zen5+default
P: latency: 1.29 us (r:0, l:None, u:None)
[ OK ] (2/4) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_alltoall %module_name=OSU-Micro-Benchmarks/7.5-gompi-2025a %scale=1_node %device_type=cpu /3da4890b @BotBuildTests:x86-64-zen5+default
P: latency: 2.87 us (r:0, l:None, u:None)
[ OK ] (3/4) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_latency %module_name=OSU-Micro-Benchmarks/7.5-gompi-2025a %scale=1_node /3255009a @BotBuildTests:x86-64-zen5+default
P: latency: 0.19 us (r:0, l:None, u:None)
[ OK ] (4/4) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_bw %module_name=OSU-Micro-Benchmarks/7.5-gompi-2025a %scale=1_node /59f4b331 @BotBuildTests:x86-64-zen5+default
P: bandwidth: 44765.51 MB/s (r:0, l:None, u:None)
[ PASSED ] Ran 4/4 test case(s) from 4 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-79.out
✅ no message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case
Mar 06 09:46:16 UTC 2026 | uploaded | transfer of eessi-2025.06-software-linux-x86_64-amd-zen5-accel-nvidia-cc80-17727885100.tar.zst to S3 bucket succeeded

eessi-bot-aws-eu-south (Bot) commented Mar 6, 2026

New job on instance eessi-bot-aws-eu-south for repository eessi.io-2025.06-software
Building on: amd-zen5
Building for: x86_64/amd/zen5 and accelerator nvidia/cc90
Job dir: /project/def-users/SHARED/jobs/2026.03/pr_1446/80

date | job status | comment
Mar 06 09:06:36 UTC 2026 | submitted | job id 80 awaits release by job manager
Mar 06 09:07:43 UTC 2026 | released | job awaits launch by Slurm scheduler
Mar 06 09:15:21 UTC 2026 | running | job 80 is running
Mar 06 09:29:01 UTC 2026 | finished | 😁 SUCCESS
Details
✅ job output file slurm-80.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.* created!
Artefacts
eessi-2025.06-software-linux-x86_64-amd-zen5-accel-nvidia-cc90-17727890620.tar.zst
size: 5712 MiB (5990334940 bytes)
entries: 12678
modules under 2025.06/software/linux/x86_64/amd/zen5/accel/nvidia/cc90/modules/all
CUDA/12.6.0.lua
CUDA/12.8.0.lua
cuDNN/9.10.1.4-CUDA-12.8.0.lua
cuDNN/9.5.0.50-CUDA-12.6.0.lua
software under 2025.06/software/linux/x86_64/amd/zen5/accel/nvidia/cc90/software
CUDA/12.6.0
CUDA/12.8.0
cuDNN/9.10.1.4-CUDA-12.8.0
cuDNN/9.5.0.50-CUDA-12.6.0
reprod directories under 2025.06/software/linux/x86_64/amd/zen5/accel/nvidia/cc90/reprod
CUDA/12.6.0/20260306_091744UTC
CUDA/12.8.0/20260306_092024UTC
cuDNN/9.10.1.4-CUDA-12.8.0/20260306_092235UTC
cuDNN/9.5.0.50-CUDA-12.6.0/20260306_092123UTC
other under 2025.06/software/linux/x86_64/amd/zen5/accel/nvidia/cc90
no other files in tarball
Mar 06 09:29:01 UTC 2026 | test result | 😁 SUCCESS
ReFrame Summary
[ OK ] (1/4) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_allreduce %module_name=OSU-Micro-Benchmarks/7.5-gompi-2025a %scale=1_node %device_type=cpu /e4bf9965 @BotBuildTests:x86-64-zen5+default
P: latency: 1.25 us (r:0, l:None, u:None)
[ OK ] (2/4) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_alltoall %module_name=OSU-Micro-Benchmarks/7.5-gompi-2025a %scale=1_node %device_type=cpu /3da4890b @BotBuildTests:x86-64-zen5+default
P: latency: 2.81 us (r:0, l:None, u:None)
[ OK ] (3/4) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_latency %module_name=OSU-Micro-Benchmarks/7.5-gompi-2025a %scale=1_node /3255009a @BotBuildTests:x86-64-zen5+default
P: latency: 0.2 us (r:0, l:None, u:None)
[ OK ] (4/4) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_bw %module_name=OSU-Micro-Benchmarks/7.5-gompi-2025a %scale=1_node /59f4b331 @BotBuildTests:x86-64-zen5+default
P: bandwidth: 38041.73 MB/s (r:0, l:None, u:None)
[ PASSED ] Ran 4/4 test case(s) from 4 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-80.out
✅ no message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case
Mar 06 09:49:18 UTC 2026 | uploaded | transfer of eessi-2025.06-software-linux-x86_64-amd-zen5-accel-nvidia-cc90-17727890620.tar.zst to S3 bucket succeeded

eessi-bot-aws-eu-south (Bot) commented Mar 6, 2026

New job on instance eessi-bot-aws-eu-south for repository eessi.io-2025.06-software
Building on: amd-zen5
Building for: x86_64/amd/zen5 and accelerator nvidia/cc100
Job dir: /project/def-users/SHARED/jobs/2026.03/pr_1446/81

date | job status | comment
Mar 06 09:06:42 UTC 2026 | submitted | job id 81 awaits release by job manager
Mar 06 09:07:39 UTC 2026 | released | job awaits launch by Slurm scheduler
Mar 06 09:13:08 UTC 2026 | running | job 81 is running
Mar 06 09:22:16 UTC 2026 | finished | 😁 SUCCESS
Details
✅ job output file slurm-81.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.* created!
Artefacts
eessi-2025.06-software-linux-x86_64-amd-zen5-accel-nvidia-cc100-17727887350.tar.zst
size: 3390 MiB (3555058675 bytes)
entries: 6476
modules under 2025.06/software/linux/x86_64/amd/zen5/accel/nvidia/cc100/modules/all
CUDA/12.6.0.lua
CUDA/12.8.0.lua
cuDNN/9.10.1.4-CUDA-12.8.0.lua
cuDNN/9.5.0.50-CUDA-12.6.0.lua
software under 2025.06/software/linux/x86_64/amd/zen5/accel/nvidia/cc100/software
CUDA/12.6.0
CUDA/12.8.0
cuDNN/9.10.1.4-CUDA-12.8.0
cuDNN/9.5.0.50-CUDA-12.6.0
reprod directories under 2025.06/software/linux/x86_64/amd/zen5/accel/nvidia/cc100/reprod
CUDA/12.6.0/20260306_091438UTC
CUDA/12.8.0/20260306_091709UTC
cuDNN/9.10.1.4-CUDA-12.8.0/20260306_091836UTC
cuDNN/9.5.0.50-CUDA-12.6.0/20260306_091716UTC
other under 2025.06/software/linux/x86_64/amd/zen5/accel/nvidia/cc100
no other files in tarball
Mar 06 09:22:16 UTC 2026 | test result | 😁 SUCCESS
ReFrame Summary
[ OK ] (1/4) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_allreduce %module_name=OSU-Micro-Benchmarks/7.5-gompi-2025a %scale=1_node %device_type=cpu /e4bf9965 @BotBuildTests:x86-64-zen5+default
P: latency: 1.28 us (r:0, l:None, u:None)
[ OK ] (2/4) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_alltoall %module_name=OSU-Micro-Benchmarks/7.5-gompi-2025a %scale=1_node %device_type=cpu /3da4890b @BotBuildTests:x86-64-zen5+default
P: latency: 2.87 us (r:0, l:None, u:None)
[ OK ] (3/4) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_latency %module_name=OSU-Micro-Benchmarks/7.5-gompi-2025a %scale=1_node /3255009a @BotBuildTests:x86-64-zen5+default
P: latency: 0.17 us (r:0, l:None, u:None)
[ OK ] (4/4) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_bw %module_name=OSU-Micro-Benchmarks/7.5-gompi-2025a %scale=1_node /59f4b331 @BotBuildTests:x86-64-zen5+default
P: bandwidth: 45459.62 MB/s (r:0, l:None, u:None)
[ PASSED ] Ran 4/4 test case(s) from 4 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-81.out
✅ no message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case
Mar 06 09:51:12 UTC 2026 | uploaded | transfer of eessi-2025.06-software-linux-x86_64-amd-zen5-accel-nvidia-cc100-17727887350.tar.zst to S3 bucket succeeded

bedroge added the ready-to-deploy (Mark a PR as ready to deploy) label Mar 6, 2026
ocaisa added the bot:deploy (Ask bot to deploy missing software installations to EESSI) label and removed the ready-to-deploy (Mark a PR as ready to deploy) label Mar 6, 2026

ocaisa (Member) commented Mar 6, 2026

Staging PR merged

ocaisa merged commit 4ff6b14 into EESSI:main Mar 6, 2026
53 checks passed
bedroge deleted the zen5_accel branch March 6, 2026 10:45
Labels

2025.06-software.eessi.io (2025.06 version of software.eessi.io), accel:nvidia, bot:deploy (Ask bot to deploy missing software installations to EESSI), zen5

2 participants