@bedroge
Collaborator

@bedroge bedroge commented Nov 13, 2025

No description provided.

@bedroge bedroge added the 2023.06-software.eessi.io (2023.06 version of software.eessi.io) and a64fx labels Nov 13, 2025
@bedroge
Collaborator Author

bedroge commented Nov 13, 2025

bot: build repo:eessi.io-2023.06-software instance:eessi-bot-deucalion for:arch=aarch64/a64fx
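
The command above has a simple shape: a `bot:` prefix, an action, and a series of `key:value` arguments. The following is a minimal Python sketch of splitting such a line apart. It is not the bot's actual parser; the function name `parse_bot_command` and the returned dictionary layout are assumptions made here for illustration.

```python
def parse_bot_command(line: str) -> dict:
    """Split a 'bot: <action> key:value ...' comment line into its parts."""
    prefix, _, rest = line.partition(":")
    if prefix.strip() != "bot":
        raise ValueError("not a bot command")
    action, *args = rest.split()
    # Each argument is 'key:value'; split on the first colon only,
    # so a value such as 'arch=aarch64/a64fx' stays intact.
    options = dict(arg.split(":", 1) for arg in args)
    return {"action": action, "options": options}


cmd = parse_bot_command(
    "bot: build repo:eessi.io-2023.06-software "
    "instance:eessi-bot-deucalion for:arch=aarch64/a64fx"
)
print(cmd["action"], cmd["options"])
```

Read this way, the command asks the `eessi-bot-deucalion` instance to build for `arch=aarch64/a64fx` against the `eessi.io-2023.06-software` repository.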

@eessi-bot-deucalion

eessi-bot-deucalion bot commented Nov 13, 2025

New job on instance eessi-bot-deucalion for repository eessi.io-2023.06-software
Building on: a64fx
Building for: aarch64/a64fx
Job dir: /home/eessibot/new-bot/jobs/2025.11/pr_1302/682166

date | job status | comment
Nov 13 14:35:48 UTC 2025 | submitted | job id 682166 awaits release by job manager
Nov 13 14:36:21 UTC 2025 | released | job awaits launch by Slurm scheduler
Nov 13 14:37:26 UTC 2025 | running | job 682166 is running
Nov 13 15:51:40 UTC 2025 | finished | 🤷 UNKNOWN
  • Job results file _bot_job682166.result does not exist in job directory, or parsing it failed.
  • No artefacts were found/reported.
Nov 13 15:51:40 UTC 2025 | test result | 🤷 UNKNOWN
  • Job test file _bot_job682166.test does not exist in job directory, or parsing it failed.

@boegel
Contributor

boegel commented Nov 13, 2025

@bedroge easyconfig PR needs work? see easybuilders/easybuild-easyconfigs#24548 (comment)

@bedroge
Collaborator Author

bedroge commented Nov 13, 2025

> @bedroge easyconfig PR needs work? see easybuilders/easybuild-easyconfigs#24548 (comment)

Looks like that may be specific to non-Arm systems. I just want to make sure that it does solve the issue on A64FX, so let's give it another try (not sure what happened to the previous job).

bot: build repo:eessi.io-2023.06-software instance:eessi-bot-deucalion for:arch=aarch64/a64fx

@eessi-bot-deucalion

eessi-bot-deucalion bot commented Nov 13, 2025

New job on instance eessi-bot-deucalion for repository eessi.io-2023.06-software
Building on: a64fx
Building for: aarch64/a64fx
Job dir: /home/eessibot/new-bot/jobs/2025.11/pr_1302/682642

date | job status | comment
Nov 13 22:04:27 UTC 2025 | submitted | job id 682642 awaits release by job manager
Nov 13 22:04:36 UTC 2025 | released | job awaits launch by Slurm scheduler
Nov 13 22:05:40 UTC 2025 | running | job 682642 is running
Nov 13 23:22:40 UTC 2025 | finished | 🤷 UNKNOWN
  • Job results file _bot_job682642.result does not exist in job directory, or parsing it failed.
  • No artefacts were found/reported.
Nov 13 23:22:40 UTC 2025 | test result | 🤷 UNKNOWN
  • Job test file _bot_job682642.test does not exist in job directory, or parsing it failed.

@bedroge
Collaborator Author

bedroge commented Nov 14, 2025

The node crashed again, so I'm reducing the number of cores in EESSI/software-layer-scripts#129.

Comment out GROMACS entries and their options.
@bedroge
Collaborator Author

bedroge commented Nov 15, 2025

One more try with only GROMACS 2024.4, and using EB 5.1.2.

bot: build repo:eessi.io-2023.06-software instance:eessi-bot-deucalion for:arch=aarch64/a64fx

@eessi-bot-deucalion

eessi-bot-deucalion bot commented Nov 15, 2025

New job on instance eessi-bot-deucalion for repository eessi.io-2023.06-software
Building on: a64fx
Building for: aarch64/a64fx
Job dir: /home/eessibot/new-bot/jobs/2025.11/pr_1302/683695

date | job status | comment
Nov 15 09:10:10 UTC 2025 | submitted | job id 683695 awaits release by job manager
Nov 15 09:11:07 UTC 2025 | released | job awaits launch by Slurm scheduler
Nov 15 11:00:19 UTC 2025 | running | job 683695 is running
Nov 15 11:23:54 UTC 2025 | finished | 😁 SUCCESS
Details
✅ job output file slurm-683695.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.* created!
Artefacts
eessi-2023.06-software-linux-aarch64-a64fx-17632055080.tar.zst
size: 0 MiB (22 bytes)
entries: 0
modules under 2023.06/software/linux/aarch64/a64fx/modules/all
no module files in tarball
software under 2023.06/software/linux/aarch64/a64fx/software
no software packages in tarball
reprod directories under 2023.06/software/linux/aarch64/a64fx/reprod
no reprod directories in tarball
other under 2023.06/software/linux/aarch64/a64fx
no other files in tarball
Nov 15 11:23:54 UTC 2025 | test result | 😁 SUCCESS
ReFrame Summary
[ SKIP ] ( 1/10) Skipping test: nodes in this partition only have 30720 MiB memory available (per node) according to the current ReFrame configuration, but 49152 MiB is needed
[ SKIP ] ( 2/10) Skipping test: nodes in this partition only have 30720 MiB memory available (per node) according to the current ReFrame configuration, but 49152 MiB is needed
[ SKIP ] ( 3/10) Skipping test: nodes in this partition only have 30720 MiB memory available (per node) according to the current ReFrame configuration, but 49152 MiB is needed
[ SKIP ] ( 4/10) Skipping test: nodes in this partition only have 30720 MiB memory available (per node) according to the current ReFrame configuration, but 49152 MiB is needed
[ OK ] ( 5/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_latency %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %scale=1_node /15cad6c4 @BotBuildTests:aarch64_a64fx+default
P: latency: 1.72 us (r:0, l:None, u:None)
[ OK ] ( 6/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_latency %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %scale=1_node /6672deda @BotBuildTests:aarch64_a64fx+default
P: latency: 1.73 us (r:0, l:None, u:None)
[ OK ] ( 7/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_bw %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %scale=1_node /2a9a47b1 @BotBuildTests:aarch64_a64fx+default
P: bandwidth: 8465.62 MB/s (r:0, l:None, u:None)
[ OK ] ( 8/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_bw %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %scale=1_node /1b24ab8e @BotBuildTests:aarch64_a64fx+default
P: bandwidth: 8715.38 MB/s (r:0, l:None, u:None)
[ OK ] ( 9/10) EESSI_LAMMPS_lj %device_type=cpu %module_name=LAMMPS/29Aug2024-foss-2023b-kokkos %scale=1_node /aeb2d9df @BotBuildTests:aarch64_a64fx+default
P: perf: 580.631 timesteps/s (r:0, l:None, u:None)
[ OK ] (10/10) EESSI_LAMMPS_lj %device_type=cpu %module_name=LAMMPS/2Aug2023_update2-foss-2023a-kokkos %scale=1_node /04ff9ece @BotBuildTests:aarch64_a64fx+default
P: perf: 578.333 timesteps/s (r:0, l:None, u:None)
[ PASSED ] Ran 6/10 test case(s) from 10 check(s) (0 failure(s), 4 skipped, 0 aborted)
Details
✅ job output file slurm-683695.out
✅ no message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case
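
The ReFrame summary above can also be read mechanically. The following is a minimal Python sketch, not part of the bot or the EESSI test suite: the regular expressions and helper names are assumptions made here for illustration, and only the two quoted strings come from the log.

```python
import re

# Both strings are copied from the ReFrame summary in the bot comment.
SKIP_LINE = (
    "nodes in this partition only have 30720 MiB memory available (per node) "
    "according to the current ReFrame configuration, but 49152 MiB is needed"
)
SUMMARY_LINE = (
    "[ PASSED ] Ran 6/10 test case(s) from 10 check(s) "
    "(0 failure(s), 4 skipped, 0 aborted)"
)


def memory_shortfall_mib(skip_line: str) -> int:
    """Return how many MiB per node are missing according to a skip message."""
    available, needed = map(int, re.findall(r"(\d+) MiB", skip_line))
    return needed - available


def parse_summary(summary_line: str) -> dict:
    """Extract the counters from the final ReFrame summary line."""
    m = re.search(
        r"Ran (\d+)/(\d+) test case\(s\) from (\d+) check\(s\) "
        r"\((\d+) failure\(s\), (\d+) skipped, (\d+) aborted\)",
        summary_line,
    )
    if m is None:
        raise ValueError("unrecognised summary line")
    keys = ("ran", "total", "checks", "failures", "skipped", "aborted")
    return dict(zip(keys, map(int, m.groups())))


shortfall = memory_shortfall_mib(SKIP_LINE)
counts = parse_summary(SUMMARY_LINE)
print(shortfall, counts)
```

In this run the six executed cases plus the four skipped ones account for all ten, and each skip message reports a node that is 18432 MiB (18 GiB) short of the 49152 MiB the test asks for.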

@bedroge
Collaborator Author

bedroge commented Nov 15, 2025

It didn't process the second easystack, let's try again with just that one.

bot: build repo:eessi.io-2023.06-software instance:eessi-bot-deucalion for:arch=aarch64/a64fx

@eessi-bot-deucalion

eessi-bot-deucalion bot commented Nov 15, 2025

New job on instance eessi-bot-deucalion for repository eessi.io-2023.06-software
Building on: a64fx
Building for: aarch64/a64fx
Job dir: /home/eessibot/new-bot/jobs/2025.11/pr_1302/683915

date | job status | comment
Nov 15 17:10:07 UTC 2025 | submitted | job id 683915 awaits release by job manager
Nov 15 17:10:31 UTC 2025 | released | job awaits launch by Slurm scheduler
Nov 15 17:11:33 UTC 2025 | finished | 😢 FAILURE
Details
✅ job output file slurm-683915.out
✅ no message matching FATAL:
❌ found message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.* created!
Artefacts
No artefacts were created or found.
Nov 15 17:11:33 UTC 2025 | test result | 😢 FAILURE
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-683915.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case
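
The build-step Details lists in the bot comments suggest a verdict derived from patterns that must or must not appear in the Slurm output file. The following is a minimal Python sketch of that idea. It is not the bot's implementation: treating the patterns as regular expressions, the function names, and the sample log text are all assumptions made here for illustration.

```python
import re

# Patterns copied from the Details lists; interpreting them as regular
# expressions searched in the whole job output is an assumption.
MUST_NOT_MATCH = ["FATAL:", "ERROR:", "FAILED:", "required modules missing:"]
MUST_MATCH = ["No missing installations", r".tar.* created!"]


def check_job_output(text: str) -> dict:
    """Map each pattern to True if its check passes, False otherwise."""
    results = {}
    for pattern in MUST_NOT_MATCH:
        results[pattern] = re.search(pattern, text) is None
    for pattern in MUST_MATCH:
        results[pattern] = re.search(pattern, text) is not None
    return results


def verdict(results: dict) -> str:
    """A job succeeds only if every single check passes."""
    return "SUCCESS" if all(results.values()) else "FAILURE"


# Hypothetical log excerpts, invented for this example.
good_log = "No missing installations\n/tmp/example.tar.zst created!\n"
bad_log = "ERROR: something went wrong\n/tmp/example.tar.zst created!\n"

print(verdict(check_job_output(good_log)))
print(verdict(check_job_output(bad_log)))
```

The second sample mirrors the pattern of marks reported for job 683915: an `ERROR:` line is present and `No missing installations` is absent, so the verdict is a failure even though the tarball message was found.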

@bedroge
Collaborator Author

bedroge commented Nov 15, 2025

Easystack had a weird character in its filename, let's try again...

bot: build repo:eessi.io-2023.06-software instance:eessi-bot-deucalion for:arch=aarch64/a64fx

@eessi-bot-deucalion

eessi-bot-deucalion bot commented Nov 15, 2025

New job on instance eessi-bot-deucalion for repository eessi.io-2023.06-software
Building on: a64fx
Building for: aarch64/a64fx
Job dir: /home/eessibot/new-bot/jobs/2025.11/pr_1302/684166

date | job status | comment
Nov 15 19:07:41 UTC 2025 | submitted | job id 684166 awaits release by job manager
Nov 15 19:08:11 UTC 2025 | released | job awaits launch by Slurm scheduler
Nov 15 19:09:13 UTC 2025 | running | job 684166 is running
Nov 15 22:01:39 UTC 2025 | finished | 😁 SUCCESS
Details
✅ job output file slurm-684166.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.* created!
Artefacts
eessi-2023.06-software-linux-aarch64-a64fx-17632437510.tar.zst
size: 43 MiB (45803683 bytes)
entries: 769
modules under 2023.06/software/linux/aarch64/a64fx/modules/all
GROMACS/2024.4-foss-2023b.lua
software under 2023.06/software/linux/aarch64/a64fx/software
GROMACS/2024.4-foss-2023b
reprod directories under 2023.06/software/linux/aarch64/a64fx/reprod
no reprod directories in tarball
other under 2023.06/software/linux/aarch64/a64fx
no other files in tarball
Nov 15 22:01:39 UTC 2025 | test result | 😁 SUCCESS
ReFrame Summary
[ SKIP ] ( 1/10) Skipping test: nodes in this partition only have 30720 MiB memory available (per node) according to the current ReFrame configuration, but 49152 MiB is needed
[ SKIP ] ( 2/10) Skipping test: nodes in this partition only have 30720 MiB memory available (per node) according to the current ReFrame configuration, but 49152 MiB is needed
[ SKIP ] ( 3/10) Skipping test: nodes in this partition only have 30720 MiB memory available (per node) according to the current ReFrame configuration, but 49152 MiB is needed
[ SKIP ] ( 4/10) Skipping test: nodes in this partition only have 30720 MiB memory available (per node) according to the current ReFrame configuration, but 49152 MiB is needed
[ OK ] ( 5/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_latency %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %scale=1_node /15cad6c4 @BotBuildTests:aarch64_a64fx+default
P: latency: 1.73 us (r:0, l:None, u:None)
[ OK ] ( 6/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_latency %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %scale=1_node /6672deda @BotBuildTests:aarch64_a64fx+default
P: latency: 1.73 us (r:0, l:None, u:None)
[ OK ] ( 7/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_bw %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %scale=1_node /2a9a47b1 @BotBuildTests:aarch64_a64fx+default
P: bandwidth: 8742.87 MB/s (r:0, l:None, u:None)
[ OK ] ( 8/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_bw %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %scale=1_node /1b24ab8e @BotBuildTests:aarch64_a64fx+default
P: bandwidth: 8193.03 MB/s (r:0, l:None, u:None)
[ OK ] ( 9/10) EESSI_LAMMPS_lj %device_type=cpu %module_name=LAMMPS/29Aug2024-foss-2023b-kokkos %scale=1_node /aeb2d9df @BotBuildTests:aarch64_a64fx+default
P: perf: 581.919 timesteps/s (r:0, l:None, u:None)
[ OK ] (10/10) EESSI_LAMMPS_lj %device_type=cpu %module_name=LAMMPS/2Aug2023_update2-foss-2023a-kokkos %scale=1_node /04ff9ece @BotBuildTests:aarch64_a64fx+default
P: perf: 529.517 timesteps/s (r:0, l:None, u:None)
[ PASSED ] Ran 6/10 test case(s) from 10 check(s) (0 failure(s), 4 skipped, 0 aborted)
Details
✅ job output file slurm-684166.out
✅ no message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@bedroge
Collaborator Author

bedroge commented Nov 16, 2025

bot: build repo:eessi.io-2023.06-software instance:eessi-bot-deucalion for:arch=aarch64/a64fx

@eessi-bot-deucalion

eessi-bot-deucalion bot commented Nov 16, 2025

New job on instance eessi-bot-deucalion for repository eessi.io-2023.06-software
Building on: a64fx
Building for: aarch64/a64fx
Job dir: /home/eessibot/new-bot/jobs/2025.11/pr_1302/684423

date | job status | comment
Nov 16 07:27:09 UTC 2025 | submitted | job id 684423 awaits release by job manager
Nov 16 07:27:18 UTC 2025 | released | job awaits launch by Slurm scheduler
Nov 16 07:28:20 UTC 2025 | running | job 684423 is running
Nov 16 13:04:51 UTC 2025 | finished | 😁 SUCCESS
Details
✅ job output file slurm-684423.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.* created!
Artefacts
eessi-2023.06-software-linux-aarch64-a64fx-17632978830.tar.zst
size: 87 MiB (91520012 bytes)
entries: 1538
modules under 2023.06/software/linux/aarch64/a64fx/modules/all
GROMACS/2024.3-foss-2023b.lua
GROMACS/2024.4-foss-2023b.lua
software under 2023.06/software/linux/aarch64/a64fx/software
GROMACS/2024.3-foss-2023b
GROMACS/2024.4-foss-2023b
reprod directories under 2023.06/software/linux/aarch64/a64fx/reprod
no reprod directories in tarball
other under 2023.06/software/linux/aarch64/a64fx
no other files in tarball
Nov 16 13:04:51 UTC 2025 | test result | 😁 SUCCESS
ReFrame Summary
[ SKIP ] ( 1/10) Skipping test: nodes in this partition only have 30720 MiB memory available (per node) according to the current ReFrame configuration, but 49152 MiB is needed
[ SKIP ] ( 2/10) Skipping test: nodes in this partition only have 30720 MiB memory available (per node) according to the current ReFrame configuration, but 49152 MiB is needed
[ SKIP ] ( 3/10) Skipping test: nodes in this partition only have 30720 MiB memory available (per node) according to the current ReFrame configuration, but 49152 MiB is needed
[ SKIP ] ( 4/10) Skipping test: nodes in this partition only have 30720 MiB memory available (per node) according to the current ReFrame configuration, but 49152 MiB is needed
[ OK ] ( 5/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_latency %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %scale=1_node /15cad6c4 @BotBuildTests:aarch64_a64fx+default
P: latency: 1.68 us (r:0, l:None, u:None)
[ OK ] ( 6/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_latency %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %scale=1_node /6672deda @BotBuildTests:aarch64_a64fx+default
P: latency: 1.74 us (r:0, l:None, u:None)
[ OK ] ( 7/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_bw %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %scale=1_node /2a9a47b1 @BotBuildTests:aarch64_a64fx+default
P: bandwidth: 8833.71 MB/s (r:0, l:None, u:None)
[ OK ] ( 8/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_bw %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %scale=1_node /1b24ab8e @BotBuildTests:aarch64_a64fx+default
P: bandwidth: 8698.85 MB/s (r:0, l:None, u:None)
[ OK ] ( 9/10) EESSI_LAMMPS_lj %device_type=cpu %module_name=LAMMPS/29Aug2024-foss-2023b-kokkos %scale=1_node /aeb2d9df @BotBuildTests:aarch64_a64fx+default
P: perf: 14.749 timesteps/s (r:0, l:None, u:None)
[ OK ] (10/10) EESSI_LAMMPS_lj %device_type=cpu %module_name=LAMMPS/2Aug2023_update2-foss-2023a-kokkos %scale=1_node /04ff9ece @BotBuildTests:aarch64_a64fx+default
P: perf: 581.586 timesteps/s (r:0, l:None, u:None)
[ PASSED ] Ran 6/10 test case(s) from 10 check(s) (0 failure(s), 4 skipped, 0 aborted)
Details
✅ job output file slurm-684423.out
✅ no message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case
Nov 16 22:23:59 UTC 2025 | uploaded | transfer of eessi-2023.06-software-linux-aarch64-a64fx-17632978830.tar.zst to S3 bucket succeeded
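
The performance lines in the ReFrame summaries show `r:0, l:None, u:None`, which reads as though no reference value or bounds are set, so a test can report `OK` whatever its measured rate. The following is a minimal Python sketch for comparing the LAMMPS rates across the three successful jobs in this thread. The numbers are copied from the summaries; the function name and the 0.5 threshold are assumptions made here for illustration, and the sketch says nothing about the cause of any difference.

```python
from statistics import median

# timesteps/s copied from the three ReFrame summaries in this thread,
# keyed by module name, then by Slurm job id.
PERF = {
    "LAMMPS/29Aug2024-foss-2023b-kokkos": {
        683695: 580.631, 684166: 581.919, 684423: 14.749,
    },
    "LAMMPS/2Aug2023_update2-foss-2023a-kokkos": {
        683695: 578.333, 684166: 529.517, 684423: 581.586,
    },
}


def find_outliers(perf: dict, min_ratio: float = 0.5) -> list:
    """Flag runs whose rate is below min_ratio times the per-test median."""
    flagged = []
    for module, runs in perf.items():
        typical = median(runs.values())
        for job_id, value in runs.items():
            if value < min_ratio * typical:
                flagged.append((module, job_id, value))
    return flagged


outliers = find_outliers(PERF)
print(outliers)
```

With these inputs only the 14.749 timesteps/s result of `LAMMPS/29Aug2024-foss-2023b-kokkos` in job 684423 is flagged. The other five values lie between 529 and 582 timesteps/s.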

@bedroge bedroge changed the title from "{2023.06}[2023b, a64fx] GROMACS 2024.1 + 2024.3 + 2024.4" to "{2023.06}[2023b, a64fx] GROMACS 2024.3 + 2024.4" Nov 16, 2025
@bedroge bedroge added the ready-to-deploy (Mark a PR as ready to deploy) and ready-to-review labels Nov 16, 2025
Collaborator

@trz42 trz42 left a comment

nice 🚀

@trz42 trz42 removed the ready-to-deploy (Mark a PR as ready to deploy) label Nov 16, 2025
@trz42 trz42 added the bot:deploy (Ask bot to deploy missing software installations to EESSI) label Nov 16, 2025
@ocaisa
Member

ocaisa commented Nov 17, 2025

@bedroge Can we remove the exclusion from CI now?

@bedroge
Collaborator Author

bedroge commented Nov 17, 2025

> @bedroge Can we remove the exclusion from CI now?

Almost, there's one missing GROMACS version left (2024.1). That one had many double-precision test failures. I'll check if there's an easy fix for that (2024.3 does work fine, so something got fixed), but otherwise I'd say that we just install a dummy modulefile that prints "this version is not supported on A64FX".
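
As a rough illustration of the dummy-modulefile idea, the following Python sketch generates the text of a small Lua modulefile for Lmod. It is only one possible shape, not what EESSI actually installs: the function name `dummy_modulefile` and the file layout are assumptions made here. It uses `LmodError`, which prints the message and makes the load fail; `LmodMessage` would print without failing.

```python
def dummy_modulefile(name: str, version: str, message: str) -> str:
    """Return the text of a Lua modulefile that refuses to load."""
    return (
        f"help([[{name} {version}: {message}]])\n"
        f'whatis("Description: {name} {version} ({message})")\n'
        # Only complain on 'module load', so 'module help' and
        # 'module whatis' keep working for this placeholder.
        'if mode() == "load" then\n'
        f'    LmodError("{name}/{version}: {message}")\n'
        "end\n"
    )


text = dummy_modulefile(
    "GROMACS", "2024.1", "this version is not supported on A64FX"
)
print(text)
```

Failing the load, rather than only printing, keeps a job script from continuing without the software it expects. Whether that is the desired behaviour here is a design choice the comment leaves open.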

@ocaisa
Member

ocaisa commented Nov 17, 2025

Ok, I'll merge this and we finish up in a follow-up PR

@ocaisa ocaisa merged commit e43a325 into EESSI:main Nov 17, 2025
50 checks passed
@bedroge bedroge deleted the a64fx_gromacs branch November 17, 2025 08:07
Labels

2023.06-software.eessi.io (2023.06 version of software.eessi.io), a64fx, bot:deploy (Ask bot to deploy missing software installations to EESSI), ready-to-review

4 participants