Skip to content

Conversation

bedroge
Copy link
Contributor

@bedroge bedroge commented Sep 26, 2025

With #80 being merged, the bot almost got to the point where it would actually install software, but it failed on this line: https://github.com/EESSI/software-layer-scripts/blob/main/EESSI-install-software.sh#L229.

It turned out that $EPREFIX was set to /cvmfs/dev.eessi.io/riscv/versions/2025.06/compat/linux/riscv64, resulting in:

ERROR: Not running in Gentoo Prefix environment, run '/cvmfs/dev.eessi.io/riscv/versions/2025.06/compat/linux/riscv64/startprefix' first!

This PR fixes it by introducing an additional variable for the compat layer prefix, which can then be overridden for the different RISC-V versions (20240402 has its own compat layer, 2025.06 won't have that).

@bedroge
Copy link
Contributor Author

bedroge commented Sep 26, 2025

Tested in a container on the RISC-V cluster:

Singularity> source /cvmfs/software.eessi.io/versions/2025.06/init/lmod/bash 
Singularity> ml use /cvmfs/software.eessi.io/init/modules

Singularity> ml EESSI/2023.06
RISC-V architecture detected, but there is no RISC-V support yet in the production repository.
Automatically switching to version 20240402 of the RISC-V development repository /cvmfs/riscv.eessi.io.
For more details about this repository, see https://www.eessi.io/docs/repositories/riscv.eessi.io/. 
Module for EESSI/20240402 loaded successfully
Singularity> echo $EPREFIX
/cvmfs/riscv.eessi.io/versions/20240402/compat/linux/riscv64

Singularity> ml purge
Singularity> ml EESSI/2025.06
This EESSI production version only provides a RISC-V compatibility layer,
software installations are provided by the EESSI development repository at /cvmfs/dev.eessi.io/riscv.

Module for EESSI/2025.06 loaded successfully
Singularity> echo $EPREFIX
/cvmfs/software.eessi.io/versions/2025.06/compat/linux/riscv64

@bedroge
Copy link
Contributor Author

bedroge commented Sep 26, 2025

And for an x86_64 client, by using the dir of this branch:

$ ml use init/modules
$ ml EESSI/2023.06 
Module for EESSI/2023.06 loaded successfully
$ echo $EPREFIX
/cvmfs/software.eessi.io/versions/2023.06/compat/linux/x86_64
$ ml purge
$ ml EESSI/2025.06
Module for EESSI/2025.06 loaded successfully
$ echo $EPREFIX
/cvmfs/software.eessi.io/versions/2025.06/compat/linux/x86_64

@bedroge bedroge requested a review from ocaisa September 26, 2025 14:34
Copy link
Member

@ocaisa ocaisa left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@ocaisa
Copy link
Member

ocaisa commented Sep 26, 2025

bot: build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws for:arch=x86_64/amd/zen2
bot: build repo:eessi.io-2025.06-software instance:eessi-bot-mc-aws for:arch=x86_64/amd/zen2

@eessi-bot-aws
Copy link

eessi-bot-aws bot commented Sep 26, 2025

New job on instance eessi-bot-mc-aws for repository eessi.io-2023.06-software
Building on: amd-zen2
Building for: x86_64/amd/zen2
Job dir: /project/def-users/SHARED/jobs/2025.09/pr_92/92597

date job status comment
Sep 26 14:48:43 UTC 2025 submitted job id 92597 awaits release by job manager
Sep 26 14:49:45 UTC 2025 released job awaits launch by Slurm scheduler
Sep 26 14:54:50 UTC 2025 running job 92597 is running
Sep 26 14:57:55 UTC 2025 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-92597.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-amd-zen2-17588985100.tar.gzsize: 0 MiB (3548 bytes)
entries: 1
modules under 2023.06/software/linux/x86_64/amd/zen2/modules/all
no module files in tarball
software under 2023.06/software/linux/x86_64/amd/zen2/software
no software packages in tarball
reprod directories under 2023.06/software/linux/x86_64/amd/zen2/reprod
no reprod directories in tarball
other under 2023.06/software/linux/x86_64/amd/zen2
2023.06/init/modules/EESSI/2023.06.lua
Sep 26 14:57:55 UTC 2025 test result
😁 SUCCESS (click triangle for details)
ReFrame Summary
[ OK ] ( 1/10) EESSI_LAMMPS_lj %device_type=cpu %module_name=LAMMPS/29Aug2024-foss-2023b-kokkos %scale=1_node /aeb2d9df @BotBuildTests:x86_64_amd_zen2+default
P: perf: 415.257 timesteps/s (r:0, l:None, u:None)
[ OK ] ( 2/10) EESSI_LAMMPS_lj %device_type=cpu %module_name=LAMMPS/2Aug2023_update2-foss-2023a-kokkos %scale=1_node /04ff9ece @BotBuildTests:x86_64_amd_zen2+default
P: perf: 420.559 timesteps/s (r:0, l:None, u:None)
[ OK ] ( 3/10) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_allreduce %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %scale=1_node %device_type=cpu /775175bf @BotBuildTests:x86_64_amd_zen2+default
P: latency: 1.91 us (r:0, l:None, u:None)
[ OK ] ( 4/10) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_allreduce %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %scale=1_node %device_type=cpu /52707c40 @BotBuildTests:x86_64_amd_zen2+default
P: latency: 1.98 us (r:0, l:None, u:None)
[ OK ] ( 5/10) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_alltoall %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %scale=1_node %device_type=cpu /b1aacda9 @BotBuildTests:x86_64_amd_zen2+default
P: latency: 3.93 us (r:0, l:None, u:None)
[ OK ] ( 6/10) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_alltoall %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %scale=1_node %device_type=cpu /c6bad193 @BotBuildTests:x86_64_amd_zen2+default
P: latency: 4.22 us (r:0, l:None, u:None)
[ OK ] ( 7/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_latency %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %scale=1_node /15cad6c4 @BotBuildTests:x86_64_amd_zen2+default
P: latency: 0.58 us (r:0, l:None, u:None)
[ OK ] ( 8/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_latency %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %scale=1_node /6672deda @BotBuildTests:x86_64_amd_zen2+default
P: latency: 0.55 us (r:0, l:None, u:None)
[ OK ] ( 9/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_bw %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %scale=1_node /2a9a47b1 @BotBuildTests:x86_64_amd_zen2+default
P: bandwidth: 7187.8 MB/s (r:0, l:None, u:None)
[ OK ] (10/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_bw %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %scale=1_node /1b24ab8e @BotBuildTests:x86_64_amd_zen2+default
P: bandwidth: 7160.69 MB/s (r:0, l:None, u:None)
[ PASSED ] Ran 10/10 test case(s) from 10 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-92597.out
✅ no message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case
Sep 26 14:58:53 UTC 2025 uploaded transfer of eessi-2023.06-software-linux-x86_64-amd-zen2-17588985100.tar.gz to S3 bucket succeeded

@eessi-bot-aws
Copy link

eessi-bot-aws bot commented Sep 26, 2025

New job on instance eessi-bot-mc-aws for repository eessi.io-2025.06-software
Building on: amd-zen2
Building for: x86_64/amd/zen2
Job dir: /project/def-users/SHARED/jobs/2025.09/pr_92/92598

date job status comment
Sep 26 14:48:47 UTC 2025 submitted job id 92598 awaits release by job manager
Sep 26 14:49:43 UTC 2025 released job awaits launch by Slurm scheduler
Sep 26 14:54:48 UTC 2025 running job 92598 is running
Sep 26 14:55:52 UTC 2025 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-92598.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2025.06-software-linux-x86_64-amd-zen2-17588984820.tar.gzsize: 0 MiB (3551 bytes)
entries: 1
modules under 2025.06/software/linux/x86_64/amd/zen2/modules/all
no module files in tarball
software under 2025.06/software/linux/x86_64/amd/zen2/software
no software packages in tarball
reprod directories under 2025.06/software/linux/x86_64/amd/zen2/reprod
no reprod directories in tarball
other under 2025.06/software/linux/x86_64/amd/zen2
2025.06/init/modules/EESSI/2025.06.lua
Sep 26 14:55:52 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-92598.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case
Sep 26 14:59:02 UTC 2025 uploaded transfer of eessi-2025.06-software-linux-x86_64-amd-zen2-17588984820.tar.gz to S3 bucket succeeded

@ocaisa
Copy link
Member

ocaisa commented Sep 26, 2025

Deployment PR merged

@ocaisa ocaisa merged commit 89e3b81 into EESSI:main Sep 26, 2025
64 checks passed
@bedroge bedroge deleted the riscv_eprefix_fix branch September 26, 2025 15:32
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants