Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

{lib}[GCC 11.2.0-13.2.0] UCX 1.16.0-rc4 #20237

Merged
merged 3 commits into from Mar 29, 2024
Merged

Conversation

hajgato
Copy link
Collaborator

@hajgato hajgato commented Mar 28, 2024

Recent intel MPI might need ucx >=1.16.0 if MLNX/NVIDIA OFED >= 23.10 is installed.
(We experienced infinite hangs with FDS/6.8.0-intel-2022b, and swapping to UCX-1.16.0-rc4 solved the problem. UCX < 1.16 did not solve the problem) Note that only AMD CPUs are affected, we did not get the same problem with Intel CPUs.
With our previous OFED 23.04 version, we did not have the problem.

@zao
Copy link
Contributor

zao commented Mar 28, 2024

Test report by @zao
FAILED
Build succeeded for 0 out of 5 (5 easyconfigs in total)
eb-mix.zao.se - Linux Ubuntu 24.04 (Noble Numbat), x86_64, AMD Ryzen 9 3900X 12-Core Processor (zen2), Python 3.12.2
See https://gist.github.com/zao/44968f5eef996dfb2aed690f9246f4ed for a full test report.

@zao
Copy link
Contributor

zao commented Mar 28, 2024

/bin/bash: line 1: ./autogen.sh: No such file or (took 1 secs)

@boegel boegel added the update label Mar 28, 2024
@boegel boegel added this to the 4.x milestone Mar 28, 2024
@boegel
Copy link
Member

boegel commented Mar 28, 2024

@boegelbot please test @ generoso

@boegelbot
Copy link
Collaborator

@boegel: Request for testing this PR well received on login1

PR test command 'EB_PR=20237 EB_ARGS= EB_CONTAINER= EB_REPO=easybuild-easyconfigs /opt/software/slurm/bin/sbatch --job-name test_PR_20237 --ntasks=4 ~/boegelbot/eb_from_pr_upload_generoso.sh' executed!

  • exit code: 0
  • output:
Submitted batch job 13217

Test results coming soon (I hope)...

- notification for comment with ID 2026148011 processed

Message to humans: this is just bookkeeping information for me,
it is of no use to you (unless you think I have a bug, which I don't).

@boegelbot
Copy link
Collaborator

Test report by @boegelbot
SUCCESS
Build succeeded for 5 out of 5 (5 easyconfigs in total)
cns1 - Linux Rocky Linux 8.9, x86_64, Intel(R) Xeon(R) CPU E5-2667 v3 @ 3.20GHz (haswell), Python 3.6.8
See https://gist.github.com/boegelbot/8c4e1f11fb3a65613f2a9ce3a0661709 for a full test report.

@zao
Copy link
Contributor

zao commented Mar 29, 2024

Test report by @zao
SUCCESS
Build succeeded for 4 out of 4 (4 easyconfigs in total)
eb-mix.zao.se - Linux Ubuntu 24.04 (Noble Numbat), x86_64, AMD Ryzen 9 3900X 12-Core Processor (zen2), Python 3.12.2
See https://gist.github.com/zao/3aceb007dfdd1a4210a1063a48ea5948 for a full test report.

@boegel
Copy link
Member

boegel commented Mar 29, 2024

Test report by @boegel
SUCCESS
Build succeeded for 7 out of 7 (5 easyconfigs in total)
node3125.skitty.os - Linux RHEL 8.8, x86_64, Intel(R) Xeon(R) Gold 6140 CPU @ 2.30GHz, Python 3.6.8
See https://gist.github.com/boegel/699081e4ae8b742a8a2055a599d29021 for a full test report.

@boegel
Copy link
Member

boegel commented Mar 29, 2024

Going in, thanks @hajgato!

@boegel boegel merged commit ac41cac into easybuilders:develop Mar 29, 2024
9 checks passed
@boegel boegel modified the milestones: 4.x, release after 4.9.0 Mar 29, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

4 participants