{tools}[foss/2022a] Horovod v0.28.1 w/ PyTorch #18266

Merged


Flamefire
Contributor

Flamefire commented Jul 5, 2023

(created using eb --new-pr)

Add Horovod for all PyTorch versions where we don't have one yet.

…1.0.eb, Horovod-0.28.1-foss-2021a-CUDA-11.3.1-PyTorch-1.12.1.eb, Horovod-0.28.1-foss-2021b-CUDA-11.5.2-PyTorch-1.12.1.eb, Horovod-0.28.1-foss-2022a-CUDA-11.7.0-PyTorch-1.12.0.eb, Horovod-0.28.1-foss-2022a-CUDA-11.7.0-PyTorch-1.12.1.eb, Horovod-0.28.1-foss-2022a-CUDA-11.7.0-PyTorch-1.13.1.eb, Horovod-0.28.1-fosscuda-2020b-PyTorch-1.9.0.eb
Flamefire changed the title from "{tools}[foss/2022a] Horovod v0.28.1 w/ CUDA 11.3.1 PyTorch 1.11.0, CUDA 11.3.1 PyTorch 1.12.1, ..." to "{tools}[foss/2022a] Horovod v0.28.1 w/ PyTorch 1.12.1" on Jul 5, 2023
Flamefire changed the title from "{tools}[foss/2022a] Horovod v0.28.1 w/ PyTorch 1.12.1" to "{tools}[foss/2022a] Horovod v0.28.1 w/ PyTorch" on Jul 5, 2023
@Flamefire
Contributor Author

Test report by @Flamefire
SUCCESS
Build succeeded for 6 out of 6 (6 easyconfigs in total)
taurusi8015 - Linux CentOS Linux 7.9.2009, x86_64, AMD EPYC 7352 24-Core Processor (zen2), Python 2.7.5
See https://gist.github.com/Flamefire/7d58b0d4a3d86dface39c83bf73e5a7d for a full test report.

@Flamefire
Contributor Author

Test report by @Flamefire
SUCCESS
Build succeeded for 6 out of 6 (6 easyconfigs in total)
taurusml4 - Linux RHEL 7.6, POWER, 8335-GTX (power9le), 6 x NVIDIA Tesla V100-SXM2-32GB, 440.64.00, Python 2.7.5
See https://gist.github.com/Flamefire/0ded4e313b97e46e416530edb6eba63b for a full test report.

branfosj added this to the "next release (4.8.1?)" milestone on Jul 8, 2023
branfosj added the "update" label on Jul 8, 2023
@branfosj
Member

branfosj commented Jul 8, 2023

Test report by @branfosj
SUCCESS
Build succeeded for 6 out of 6 (6 easyconfigs in total)
bear-pg0203u29a.bear.cluster - Linux RHEL 8.6, x86_64, Intel(R) Xeon(R) Platinum 8360Y CPU @ 2.40GHz (icelake), 1 x NVIDIA NVIDIA A100-SXM4-80GB, 520.61.05, Python 3.6.8
See https://gist.github.com/branfosj/b887c015def9eda4cb29ea1003a26392 for a full test report.

@branfosj
Member

branfosj commented Jul 8, 2023

Going in, thanks @Flamefire!

branfosj merged commit 1114e89 into easybuilders:develop on Jul 8, 2023
5 checks passed
Flamefire deleted the 20230705133129_new_pr_Horovod0281 branch on July 17, 2023 07:52