{tools}[foss/2021b,foss/2022a] Horovod w/ TensorFlow #18265

Merged

Conversation

Flamefire
Contributor

@Flamefire Flamefire commented Jul 5, 2023

(created using eb --new-pr)

Add Horovod for all TensorFlow versions where we don't have one yet.

…2.5.3.eb, Horovod-0.28.1-foss-2021b-CUDA-11.4.1-TensorFlow-2.7.1.eb, Horovod-0.28.1-foss-2021b-CUDA-11.4.1-TensorFlow-2.8.4.eb, Horovod-0.28.1-foss-2022a-CUDA-11.7.0-TensorFlow-2.11.0.eb, Horovod-0.28.1-foss-2022a-CUDA-11.7.0-TensorFlow-2.9.1.eb and patches: Horovod-0.28.1_support_flatbuffers_2.0.6.patch
@Flamefire Flamefire changed the title from "{tools}[foss/2021b,foss/2022a] Horovod v0.22.1, Horovod v0.28.1 w/ CUDA 11.3.1 TensorFlow 2.5.3, CUDA 11.4.1 TensorFlow 2.7.1, ..." to "{tools}[foss/2021b,foss/2022a] Horovod w/ TensorFlow" Jul 5, 2023
@Flamefire
Contributor Author

Test report by @Flamefire
SUCCESS
Build succeeded for 5 out of 5 (5 easyconfigs in total)
taurusi8015 - Linux CentOS Linux 7.9.2009, x86_64, AMD EPYC 7352 24-Core Processor (zen2), Python 2.7.5
See https://gist.github.com/Flamefire/7110e144feae9430212973827e80c933 for a full test report.

@Flamefire
Contributor Author

Test report by @Flamefire
SUCCESS
Build succeeded for 5 out of 5 (5 easyconfigs in total)
taurusml4 - Linux RHEL 7.6, POWER, 8335-GTX (power9le), 6 x NVIDIA Tesla V100-SXM2-32GB, 440.64.00, Python 2.7.5
See https://gist.github.com/Flamefire/26281489c4554103f47b2718d80632cf for a full test report.

@branfosj
Member

branfosj commented Jul 8, 2023

Test report by @branfosj
SUCCESS
Build succeeded for 5 out of 5 (5 easyconfigs in total)
bear-pg0203u29a.bear.cluster - Linux RHEL 8.6, x86_64, Intel(R) Xeon(R) Platinum 8360Y CPU @ 2.40GHz (icelake), 1 x NVIDIA NVIDIA A100-SXM4-80GB, 520.61.05, Python 3.6.8
See https://gist.github.com/branfosj/1a59d216a6c6c3677cdbfd0ea731adb0 for a full test report.

@branfosj branfosj added the update label Jul 8, 2023
@branfosj branfosj added this to the next release (4.8.1?) milestone Jul 8, 2023
@branfosj
Member

branfosj commented Jul 8, 2023

Going in, thanks @Flamefire!

@branfosj branfosj merged commit 8070d5c into easybuilders:develop Jul 8, 2023
5 checks passed
@Flamefire Flamefire deleted the 20230705133024_new_pr_Horovod0221 branch July 17, 2023 07:52