-
Notifications
You must be signed in to change notification settings - Fork 406
Issues: openucx/ucx
Error: Transport retry count exceeded on mlx5_0:1/RoCE
#6000
by afernandezody
was closed Feb 1, 2021
Closed
7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Performance regression in collectives due to UCX_PROTO_ENABLE
Bug
#9914
opened May 30, 2024 by
angainor
cuda_copy_md.c:489 UCX WARN cuPointerSetAttribute error with CUDA VMM API
Bug
#9895
opened May 23, 2024 by
MinassZhang
DBFS library installations are not supported on DBR 15 or above.
Bug
#9777
opened Mar 25, 2024 by
rkkalluri
UCS/ARCH/BITOPS: gcc 12.3.0 fails to build x86_64 ucs_ffs32
Bug
#9774
opened Mar 21, 2024 by
tvegas1
osu_mbw_mr for CUDA memory shows bad performance with UCX_PROTO_ENABLE=y
Bug
#9690
opened Feb 15, 2024 by
dmitrygx
Question: does ucx support FPGA to AMDGPU (ROCm ) p2p transfer?
#9598
opened Jan 13, 2024 by
littlewu2508
GPU Aware openMPI 5.0.1 + ROCM gives UCX ERROR : failed to register address
Bug
#9589
opened Jan 10, 2024 by
denisbertini
Selection of Network Ressources and creating worker/endpoint pair
Bug
#9586
opened Jan 9, 2024 by
98luks
question about fine-grained transport selection for multi-node env
#9560
opened Dec 24, 2023 by
qelk123
Application works with
UCX_LOG_LEVEL=info
(or more verbose levels), but hangs otherwise
Bug
#9532
opened Dec 5, 2023 by
bedroge
Previous Next
ProTip!
Exclude everything labeled
bug
with -label:bug.