(ParaWorker pid=1955026) Error: Peer-to-peer access is unsupported on this platform.
(ParaWorker pid=1955026) In the current version of distserve, it is necessary to use a platform that supports GPU P2P access.
(ParaWorker pid=1955026) Exiting...
I ran into the error above even though I checked the P2P connection between the two GPUs. I used the following code to test the connection:
import torch

tensor_a = torch.randn(10, device="cuda:0")
try:
    # Attempt to directly copy tensor_a from GPU 0 to GPU 1
    tensor_b = tensor_a.to("cuda:1")
    print("Successfully copied tensor from GPU 0 to GPU 1 using P2P.")
except RuntimeError as e:
    print("Failed to copy tensor from GPU 0 to GPU 1 using P2P. Error:", e)
and the output is:
Successfully copied tensor from GPU 0 to GPU 1 using P2P.
and the GPU topology is:
Can you provide any suggestions?
Thank you!
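One possible explanation, offered as a sketch rather than a confirmed diagnosis: a successful `tensor_a.to("cuda:1")` does not by itself show that P2P is available, because CUDA can stage a cross-device copy through host memory when direct peer access is not supported. PyTorch exposes `torch.cuda.can_device_access_peer(device, peer_device)` for querying peer access directly. The helper name `peer_access_report` and the `main` wrapper below are hypothetical, introduced only for illustration, and whether distserve performs an equivalent check internally is an assumption.

```python
from typing import Callable, Dict, Tuple


def peer_access_report(
    device_count: int,
    can_access: Callable[[int, int], bool],
) -> Dict[Tuple[int, int], bool]:
    """Query peer access for every ordered pair of distinct devices.

    `can_access` is injected so the helper can be exercised without GPUs;
    in real use it would be torch.cuda.can_device_access_peer.
    """
    return {
        (src, dst): bool(can_access(src, dst))
        for src in range(device_count)
        for dst in range(device_count)
        if src != dst
    }


def main() -> None:
    # Hypothetical entry point: torch is imported lazily so the helper above
    # stays importable on machines without PyTorch installed.
    try:
        import torch
    except ImportError:
        print("PyTorch is not installed; cannot query peer access.")
        return

    count = torch.cuda.device_count()
    report = peer_access_report(count, torch.cuda.can_device_access_peer)
    for (src, dst), ok in report.items():
        status = "supported" if ok else "NOT supported"
        print(f"GPU {src} -> GPU {dst}: peer access {status}")


if __name__ == "__main__":
    main()
```

If any pair is reported as not supported, that would be consistent with the error message above despite the copy test passing. Comparing the result with the link types shown by `nvidia-smi topo -m` may help narrow down whether the limitation comes from the hardware topology or from the platform configuration.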