Skip to content

Conversation

@un-def
Copy link
Collaborator

@un-def un-def commented Mar 10, 2025

Part-of: #2368

@un-def un-def requested a review from r4victor March 10, 2025 15:32
@un-def un-def merged commit 4341282 into master Mar 11, 2025
24 checks passed
@un-def un-def deleted the issue_2368_multi_node_ssh_connectivity branch March 11, 2025 10:04
done
# Run NCCL Tests
${MPIRUN} \
-n $((DSTACK_NODES_NUM * DSTACK_GPUS_PER_NODE)) -N ${DSTACK_GPUS_PER_NODE} \
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@un-def Shouldn't we use DSTACK_GPUS_NUM instead of (DSTACK_NODES_NUM * DSTACK_GPUS_PER_NODE)?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Sure, I'll update the docs in a separate PR. Thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants