Skip to content

Conversation

mrshenli
Copy link
Contributor

@mrshenli mrshenli commented Aug 29, 2022

@facebook-github-bot
Copy link
Contributor

facebook-github-bot commented Aug 29, 2022

🔗 Helpful links

✅ No Failures (0 Pending)

As of commit 0c57d84 (more details on the Dr. CI page):

Expand to see more

💚 💚 Looks good so far! There are no failures yet. 💚 💚


This comment was automatically generated by Dr. CI (expand for details).

Please report bugs/suggestions to the (internal) Dr. CI Users group.

Click here to manually regenerate this comment.

@mrshenli
Copy link
Contributor Author

The error message I got is:

[108/6838] Performing build step for 'nccl_external'                                                                                                              
FAILED: nccl_external-prefix/src/nccl_external-stamp/nccl_external-build nccl/lib/libnccl_static.a /fsx/users/shenli/pytorch/build/nccl_external-prefix/src/nccl_e
xternal-stamp/nccl_external-build /fsx/users/shenli/pytorch/build/nccl/lib/libnccl_static.a                                                                       
cd /fsx/users/shenli/pytorch/third_party/nccl/nccl && env CCACHE_DISABLE=1 SCCACHE_DISABLE=1 make -j0 -l0 CXX=/data/shared/bin/c++ CUDA_HOME=/usr/local/cuda-11.1
NVCC=/data/shared/bin/cuda/nvcc NVCC_GENCODE=-gencode=arch=compute_80,code=sm_80 BUILDDIR=/fsx/users/shenli/pytorch/build/nccl VERBOSE=0 && /fsx/users/shenli/cond
a/envs/compiler/bin/cmake -E touch /fsx/users/shenli/pytorch/build/nccl_external-prefix/src/nccl_external-stamp/nccl_external-build
make: the '-j' option requires a positive integer argument

@malfet malfet added the ciflow/binaries_wheel Trigger binary build and upload jobs for wheel on the PR label Aug 29, 2022
Copy link
Contributor

@malfet malfet left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM if CI is green (why env is needed though)?

@@ -38,6 +42,7 @@ if(NOT __NCCL_INCLUDED)
BUILD_IN_SOURCE 1
CONFIGURE_COMMAND ""
BUILD_COMMAND
env
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why this is needed?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It was there before, but removed by #83696. I thought we need to pass this to pass env vars to make if necessary?

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

No env vars are being passed though, it was left over from the CCACHE_DISABLE=1 that was there before.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Oh I see, let me remove that then.

@@ -38,6 +42,7 @@ if(NOT __NCCL_INCLUDED)
BUILD_IN_SOURCE 1
CONFIGURE_COMMAND ""
BUILD_COMMAND
env
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

is this change intentional?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yep, it was there before, but removed by #83696. I thought we need to pass this to pass env vars to make if necessary?

mrshenli added a commit that referenced this pull request Aug 30, 2022
@mrshenli
Copy link
Contributor Author

@pytorchbot merge

@pytorchmergebot
Copy link
Collaborator

@pytorchbot successfully started a merge job. Check the current status here and land check progress here.
The merge job was triggered with the land checks (-l) flag. If you did not specify this flag yourself, you are likely enrolled in the land checks rollout. This means that your change will be merged once all checks on your PR and the land checks have passed (ETA 4 Hours). If you need to coordinate lands between different changes and cannot risk a land race, please add the ciflow/trunk label to your PR and wait for signal to complete, and then land your changes in proper order. Having trunk, pull, and Lint pre-run on a PR will bypass land checks and the ETA should be immediate. If this is not the intended behavior, feel free to use some of the other merge options in the wiki.
Please reach out to the PyTorch DevX Team with feedback or questions!

@pytorchmergebot
Copy link
Collaborator

Merge failed

Reason: 2 additional jobs have failed, first few of them are: trunk ,trunk / macos-12-py3-arm64 / test (default, 1, 2, macos-m1-12)

If you believe this is an error, you can use the old behavior with @pytorchbot merge -g (optionally with the ciflow/trunk to get land checks) or use @pytorchbot merge -f "some reason here". For more information, see the bot wiki.

Please reach out to the PyTorch DevX Team with feedback or questions!

Details for Dev Infra team Raised by workflow job

@mrshenli
Copy link
Contributor Author

mrshenli commented Aug 30, 2022

@pytorchbot merge -g

1 similar comment
@mrshenli
Copy link
Contributor Author

@pytorchbot merge -g

@pytorchmergebot
Copy link
Collaborator

@pytorchbot successfully started a merge job. Check the current status here.
The merge job was triggered with the green (-g) flag. This means that your change will be merged once all checks on your PR have passed (ETA: 0-4 Hours). If this is not the intended behavior, feel free to use some of the other merge options in the wiki.
Please reach out to the PyTorch DevX Team with feedback or questions!

@github-actions
Copy link
Contributor

Hey @mrshenli.
You've committed this PR, but it does not have both a 'release notes: ...' and 'topics: ...' label. Please add one of each to the PR. The 'release notes: ...' label should represent the part of PyTorch that this PR changes (fx, autograd, distributed, etc) and the 'topics: ...' label should represent the kind of PR it is (not user facing, new feature, bug fix, perf improvement, etc). The list of valid labels can be found here for the 'release notes: ...' and here for the 'topics: ...'.
For changes that are 'topic: not user facing' there is no need for a release notes label.

facebook-github-bot pushed a commit that referenced this pull request Aug 31, 2022
…84231) (#84231)

Summary:
Pull Request resolved: #84231
Approved by: https://github.com/malfet, https://github.com/rohan-varma

Test Plan: contbuild & OSS CI, see https://hud.pytorch.org/commit/pytorch/pytorch/56a37ea1a6e89a8aa31abc888127ccac647b92d4

Reviewed By: mehtanirav

Differential Revision: D39142407

Pulled By: mrshenli

fbshipit-source-id: 2ee44484605ca3ef18fcbca37829c4727988dab7
@facebook-github-bot facebook-github-bot deleted the gh/mrshenli/339/head branch September 3, 2022 14:20
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
ciflow/binaries_wheel Trigger binary build and upload jobs for wheel on the PR cla signed Merged
Projects
None yet
Development

Successfully merging this pull request may close these issues.

6 participants