@bowangbj bowangbj commented Aug 22, 2021

Stack from ghstack:

Summary:

Test Plan:

$ python test/distributed/_sharded_tensor/test_sharded_tensor.py --v

Reviewers:

Subscribers:

Tasks:

Tags:

Differential Revision: D30472621

cc @pietern @mrshenli @pritamdamania87 @zhaojuanmao @satgera @rohan-varma @gqchen @aazzolini @osalpekar @jiayisuse @agolynski @SciPioneer @H-Huang @mrzzd @cbalioglu @gcramer23

….full, sharded_tensor.rand

Summary:

Test Plan:

$ python test/distributed/_sharded_tensor/test_sharded_tensor.py TestCreateTensorFromParams --v
$ python test/distributed/_sharded_tensor/test_sharded_tensor.py TestShardedTensorChunked --v

Reviewers:

Subscribers:

Tasks:

Tags:

[ghstack-poisoned]

facebook-github-bot commented Aug 22, 2021

💊 CI failures summary and remediations

As of commit 8d5b8a3 (more details on the Dr. CI page):


  • 1/1 failures introduced in this PR

🕵️ 1 new failure recognized by patterns

The following CI failures do not appear to be due to upstream breakages:

See CircleCI build pytorch_linux_xenial_py3_6_gcc5_4_build (1/1)

Step: "Build"

Aug 26 19:21:32 CMake Error at CMakeLists.txt:1 (cmake_minimum_required):
Aug 26 19:21:31 invalid command 'bad_argument'
Aug 26 19:21:31 + retcode=1
Aug 26 19:21:31 + set -e
Aug 26 19:21:31 + return 1
Aug 26 19:21:31 + [[ pytorch-linux-xenial-py3.6-gcc5.4-build != *libtorch* ]]
Aug 26 19:21:31 + [[ pytorch-linux-xenial-py3.6-gcc5.4-build != *ppc64le* ]]
Aug 26 19:21:31 + [[ pytorch-linux-xenial-py3.6-gcc5.4-build != *clang* ]]
Aug 26 19:21:31 + WERROR=1
Aug 26 19:21:31 + python setup.py bdist_wheel
Aug 26 19:21:32 Building wheel torch-1.10.0a0+git8b2108a
Aug 26 19:21:32 CMake Error at CMakeLists.txt:1 (cmake_minimum_required):
Aug 26 19:21:32   CMake 3.10 or higher is required.  You are running version 3.5.1
Aug 26 19:21:32 
Aug 26 19:21:32 
Aug 26 19:21:32 -- Configuring incomplete, errors occurred!
Aug 26 19:21:32 -- Building version 1.10.0a0+git8b2108a
Aug 26 19:21:32 cmake -DBUILD_ENVIRONMENT=pytorch-linux-xenial-py3.6-gcc5.4-build -DBUILD_PYTHON=True -DBUILD_STATIC_RUNTIME_BENCHMARK=ON -DBUILD_TEST=True -DCMAKE_BUILD_TYPE=Release -DCMAKE_INSTALL_PREFIX=/var/lib/jenkins/workspace/torch -DCMAKE_PREFIX_PATH=/opt/conda/lib/python3.6/site-packages;/opt/conda -DNUMPY_INCLUDE_DIR=/opt/conda/lib/python3.6/site-packages/numpy/core/include -DOPENSSL_ROOT_DIR=/opt/openssl -DPYTHON_EXECUTABLE=/opt/conda/bin/python -DPYTHON_INCLUDE_DIR=/opt/conda/include/python3.6m -DPYTHON_LIBRARY=/opt/conda/lib/libpython3.6m.so.1.0 -DTORCH_BUILD_VERSION=1.10.0a0+git8b2108a -DUSE_GLOO_WITH_OPENSSL=ON -DUSE_LLVM=/opt/llvm -DUSE_NUMPY=True -DWERROR=1 /var/lib/jenkins/workspace
Aug 26 19:21:32 + cleanup
Aug 26 19:21:32 + retcode=1
Aug 26 19:21:32 + set +x
Aug 26 19:21:32 =================== sccache compilation log ===================

@bowangbj has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

bowangbj added a commit that referenced this pull request Aug 22, 2021
….full, sharded_tensor.rand

Summary:

Test Plan:

$ python test/distributed/_sharded_tensor/test_sharded_tensor.py  --v

Reviewers:

Subscribers:

Tasks:

Tags:

ghstack-source-id: 0ec4f29
Pull Request resolved: #63732

@bowangbj (Contributor Author)

Thanks Pritam for a great, detailed and fast review. Rebased and fixed all your cmts. PTAL

@pritamdamania87 pritamdamania87 left a comment

Looks good, just had a few more comments and it looks like there are some lint failures.

@bowangbj (Contributor Author)

> Looks good, just had a few more comments and it looks like there are some lint failures.

Thanks Pritam. Fixed all the cmts, PTAL

bowangbj added a commit that referenced this pull request Aug 25, 2021
….full, sharded_tensor.rand

ghstack-source-id: 41b7008
Pull Request resolved: #63732

@bowangbj (Contributor Author)

The only tricky lint error is

./torch/distributed/_sharded_tensor/__init__.py:1:1: C101 Coding magic comment not found
Please fix the above Flake8 warnings.
Error: Process completed with exit code 1.

I ran $ flake8 __init__.py locally, which produced no error.

@pritamdamania87 (Contributor)

> The only tricky lint error is
>
> ./torch/distributed/_sharded_tensor/__init__.py:1:1: C101 Coding magic comment not found
> Please fix the above Flake8 warnings.
> Error: Process completed with exit code 1.
>
> I ran $ flake8 __init__.py locally, which produced no error.

I think https://pypi.org/project/flake8-coding/ outlines how you need to set up flake8 to repro this error. The page also links to ways of resolving the issue.

bowangbj added a commit that referenced this pull request Aug 25, 2021
….full, sharded_tensor.rand

ghstack-source-id: 33fd0d6
Pull Request resolved: #63732
@bowangbj (Contributor Author)

>> The only tricky lint error is
>>
>> ./torch/distributed/_sharded_tensor/__init__.py:1:1: C101 Coding magic comment not found
>> Please fix the above Flake8 warnings.
>> Error: Process completed with exit code 1.
>>
>> I ran $ flake8 __init__.py locally, which produced no error.
>
> I think https://pypi.org/project/flake8-coding/ outlines how you need to set up flake8 to repro this error. The page also links to ways of resolving the issue.

OK, added the magic comment (encoding) explicitly to address the flake8 complaint. PTAL
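
For readers unfamiliar with the C101 check: a "coding magic comment" is an encoding declaration such as `# -*- coding: utf-8 -*-` on the first or second line of a Python file, as described in PEP 263. The sketch below is only an illustration of that idea, not the code from this PR and not the flake8-coding plugin's implementation. The helper name `find_coding_magic_comment` is hypothetical, and the check is simplified (it looks at the first two lines without PEP 263's extra rule about what the first line may contain).

```python
import re

# Encoding-declaration pattern taken from PEP 263.
_CODING_RE = re.compile(r"^[ \t\f]*#.*?coding[:=][ \t]*([-_.a-zA-Z0-9]+)")


def find_coding_magic_comment(source):
    """Return the declared encoding name, or None if no declaration is found.

    Hypothetical helper for illustration; only the first two lines are checked.
    """
    for line in source.splitlines()[:2]:
        match = _CODING_RE.match(line)
        if match:
            return match.group(1)
    return None


# A file that starts with the magic comment, and one that does not.
with_comment = "# -*- coding: utf-8 -*-\nimport os\n"
without_comment = "import os\n"

print(find_coding_magic_comment(with_comment))
print(find_coding_magic_comment(without_comment))
```

A file like `without_comment` is the kind of input that the lint error quoted above complains about; adding the declaration as the first line is presumably what "added the magic comment (encoding) explicitly" refers to.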

…ops: sharded_tensor.zeros, sharded_tensor.full, sharded_tensor.rand"

bowangbj added a commit that referenced this pull request Aug 26, 2021
….full, sharded_tensor.rand

ghstack-source-id: 44f6851
Pull Request resolved: #63732

@bowangbj (Contributor Author)

Thanks Pritam for the great review, added the tests, landing ...

@bowangbj merged this pull request in 49353e3.

@facebook-github-bot facebook-github-bot deleted the gh/bowangb/2/head branch August 30, 2021 14:17
Labels: cla signed, Merged, oncall: distributed
