Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improve new_group example in the context of SyncBatchNorm #48897

Closed
wants to merge 3 commits into from

Conversation

rohan-varma
Copy link
Member

@rohan-varma rohan-varma commented Dec 6, 2020

Closes #48804
Improves some documentation/example in SyncBN docs to clearly show that each rank must call into all new_group() calls for creating process subgroups, even if they are not going to be part of that particular subgroup.
We then pick the right group, i.e. the group that the rank is part of, and pass that into the SyncBN APIs.

Doc rendering:

syncbn_update

@dr-ci
Copy link

dr-ci bot commented Dec 6, 2020

💊 CI failures summary and remediations

As of commit ff8d56c (more details on the Dr. CI page):


  • 7/7 failures possibly* introduced in this PR
    • 2/7 non-CircleCI failure(s)

🕵️ 5 new failures recognized by patterns

The following CI failures do not appear to be due to upstream breakages:

See CircleCI build pytorch_linux_xenial_cuda9_2_cudnn7_py3_gcc5_4_build (1/5)

Step: "(Optional) Merge target branch" (full log | diagnosis details | 🔁 rerun)

Automatic merge failed; fix conflicts and then commit the result.
CONFLICT (add/add): Merge conflict in .jenkins/pytorch/codegen-test.sh
Auto-merging .jenkins/pytorch/codegen-test.sh
CONFLICT (add/add): Merge conflict in .jenkins/pytorch/build.sh
Auto-merging .jenkins/pytorch/build.sh
CONFLICT (add/add): Merge conflict in .jenkins/caffe2/build.sh
Auto-merging .jenkins/caffe2/build.sh
CONFLICT (add/add): Merge conflict in .github/workflows/lint.yml
Auto-merging .github/workflows/lint.yml
CONFLICT (add/add): Merge conflict in .circleci/scripts/binary_ios_upload.sh
Auto-merging .circleci/scripts/binary_ios_upload.sh
Automatic merge failed; fix conflicts and then commit the result.


Exited with code exit status 1

See CircleCI build pytorch_windows_vs2019_py36_cuda11.1_build (2/5)

Step: "Build" (full log | diagnosis details | 🔁 rerun)

FAILED: bin/cuda_atomic_ops_test.exe
-- Generating C:/Users/circleci/project/build/caffe2/CMakeFiles/cuda_atomic_ops_test.dir/__/aten/src/ATen/test/./cuda_atomic_ops_test_generated_cuda_atomic_ops_test.cu.obj
C:/Users/circleci/project/build/win_tmp/bin/randomtemp.exe C:/Users/circleci/project/aten/src/ATen/test/cuda_atomic_ops_test.cu -c -o C:/Users/circleci/project/build/caffe2/CMakeFiles/cuda_atomic_ops_test.dir/__/aten/src/ATen/test/./cuda_atomic_ops_test_generated_cuda_atomic_ops_test.cu.obj -ccbin cl.exe -m64 -DTH_BLAS_MKL -D_OPENMP_NOFORCE_MANIFEST -DONNX_ML=1 -DONNXIFI_ENABLE_EXT=1 -DONNX_NAMESPACE=onnx_torch -D_CRT_SECURE_NO_DEPRECATE=1 -DWIN32_LEAN_AND_MEAN -DMAGMA_V2 -DIDEEP_USE_MKL -DUSE_EXTERNAL_MZCRC -DMINIZ_DISABLE_ZIP_READER_CRC32_CHECKS -Xcompiler ,\"/DWIN32\",\"/D_WINDOWS\",\"/GR\",\"/EHsc\",\"/w\",\"/bigobj\",\"-DUSE_PTHREADPOOL\",\"-openmp:experimental\",\"-DNDEBUG\",\"-DUSE_FBGEMM\",\"-DUSE_XNNPACK\",\"-DHAVE_AVX_CPU_DEFINITION\",\"-DHAVE_AVX2_CPU_DEFINITION\",\"/MD\",\"/O2\",\"/Ob2\",\"/DNDEBUG\",\"/w\",\"/bigobj\",\"-DNDEBUG\" -Xcompiler /w -w -Xfatbin -compress-all -DONNX_NAMESPACE=onnx_torch --use-local-env -gencode arch=compute_75,code=sm_75 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=integer_sign_change,--diag_suppress=useless_using_declaration,--diag_suppress=set_but_not_used,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=implicit_return_from_non_void_function,--diag_suppress=unsigned_compare_with_zero,--diag_suppress=declared_but_not_referenced,--diag_suppress=bad_friend_decl --Werror cross-execution-space-call --no-host-device-move-forward -Xcompiler -MD --expt-relaxed-constexpr --expt-extended-lambda -Xcompiler=/wd4819,/wd4503,/wd4190,/wd4244,/wd4251,/wd4275,/wd4522 -Wno-deprecated-gpu-targets --expt-extended-lambda -DCUDA_HAS_FP16=1 -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_HALF_CONVERSIONS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -DNVCC "-IC:/Program Files/NVIDIA GPU Computing Toolkit/CUDA/v11.1/include" -IC:/Users/circleci/project/build/aten/src -IC:/Users/circleci/project/aten/src -IC:/Users/circleci/project/build -IC:/Users/circleci/project -IC:/Users/circleci/project/build/third_party/gloo -IC:/Users/circleci/project/cmake/../third_party/gloo -IC:/Users/circleci/project/cmake/../third_party/googletest/googlemock/include -IC:/Users/circleci/project/cmake/../third_party/googletest/googletest/include -IC:/Users/circleci/project/third_party/protobuf/src -IC:/Users/circleci/project/build/win_tmp/mkl/include -IC:/Users/circleci/project/third_party/XNNPACK/include -IC:/Users/circleci/project/cmake/../third_party/benchmark/include -IC:/Users/circleci/project/third_party -IC:/Users/circleci/project/cmake/../third_party/eigen -IC:/Jenkins/Miniconda3/include -IC:/Jenkins/Miniconda3/lib/site-packages/numpy/core/include -IC:/Users/circleci/project/cmake/../third_party/pybind11/include -IC:/Users/circleci/project/build/caffe2/contrib/aten -IC:/Users/circleci/project/third_party/onnx -IC:/Users/circleci/project/build/third_party/onnx -IC:/Users/circleci/project/third_party/foxi -IC:/Users/circleci/project/build/third_party/foxi -IC:/Users/circleci/project/build/win_tmp/magma/include -IC:/Users/circleci/project/third_party/ideep/mkl-dnn/include -IC:/Users/circleci/project/third_party/ideep/include -IC:/Users/circleci/project/build/caffe2/aten/src/TH -IC:/Users/circleci/project/aten/src/TH -IC:/Users/circleci/project/build/caffe2/aten/src -IC:/Users/circleci/project/aten/../third_party/catch/single_include -IC:/Users/circleci/project/aten/src/ATen/.. -IC:/Users/circleci/project/build/caffe2/aten/src/ATen -IC:/Users/circleci/project/third_party/miniz-2.0.8 -IC:/Users/circleci/project/caffe2/core/nomnigraph/include -IC:/Users/circleci/project/torch/csrc/api -IC:/Users/circleci/project/torch/csrc/api/include -IC:/Users/circleci/project/c10/../ -IC:/Users/circleci/project/build/third_party/ideep/mkl-dnn/include -IC:/Users/circleci/project/third_party/ideep/mkl-dnn/src/../include -IC:/Users/circleci/project/c10/cuda/../.. "-IC:/Program Files/NVIDIA Corporation/NvToolsExt/include" -IC:/Users/circleci/project/third_party/googletest/googletest/include -IC:/Users/circleci/project/third_party/googletest/googletest
cuda_atomic_ops_test.cu
Generated C:/Users/circleci/project/build/caffe2/CMakeFiles/cuda_atomic_ops_test.dir/__/aten/src/ATen/test/./cuda_atomic_ops_test_generated_cuda_atomic_ops_test.cu.obj successfully.
[5093/5298] cmd.exe /C "cd . && C:\Jenkins\Miniconda3\Library\bin\cmake.exe -E vs_link_exe --intdir=test_tensorexpr\CMakeFiles\tutorial_tensorexpr.dir --rc=C:\PROGRA~2\WI3CF2~1\10\bin\100190~1.0\x64\rc.exe --mt=C:\PROGRA~2\WI3CF2~1\10\bin\100190~1.0\x64\mt.exe --manifests  -- C:\PROGRA~2\MICROS~2\2019\COMMUN~1\VC\Tools\MSVC\1428~1.293\bin\Hostx64\x64\link.exe  test_tensorexpr\CMakeFiles\tutorial_tensorexpr.dir\tutorial.cpp.obj  /out:bin\tutorial_tensorexpr.exe /implib:lib\tutorial_tensorexpr.lib /pdb:bin\tutorial_tensorexpr.pdb /version:0.0 /machine:x64 /ignore:4049 /ignore:4217 /INCREMENTAL:NO /subsystem:console  lib\torch.lib  "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.1\lib\x64\cudart_static.lib"  "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.1\lib\x64\nvrtc.lib"  "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.1\lib\x64\cuda.lib"  lib\torch_cuda.lib  lib\torch_cpu.lib  lib\libprotobuf.lib  win_tmp\mkl\lib\mkl_intel_lp64.lib  win_tmp\mkl\lib\mkl_intel_thread.lib  win_tmp\mkl\lib\mkl_core.lib  win_tmp\mkl\lib\libiomp5md.lib  lib\dnnl.lib  -INCLUDE:?warp_size@cuda@at@@YAHXZ  lib\c10_cuda.lib  lib\c10.lib  "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.1\lib\x64\cudart_static.lib"  "C:\Program Files\NVIDIA Corporation\NvToolsExt\lib\x64\nvToolsExt64_1.lib"  "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.1\lib\x64\cufft.lib"  "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.1\lib\x64\curand.lib"  "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.1\lib\x64\cublas.lib"  "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.1\lib\x64\cudnn.lib"  kernel32.lib user32.lib gdi32.lib winspool.lib shell32.lib ole32.lib oleaut32.lib uuid.lib comdlg32.lib advapi32.lib && cd ."
Microsoft (R) Incremental Linker Version 14.28.29335.0
Copyright (C) Microsoft Corporation.  All rights reserved.

   Creating library lib\tutorial_tensorexpr.lib and object lib\tutorial_tensorexpr.exp
[5094/5298] cmd.exe /C "cd . && C:\Jenkins\Miniconda3\Library\bin\cmake.exe -E vs_link_exe --intdir=caffe2\CMakeFiles\cuda_atomic_ops_test.dir --rc=C:\PROGRA~2\WI3CF2~1\10\bin\100190~1.0\x64\rc.exe --mt=C:\PROGRA~2\WI3CF2~1\10\bin\100190~1.0\x64\mt.exe --manifests  -- C:\PROGRA~2\MICROS~2\2019\COMMUN~1\VC\Tools\MSVC\1428~1.293\bin\Hostx64\x64\link.exe  caffe2\CMakeFiles\cuda_atomic_ops_test.dir\__\aten\src\ATen\test\cuda_atomic_ops_test_generated_cuda_atomic_ops_test.cu.obj  /out:bin\cuda_atomic_ops_test.exe /implib:lib\cuda_atomic_ops_test.lib /pdb:bin\cuda_atomic_ops_test.pdb /version:0.0 /machine:x64 /ignore:4049 /ignore:4217 /INCREMENTAL:NO /subsystem:console  "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.1\lib\x64\cudart_static.lib"  lib\gtest_main.lib  lib\torch.lib  lib\torch_cuda.lib  lib\torch_cpu.lib  lib\libprotobuf.lib  win_tmp\mkl\lib\mkl_intel_lp64.lib  win_tmp\mkl\lib\mkl_intel_thread.lib  win_tmp\mkl\lib\mkl_core.lib  win_tmp\mkl\lib\libiomp5md.lib  lib\dnnl.lib  -INCLUDE:?warp_size@cuda@at@@YAHXZ  lib\c10_cuda.lib  lib\c10.lib  "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.1\lib\x64\cudart_static.lib"  "C:\Program Files\NVIDIA Corporation\NvToolsExt\lib\x64\nvToolsExt64_1.lib"  "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.1\lib\x64\cufft.lib"  "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.1\lib\x64\curand.lib"  "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.1\lib\x64\cublas.lib"  "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.1\lib\x64\cudnn.lib"  lib\gtest.lib  kernel32.lib user32.lib gdi32.lib winspool.lib shell32.lib ole32.lib oleaut32.lib uuid.lib comdlg32.lib advapi32.lib && cd ."
FAILED: bin/cuda_atomic_ops_test.exe 
cmd.exe /C "cd . && C:\Jenkins\Miniconda3\Library\bin\cmake.exe -E vs_link_exe --intdir=caffe2\CMakeFiles\cuda_atomic_ops_test.dir --rc=C:\PROGRA~2\WI3CF2~1\10\bin\100190~1.0\x64\rc.exe --mt=C:\PROGRA~2\WI3CF2~1\10\bin\100190~1.0\x64\mt.exe --manifests  -- C:\PROGRA~2\MICROS~2\2019\COMMUN~1\VC\Tools\MSVC\1428~1.293\bin\Hostx64\x64\link.exe  caffe2\CMakeFiles\cuda_atomic_ops_test.dir\__\aten\src\ATen\test\cuda_atomic_ops_test_generated_cuda_atomic_ops_test.cu.obj  /out:bin\cuda_atomic_ops_test.exe /implib:lib\cuda_atomic_ops_test.lib /pdb:bin\cuda_atomic_ops_test.pdb /version:0.0 /machine:x64 /ignore:4049 /ignore:4217 /INCREMENTAL:NO /subsystem:console  "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.1\lib\x64\cudart_static.lib"  lib\gtest_main.lib  lib\torch.lib  lib\torch_cuda.lib  lib\torch_cpu.lib  lib\libprotobuf.lib  win_tmp\mkl\lib\mkl_intel_lp64.lib  win_tmp\mkl\lib\mkl_intel_thread.lib  win_tmp\mkl\lib\mkl_core.lib  win_tmp\mkl\lib\libiomp5md.lib  lib\dnnl.lib  -INCLUDE:?warp_size@cuda@at@@YAHXZ  lib\c10_cuda.lib  lib\c10.lib  "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.1\lib\x64\cudart_static.lib"  "C:\Program Files\NVIDIA Corporation\NvToolsExt\lib\x64\nvToolsExt64_1.lib"  "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.1\lib\x64\cufft.lib"  "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.1\lib\x64\curand.lib"  "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.1\lib\x64\cublas.lib"  "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.1\lib\x64\cudnn.lib"  lib\gtest.lib  kernel32.lib user32.lib gdi32.lib winspool.lib shell32.lib ole32.lib oleaut32.lib uuid.lib comdlg32.lib advapi32.lib && cd ."
LINK: command "C:\PROGRA~2\MICROS~2\2019\COMMUN~1\VC\Tools\MSVC\1428~1.293\bin\Hostx64\x64\link.exe caffe2\CMakeFiles\cuda_atomic_ops_test.dir\__\aten\src\ATen\test\cuda_atomic_ops_test_generated_cuda_atomic_ops_test.cu.obj /out:bin\cuda_atomic_ops_test.exe /implib:lib\cuda_atomic_ops_test.lib /pdb:bin\cuda_atomic_ops_test.pdb /version:0.0 /machine:x64 /ignore:4049 /ignore:4217 /INCREMENTAL:NO /subsystem:console C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.1\lib\x64\cudart_static.lib lib\gtest_main.lib lib\torch.lib lib\torch_cuda.lib lib\torch_cpu.lib lib\libprotobuf.lib win_tmp\mkl\lib\mkl_intel_lp64.lib win_tmp\mkl\lib\mkl_intel_thread.lib win_tmp\mkl\lib\mkl_core.lib win_tmp\mkl\lib\libiomp5md.lib lib\dnnl.lib -INCLUDE:?warp_size@cuda@at@@YAHXZ lib\c10_cuda.lib lib\c10.lib C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.1\lib\x64\cudart_static.lib C:\Program Files\NVIDIA Corporation\NvToolsExt\lib\x64\nvToolsExt64_1.lib C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.1\lib\x64\cufft.lib C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.1\lib\x64\curand.lib C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.1\lib\x64\cublas.lib C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.1\lib\x64\cudnn.lib lib\gtest.lib kernel32.lib user32.lib gdi32.lib winspool.lib shell32.lib ole32.lib oleaut32.lib uuid.lib comdlg32.lib advapi32.lib /MANIFEST /MANIFESTFILE:bin\cuda_atomic_ops_test.exe.manifest" failed (exit code 1120) with the following output:
Microsoft (R) Incremental Linker Version 14.28.29335.0
Copyright (C) Microsoft Corporation.  All rights reserved.

   Creating library lib\cuda_atomic_ops_test.lib and object lib\cuda_atomic_ops_test.exp
cuda_atomic_ops_test_generated_cuda_atomic_ops_test.cu.obj : error LNK2019: unresolved external symbol "float __cdecl pow(float,int)" (?pow@@YAMMH@Z) referenced in function "void __cdecl test_atomic_mul<float>(void)" (??$test_atomic_mul@M@@YAXXZ)
  Hint on symbols that are defined and could potentially match:
    "float __cdecl pow(float,float)" (?pow@@YAMMM@Z)
cuda_atomic_ops_test_generated_cuda_atomic_ops_test.cu.obj : error LNK2019: unresolved external symbol "double __cdecl pow(double,int)" (?pow@@YANNH@Z) referenced in function "void __cdecl test_atomic_mul<double>(void)" (??$test_atomic_mul@N@@YAXXZ)

See CircleCI build pytorch_windows_vs2019_py36_cuda10.1_build (3/5)

Step: "Build" (full log | diagnosis details | 🔁 rerun)

FAILED: bin/cuda_atomic_ops_test.exe

[5129/5298] C:\Users\circleci\project\build\win_tmp\bin\sccache-cl.exe   /TP -DIDEEP_USE_MKL -DMAGMA_V2 -DMINIZ_DISABLE_ZIP_READER_CRC32_CHECKS -DONNXIFI_ENABLE_EXT=1 -DONNX_ML=1 -DONNX_NAMESPACE=onnx_torch -DTH_BLAS_MKL -DUSE_CUDA -DUSE_EXTERNAL_MZCRC -DWIN32_LEAN_AND_MEAN -D_CRT_SECURE_NO_DEPRECATE=1 -D_OPENMP_NOFORCE_MANIFEST -Iaten\src -I..\aten\src -I. -I..\ -I..\cmake\..\third_party\benchmark\include -Icaffe2\contrib\aten -I..\third_party\onnx -Ithird_party\onnx -I..\third_party\foxi -Ithird_party\foxi -Icaffe2\..\aten\src -Icaffe2\..\aten\src\ATen -I..\torch\csrc\api -I..\torch\csrc\api\include -I..\c10\.. -Ithird_party\ideep\mkl-dnn\include -I..\third_party\ideep\mkl-dnn\src\..\include -I..\c10\cuda\..\.. -Ithird_party\gloo -I..\cmake\..\third_party\gloo -I..\cmake\..\third_party\googletest\googlemock\include -I..\cmake\..\third_party\googletest\googletest\include -I..\third_party\protobuf\src -Iwin_tmp\mkl\include -I..\third_party\XNNPACK\include -I..\third_party -I..\cmake\..\third_party\eigen -IC:\Jenkins\Miniconda3\include -IC:\Jenkins\Miniconda3\lib\site-packages\numpy\core\include -I..\cmake\..\third_party\pybind11\include -I..\cmake\..\third_party\cub -Iwin_tmp\magma\include -I"C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.1\include" -I..\third_party\ideep\mkl-dnn\include -I..\third_party\ideep\include -I"C:\Program Files\NVIDIA Corporation\NvToolsExt\include" -I..\third_party\googletest\googletest\include -I..\third_party\googletest\googletest /DWIN32 /D_WINDOWS /GR /EHsc /w /bigobj -DUSE_PTHREADPOOL -openmp:experimental -DNDEBUG -DUSE_FBGEMM -DUSE_XNNPACK -DHAVE_AVX_CPU_DEFINITION -DHAVE_AVX2_CPU_DEFINITION /MD /O2 /Ob2 /DNDEBUG /w /bigobj -DNDEBUG -DCAFFE2_USE_GLOO -DCUDA_HAS_FP16=1 -DUSE_GCC_GET_CPUID -DUSE_AVX -DUSE_AVX2 -DTH_HAVE_THREAD /EHsc /DNOMINMAX /wd4267 /wd4251 /wd4522 /wd4838 /wd4305 /wd4244 /wd4190 /wd4101 /wd4996 /wd4275 /bigobj -std:c++14 /showIncludes /Fotest_api\CMakeFiles\test_api.dir\tensor_cuda.cpp.obj /Fdtest_api\CMakeFiles\test_api.dir\ /FS -c ..\test\cpp\api\tensor_cuda.cpp
Microsoft (R) C/C++ Optimizing Compiler Version 19.28.29335 for x64
Copyright (C) Microsoft Corporation.  All rights reserved.

[5130/5298] C:\Users\circleci\project\build\win_tmp\bin\sccache-cl.exe   /TP -DIDEEP_USE_MKL -DMAGMA_V2 -DMINIZ_DISABLE_ZIP_READER_CRC32_CHECKS -DONNXIFI_ENABLE_EXT=1 -DONNX_ML=1 -DONNX_NAMESPACE=onnx_torch -DTH_BLAS_MKL -DUSE_CUDA=1 -DUSE_EXTERNAL_MZCRC -DWIN32_LEAN_AND_MEAN -D_CRT_SECURE_NO_DEPRECATE=1 -D_OPENMP_NOFORCE_MANIFEST -Iaten\src -I..\aten\src -I. -I..\ -I..\cmake\..\third_party\benchmark\include -Icaffe2\contrib\aten -I..\third_party\onnx -Ithird_party\onnx -I..\third_party\foxi -Ithird_party\foxi -Icaffe2\aten\src -I..\torch\lib\c10d\.. -I..\torch\csrc\api -I..\torch\csrc\api\include -I..\c10\.. -Ithird_party\ideep\mkl-dnn\include -I..\third_party\ideep\mkl-dnn\src\..\include -I..\c10\cuda\..\.. -Ithird_party\gloo -I..\cmake\..\third_party\gloo -I..\cmake\..\third_party\googletest\googlemock\include -I..\cmake\..\third_party\googletest\googletest\include -I..\third_party\protobuf\src -Iwin_tmp\mkl\include -I..\third_party\XNNPACK\include -I..\third_party -I..\cmake\..\third_party\eigen -IC:\Jenkins\Miniconda3\include -IC:\Jenkins\Miniconda3\lib\site-packages\numpy\core\include -I..\cmake\..\third_party\pybind11\include -I..\cmake\..\third_party\cub -Iwin_tmp\magma\include -I"C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.1\include" -I..\third_party\ideep\mkl-dnn\include -I..\third_party\ideep\include -I"C:\Program Files\NVIDIA Corporation\NvToolsExt\include" /DWIN32 /D_WINDOWS /GR /EHsc /w /bigobj -DUSE_PTHREADPOOL -openmp:experimental -DNDEBUG -DUSE_FBGEMM -DUSE_XNNPACK -DHAVE_AVX_CPU_DEFINITION -DHAVE_AVX2_CPU_DEFINITION /MD /O2 /Ob2 /DNDEBUG /w /bigobj -DNDEBUG -DCAFFE2_USE_GLOO -DCUDA_HAS_FP16=1 -DUSE_GCC_GET_CPUID -DUSE_AVX -DUSE_AVX2 -DTH_HAVE_THREAD /EHsc /DNOMINMAX /wd4267 /wd4251 /wd4522 /wd4838 /wd4305 /wd4244 /wd4190 /wd4101 /wd4996 /wd4275 /bigobj -std:c++14 /showIncludes /Focaffe2\lib_c10d\CMakeFiles\c10d.dir\ProcessGroup.cpp.obj /Fdcaffe2\lib_c10d\CMakeFiles\c10d.dir\c10d.pdb /FS -c ..\torch\lib\c10d\ProcessGroup.cpp
Microsoft (R) C/C++ Optimizing Compiler Version 19.28.29335 for x64
Copyright (C) Microsoft Corporation.  All rights reserved.

[5131/5298] cmd.exe /C "cd . && C:\Jenkins\Miniconda3\Library\bin\cmake.exe -E vs_link_exe --intdir=caffe2\CMakeFiles\cuda_atomic_ops_test.dir --rc=C:\PROGRA~2\WI3CF2~1\10\bin\100190~1.0\x64\rc.exe --mt=C:\PROGRA~2\WI3CF2~1\10\bin\100190~1.0\x64\mt.exe --manifests  -- C:\PROGRA~2\MICROS~2\2019\COMMUN~1\VC\Tools\MSVC\1428~1.293\bin\Hostx64\x64\link.exe  caffe2\CMakeFiles\cuda_atomic_ops_test.dir\__\aten\src\ATen\test\cuda_atomic_ops_test_generated_cuda_atomic_ops_test.cu.obj  /out:bin\cuda_atomic_ops_test.exe /implib:lib\cuda_atomic_ops_test.lib /pdb:bin\cuda_atomic_ops_test.pdb /version:0.0 /machine:x64 /ignore:4049 /ignore:4217 /INCREMENTAL:NO /subsystem:console  "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.1\lib\x64\cudart_static.lib"  lib\gtest_main.lib  lib\torch.lib  lib\torch_cuda.lib  lib\torch_cpu.lib  lib\libprotobuf.lib  win_tmp\mkl\lib\mkl_intel_lp64.lib  win_tmp\mkl\lib\mkl_intel_thread.lib  win_tmp\mkl\lib\mkl_core.lib  win_tmp\mkl\lib\libiomp5md.lib  lib\dnnl.lib  -INCLUDE:?warp_size@cuda@at@@YAHXZ  lib\c10_cuda.lib  lib\c10.lib  "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.1\lib\x64\cudart_static.lib"  "C:\Program Files\NVIDIA Corporation\NvToolsExt\lib\x64\nvToolsExt64_1.lib"  "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.1\lib\x64\cufft.lib"  "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.1\lib\x64\curand.lib"  "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.1\lib\x64\cublas.lib"  "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.1\lib\x64\cudnn.lib"  lib\gtest.lib  kernel32.lib user32.lib gdi32.lib winspool.lib shell32.lib ole32.lib oleaut32.lib uuid.lib comdlg32.lib advapi32.lib && cd ."
FAILED: bin/cuda_atomic_ops_test.exe 
cmd.exe /C "cd . && C:\Jenkins\Miniconda3\Library\bin\cmake.exe -E vs_link_exe --intdir=caffe2\CMakeFiles\cuda_atomic_ops_test.dir --rc=C:\PROGRA~2\WI3CF2~1\10\bin\100190~1.0\x64\rc.exe --mt=C:\PROGRA~2\WI3CF2~1\10\bin\100190~1.0\x64\mt.exe --manifests  -- C:\PROGRA~2\MICROS~2\2019\COMMUN~1\VC\Tools\MSVC\1428~1.293\bin\Hostx64\x64\link.exe  caffe2\CMakeFiles\cuda_atomic_ops_test.dir\__\aten\src\ATen\test\cuda_atomic_ops_test_generated_cuda_atomic_ops_test.cu.obj  /out:bin\cuda_atomic_ops_test.exe /implib:lib\cuda_atomic_ops_test.lib /pdb:bin\cuda_atomic_ops_test.pdb /version:0.0 /machine:x64 /ignore:4049 /ignore:4217 /INCREMENTAL:NO /subsystem:console  "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.1\lib\x64\cudart_static.lib"  lib\gtest_main.lib  lib\torch.lib  lib\torch_cuda.lib  lib\torch_cpu.lib  lib\libprotobuf.lib  win_tmp\mkl\lib\mkl_intel_lp64.lib  win_tmp\mkl\lib\mkl_intel_thread.lib  win_tmp\mkl\lib\mkl_core.lib  win_tmp\mkl\lib\libiomp5md.lib  lib\dnnl.lib  -INCLUDE:?warp_size@cuda@at@@YAHXZ  lib\c10_cuda.lib  lib\c10.lib  "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.1\lib\x64\cudart_static.lib"  "C:\Program Files\NVIDIA Corporation\NvToolsExt\lib\x64\nvToolsExt64_1.lib"  "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.1\lib\x64\cufft.lib"  "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.1\lib\x64\curand.lib"  "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.1\lib\x64\cublas.lib"  "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.1\lib\x64\cudnn.lib"  lib\gtest.lib  kernel32.lib user32.lib gdi32.lib winspool.lib shell32.lib ole32.lib oleaut32.lib uuid.lib comdlg32.lib advapi32.lib && cd ."
LINK: command "C:\PROGRA~2\MICROS~2\2019\COMMUN~1\VC\Tools\MSVC\1428~1.293\bin\Hostx64\x64\link.exe caffe2\CMakeFiles\cuda_atomic_ops_test.dir\__\aten\src\ATen\test\cuda_atomic_ops_test_generated_cuda_atomic_ops_test.cu.obj /out:bin\cuda_atomic_ops_test.exe /implib:lib\cuda_atomic_ops_test.lib /pdb:bin\cuda_atomic_ops_test.pdb /version:0.0 /machine:x64 /ignore:4049 /ignore:4217 /INCREMENTAL:NO /subsystem:console C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.1\lib\x64\cudart_static.lib lib\gtest_main.lib lib\torch.lib lib\torch_cuda.lib lib\torch_cpu.lib lib\libprotobuf.lib win_tmp\mkl\lib\mkl_intel_lp64.lib win_tmp\mkl\lib\mkl_intel_thread.lib win_tmp\mkl\lib\mkl_core.lib win_tmp\mkl\lib\libiomp5md.lib lib\dnnl.lib -INCLUDE:?warp_size@cuda@at@@YAHXZ lib\c10_cuda.lib lib\c10.lib C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.1\lib\x64\cudart_static.lib C:\Program Files\NVIDIA Corporation\NvToolsExt\lib\x64\nvToolsExt64_1.lib C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.1\lib\x64\cufft.lib C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.1\lib\x64\curand.lib C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.1\lib\x64\cublas.lib C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.1\lib\x64\cudnn.lib lib\gtest.lib kernel32.lib user32.lib gdi32.lib winspool.lib shell32.lib ole32.lib oleaut32.lib uuid.lib comdlg32.lib advapi32.lib /MANIFEST /MANIFESTFILE:bin\cuda_atomic_ops_test.exe.manifest" failed (exit code 1120) with the following output:
Microsoft (R) Incremental Linker Version 14.28.29335.0
Copyright (C) Microsoft Corporation.  All rights reserved.

   Creating library lib\cuda_atomic_ops_test.lib and object lib\cuda_atomic_ops_test.exp
cuda_atomic_ops_test_generated_cuda_atomic_ops_test.cu.obj : error LNK2019: unresolved external symbol "float __cdecl pow(float,int)" (?pow@@YAMMH@Z) referenced in function "void __cdecl test_atomic_mul<float>(void)" (??$test_atomic_mul@M@@YAXXZ)
  Hint on symbols that are defined and could potentially match:
    "float __cdecl pow(float,float)" (?pow@@YAMMM@Z)
cuda_atomic_ops_test_generated_cuda_atomic_ops_test.cu.obj : error LNK2019: unresolved external symbol "double __cdecl pow(double,int)" (?pow@@YANNH@Z) referenced in function "void __cdecl test_atomic_mul<double>(void)" (??$test_atomic_mul@N@@YAXXZ)

See CircleCI build pytorch_xla_linux_bionic_py3_6_clang9_build (4/5)

Step: "(Optional) Merge target branch" (full log | diagnosis details | 🔁 rerun)

Automatic merge failed; fix conflicts and then commit the result.
CONFLICT (add/add): Merge conflict in .jenkins/pytorch/codegen-test.sh
Auto-merging .jenkins/pytorch/codegen-test.sh
CONFLICT (add/add): Merge conflict in .jenkins/pytorch/build.sh
Auto-merging .jenkins/pytorch/build.sh
CONFLICT (add/add): Merge conflict in .jenkins/caffe2/build.sh
Auto-merging .jenkins/caffe2/build.sh
CONFLICT (add/add): Merge conflict in .github/workflows/lint.yml
Auto-merging .github/workflows/lint.yml
CONFLICT (add/add): Merge conflict in .circleci/scripts/binary_ios_upload.sh
Auto-merging .circleci/scripts/binary_ios_upload.sh
Automatic merge failed; fix conflicts and then commit the result.


Exited with code exit status 1

See CircleCI build pytorch_linux_xenial_py3_6_gcc5_4_build (5/5)

Step: "(Optional) Merge target branch" (full log | diagnosis details | 🔁 rerun)

Automatic merge failed; fix conflicts and then commit the result.
CONFLICT (add/add): Merge conflict in .jenkins/pytorch/codegen-test.sh
Auto-merging .jenkins/pytorch/codegen-test.sh
CONFLICT (add/add): Merge conflict in .jenkins/pytorch/build.sh
Auto-merging .jenkins/pytorch/build.sh
CONFLICT (add/add): Merge conflict in .jenkins/caffe2/build.sh
Auto-merging .jenkins/caffe2/build.sh
CONFLICT (add/add): Merge conflict in .github/workflows/lint.yml
Auto-merging .github/workflows/lint.yml
CONFLICT (add/add): Merge conflict in .circleci/scripts/binary_ios_upload.sh
Auto-merging .circleci/scripts/binary_ios_upload.sh
Automatic merge failed; fix conflicts and then commit the result.


Exited with code exit status 1


Extra GitHub checks: 1 failed


This comment was automatically generated by Dr. CI (expand for details).Follow this link to opt-out of these comments for your Pull Requests.

Please report bugs/suggestions to the (internal) Dr. CI Users group.

This comment has been revised 8 times.

@@ -435,7 +435,13 @@ class SyncBatchNorm(_BatchNorm):
>>> m = nn.SyncBatchNorm(100)
>>> # creating process group (optional)
>>> # process_ids is a list of int identifying rank ids.
>>> process_group = torch.distributed.new_group(process_ids)
>>> process_ids = list(range(8))
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nit: maybe call these ranks instead of process_ids since it might be confused with OS pids.

Copy link
Contributor

@facebook-github-bot facebook-github-bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@rohan-varma has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@rohan-varma
Copy link
Member Author

Lint failure is unrelated: ERROR: Could not open requirements file: [Errno 2] No such file or directory: 'requirements-flake8.txt'

@facebook-github-bot
Copy link
Contributor

@rohan-varma merged this pull request in c0a0845.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Improve torch.distributed.new_group() documentation in the context of SyncBatchNorm
3 participants