Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

About install horovod #96

Open
LCXin00 opened this issue Jan 13, 2020 · 0 comments
Open

About install horovod #96

LCXin00 opened this issue Jan 13, 2020 · 0 comments

Comments

@LCXin00
Copy link

LCXin00 commented Jan 13, 2020

Hi!guys, when i run pip install -r requirements.txt
The following error :
Building wheels for collected packages: horovod
Building wheel for horovod (setup.py) ... error
ERROR: Command errored out with exit status 1:
command: /home/scw4750/anaconda3/envs/lcx_glow/bin/python -u -c 'import sys, setuptools, tokenize; sys.argv[0] = '"'"'/tmp/pip-install-qjvktya2/horovod/setup.py'"'"'; file='"'"'/tmp/pip-install-qjvktya2/horovod/setup.py'"'"';f=getattr(tokenize, '"'"'open'"'"', open)(file);code=f.read().replace('"'"'\r\n'"'"', '"'"'\n'"'"');f.close();exec(compile(code, file, '"'"'exec'"'"'))' bdist_wheel -d /tmp/pip-wheel-j5sti5wz --python-tag cp35
cwd: /tmp/pip-install-qjvktya2/horovod/
Complete output (97 lines):
running bdist_wheel
running build
running build_py
creating build
creating build/lib.linux-x86_64-3.5
creating build/lib.linux-x86_64-3.5/horovod
copying horovod/init.py -> build/lib.linux-x86_64-3.5/horovod
creating build/lib.linux-x86_64-3.5/horovod/tensorflow
copying horovod/tensorflow/init.py -> build/lib.linux-x86_64-3.5/horovod/tensorflow
copying horovod/tensorflow/mpi_ops.py -> build/lib.linux-x86_64-3.5/horovod/tensorflow
creating build/lib.linux-x86_64-3.5/horovod/torch
copying horovod/torch/init.py -> build/lib.linux-x86_64-3.5/horovod/torch
copying horovod/torch/mpi_ops.py -> build/lib.linux-x86_64-3.5/horovod/torch
creating build/lib.linux-x86_64-3.5/horovod/keras
copying horovod/keras/callbacks.py -> build/lib.linux-x86_64-3.5/horovod/keras
copying horovod/keras/init.py -> build/lib.linux-x86_64-3.5/horovod/keras
creating build/lib.linux-x86_64-3.5/horovod/common
copying horovod/common/init.py -> build/lib.linux-x86_64-3.5/horovod/common
creating build/lib.linux-x86_64-3.5/horovod/torch/mpi_lib_impl
copying horovod/torch/mpi_lib_impl/init.py -> build/lib.linux-x86_64-3.5/horovod/torch/mpi_lib_impl
creating build/lib.linux-x86_64-3.5/horovod/torch/mpi_lib
copying horovod/torch/mpi_lib/init.py -> build/lib.linux-x86_64-3.5/horovod/torch/mpi_lib
running build_ext
gcc -pthread -Wsign-compare -DNDEBUG -g -fwrapv -O3 -Wall -Wstrict-prototypes -fPIC -std=c++11 -fPIC -O2 -I/home/scw4750/anaconda3/envs/lcx_glow/include/python3.5m -c build/temp.linux-x86_64-3.5/test_compile/test_cpp_flags.cc -o build/temp.linux-x86_64-3.5/test_compile/test_cpp_flags.o
cc1plus: warning: command line option ‘-Wstrict-prototypes’ is valid for C/ObjC but not for C++
gcc -pthread -shared -L/home/scw4750/anaconda3/envs/lcx_glow/lib -Wl,-rpath=/home/scw4750/anaconda3/envs/lcx_glow/lib,--no-as-needed build/temp.linux-x86_64-3.5/test_compile/test_cpp_flags.o -L/home/scw4750/anaconda3/envs/lcx_glow/lib -o build/temp.linux-x86_64-3.5/test_compile/test_cpp_flags.so
gcc -pthread -Wsign-compare -DNDEBUG -g -fwrapv -O3 -Wall -Wstrict-prototypes -fPIC -std=c++11 -fPIC -O2 -I/usr/lib/openmpi/include -I/usr/lib/openmpi/include/openmpi -pthread -L/usr//lib -L/usr/lib/openmpi/lib -lmpi_cxx -lmpi -ldl -lhwloc -I/home/scw4750/anaconda3/envs/lcx_glow/include/python3.5m -c build/temp.linux-x86_64-3.5/test_compile/test_tensorflow_libs.cc -o build/temp.linux-x86_64-3.5/test_compile/test_tensorflow_libs.o
cc1plus: warning: command line option ‘-Wstrict-prototypes’ is valid for C/ObjC but not for C++
gcc -pthread -shared -L/home/scw4750/anaconda3/envs/lcx_glow/lib -Wl,-rpath=/home/scw4750/anaconda3/envs/lcx_glow/lib,--no-as-needed build/temp.linux-x86_64-3.5/test_compile/test_tensorflow_libs.o -L/home/scw4750/anaconda3/envs/lcx_glow/lib/python3.5/site-packages/tensorflow/core -L/home/scw4750/anaconda3/envs/lcx_glow/lib -ltensorflow_framework -o build/temp.linux-x86_64-3.5/test_compile/test_tensorflow_libs.so
/usr/bin/ld: cannot find -ltensorflow_framework
collect2: error: ld returned 1 exit status
gcc -pthread -Wsign-compare -DNDEBUG -g -fwrapv -O3 -Wall -Wstrict-prototypes -fPIC -std=c++11 -fPIC -O2 -I/usr/lib/openmpi/include -I/usr/lib/openmpi/include/openmpi -pthread -L/usr//lib -L/usr/lib/openmpi/lib -lmpi_cxx -lmpi -ldl -lhwloc -I/home/scw4750/anaconda3/envs/lcx_glow/include/python3.5m -c build/temp.linux-x86_64-3.5/test_compile/test_tensorflow_libs.cc -o build/temp.linux-x86_64-3.5/test_compile/test_tensorflow_libs.o
cc1plus: warning: command line option ‘-Wstrict-prototypes’ is valid for C/ObjC but not for C++
gcc -pthread -shared -L/home/scw4750/anaconda3/envs/lcx_glow/lib -Wl,-rpath=/home/scw4750/anaconda3/envs/lcx_glow/lib,--no-as-needed build/temp.linux-x86_64-3.5/test_compile/test_tensorflow_libs.o -L/home/scw4750/anaconda3/envs/lcx_glow/lib/python3.5/site-packages/tensorflow/core -L/home/scw4750/anaconda3/envs/lcx_glow/lib -o build/temp.linux-x86_64-3.5/test_compile/test_tensorflow_libs.so
gcc -pthread -Wsign-compare -DNDEBUG -g -fwrapv -O3 -Wall -Wstrict-prototypes -fPIC -std=c++11 -fPIC -O2 -I/usr/lib/openmpi/include -I/usr/lib/openmpi/include/openmpi -pthread -L/usr//lib -L/usr/lib/openmpi/lib -lmpi_cxx -lmpi -ldl -lhwloc -D_GLIBCXX_USE_CXX11_ABI=0 -I/home/scw4750/anaconda3/envs/lcx_glow/lib/python3.5/site-packages/tensorflow/include -I/home/scw4750/anaconda3/envs/lcx_glow/lib/python3.5/site-packages/tensorflow/include/external/nsync/public -I/home/scw4750/anaconda3/envs/lcx_glow/include/python3.5m -c build/temp.linux-x86_64-3.5/test_compile/test_tensorflow_abi.cc -o build/temp.linux-x86_64-3.5/test_compile/test_tensorflow_abi.o
cc1plus: warning: command line option ‘-Wstrict-prototypes’ is valid for C/ObjC but not for C++
gcc -pthread -shared -L/home/scw4750/anaconda3/envs/lcx_glow/lib -Wl,-rpath=/home/scw4750/anaconda3/envs/lcx_glow/lib,--no-as-needed build/temp.linux-x86_64-3.5/test_compile/test_tensorflow_abi.o -L/home/scw4750/anaconda3/envs/lcx_glow/lib/python3.5/site-packages/tensorflow/core -L/home/scw4750/anaconda3/envs/lcx_glow/lib -o build/temp.linux-x86_64-3.5/test_compile/test_tensorflow_abi.so
building 'horovod.tensorflow.mpi_lib' extension
creating build/temp.linux-x86_64-3.5/horovod
creating build/temp.linux-x86_64-3.5/horovod/tensorflow
gcc -pthread -Wsign-compare -DNDEBUG -g -fwrapv -O3 -Wall -Wstrict-prototypes -fPIC -I/home/scw4750/anaconda3/envs/lcx_glow/include/python3.5m -c horovod/tensorflow/mpi_ops.cc -o build/temp.linux-x86_64-3.5/horovod/tensorflow/mpi_ops.o -std=c++11 -fPIC -O2 -I/usr/lib/openmpi/include -I/usr/lib/openmpi/include/openmpi -pthread -L/usr//lib -L/usr/lib/openmpi/lib -lmpi_cxx -lmpi -ldl -lhwloc -I/home/scw4750/anaconda3/envs/lcx_glow/lib/python3.5/site-packages/tensorflow/include -I/home/scw4750/anaconda3/envs/lcx_glow/lib/python3.5/site-packages/tensorflow/include/external/nsync/public -D_GLIBCXX_USE_CXX11_ABI=0
cc1plus: warning: command line option ‘-Wstrict-prototypes’ is valid for C/ObjC but not for C++
g++ -pthread -shared -L/home/scw4750/anaconda3/envs/lcx_glow/lib -Wl,-rpath=/home/scw4750/anaconda3/envs/lcx_glow/lib,--no-as-needed build/temp.linux-x86_64-3.5/horovod/tensorflow/mpi_ops.o -L/home/scw4750/anaconda3/envs/lcx_glow/lib -lpython3.5m -o build/lib.linux-x86_64-3.5/horovod/tensorflow/mpi_lib.cpython-35m-x86_64-linux-gnu.so -I/usr/lib/openmpi/include -I/usr/lib/openmpi/include/openmpi -pthread -L/usr//lib -L/usr/lib/openmpi/lib -lmpi_cxx -lmpi -ldl -lhwloc -L/home/scw4750/anaconda3/envs/lcx_glow/lib/python3.5/site-packages/tensorflow/core
INFO: Above error indicates that this PyTorch installation does not support CUDA.
INFO: Unable to build PyTorch plugin, will skip it.

Traceback (most recent call last):
File "/tmp/pip-install-qjvktya2/horovod/setup.py", line 572, in build_extensions
build_torch_extension(self, options, abi_compile_flags)
File "/tmp/pip-install-qjvktya2/horovod/setup.py", line 509, in build_torch_extension
from torch.utils.ffi import create_extension
File "/home/scw4750/anaconda3/envs/lcx_glow/lib/python3.5/site-packages/torch/utils/ffi/init.py", line 1, in
raise ImportError("torch.utils.ffi is deprecated. Please use cpp extensions instead.")
ImportError: torch.utils.ffi is deprecated. Please use cpp extensions instead.

building 'horovod.common.mpi_lib' extension
creating build/temp.linux-x86_64-3.5/horovod/common
gcc -pthread -Wsign-compare -DNDEBUG -g -fwrapv -O3 -Wall -Wstrict-prototypes -fPIC -I/home/scw4750/anaconda3/envs/lcx_glow/include/python3.5m -c horovod/common/common.cc -o build/temp.linux-x86_64-3.5/horovod/common/common.o -std=c++11 -fPIC -O2 -I/usr/lib/openmpi/include -I/usr/lib/openmpi/include/openmpi -pthread -L/usr//lib -L/usr/lib/openmpi/lib -lmpi_cxx -lmpi -ldl -lhwloc -D_GLIBCXX_USE_CXX11_ABI=0
cc1plus: warning: command line option ‘-Wstrict-prototypes’ is valid for C/ObjC but not for C++
gcc -pthread -Wsign-compare -DNDEBUG -g -fwrapv -O3 -Wall -Wstrict-prototypes -fPIC -I/home/scw4750/anaconda3/envs/lcx_glow/include/python3.5m -c horovod/common/mpi_message.cc -o build/temp.linux-x86_64-3.5/horovod/common/mpi_message.o -std=c++11 -fPIC -O2 -I/usr/lib/openmpi/include -I/usr/lib/openmpi/include/openmpi -pthread -L/usr//lib -L/usr/lib/openmpi/lib -lmpi_cxx -lmpi -ldl -lhwloc -D_GLIBCXX_USE_CXX11_ABI=0
cc1plus: warning: command line option ‘-Wstrict-prototypes’ is valid for C/ObjC but not for C++
gcc -pthread -Wsign-compare -DNDEBUG -g -fwrapv -O3 -Wall -Wstrict-prototypes -fPIC -I/home/scw4750/anaconda3/envs/lcx_glow/include/python3.5m -c horovod/common/operations.cc -o build/temp.linux-x86_64-3.5/horovod/common/operations.o -std=c++11 -fPIC -O2 -I/usr/lib/openmpi/include -I/usr/lib/openmpi/include/openmpi -pthread -L/usr//lib -L/usr/lib/openmpi/lib -lmpi_cxx -lmpi -ldl -lhwloc -D_GLIBCXX_USE_CXX11_ABI=0
cc1plus: warning: command line option ‘-Wstrict-prototypes’ is valid for C/ObjC but not for C++
In file included from /usr/lib/openmpi/include/mpi.h:253:0,
from horovod/common/operations.cc:36:
/usr/lib/openmpi/include/mpi_portable_platform.h:374:34: warning: invalid suffix on literal; C++11 requires a space between literal and string macro [-Wliteral-suffix]
_STRINGIFY(GNUC)"."_STRINGIFY(GNUC_MINOR)"."_STRINGIFY(GNUC_PATCHLEVEL)
^
/usr/lib/openmpi/include/mpi_portable_platform.h:374:63: warning: invalid suffix on literal; C++11 requires a space between literal and string macro [-Wliteral-suffix]
_STRINGIFY(GNUC)"."_STRINGIFY(GNUC_MINOR)"."_STRINGIFY(GNUC_PATCHLEVEL)
^
horovod/common/operations.cc: In function ‘void horovod::common::{anonymous}::PerformOperation(horovod::common::{anonymous}::TensorTable&, horovod::common::MPIResponse)’:
horovod/common/operations.cc:784:24: error: invalid conversion from ‘const void*’ to ‘void*’ [-fpermissive]
e.tensor->data(), (int)e.tensor->shape().num_elements(),
^
In file included from horovod/common/operations.cc:36:0:
/usr/lib/openmpi/include/mpi.h:1036:20: note: initializing argument 1 of ‘int MPI_Allgatherv(void*, int, MPI_Datatype, void*, int*, int*, MPI_Datatype, MPI_Comm)’
OMPI_DECLSPEC int MPI_Allgatherv(void sendbuf, int sendcount, MPI_Datatype sendtype,
^
horovod/common/operations.cc:1106:54: error: invalid conversion from ‘const void
’ to ‘void*’ [-fpermissive]
horovod_global.mpi_comm))
^
horovod/common/operations.cc:530:24: note: in definition of macro ‘MPI_CHECK’
auto mpi_result = (op);
^
In file included from horovod/common/operations.cc:36:0:
/usr/lib/openmpi/include/mpi.h:1041:20: note: initializing argument 1 of ‘int MPI_Allreduce(void*, void*, int, MPI_Datatype, MPI_Op, MPI_Comm)’
OMPI_DECLSPEC int MPI_Allreduce(void *sendbuf, void *recvbuf, int count,
^
horovod/common/operations.cc: In function ‘void horovod::common::{anonymous}::BackgroundThreadLoop(horovod::common::{anonymous}::HorovodGlobalState&)’:
horovod/common/operations.cc:1251:33: error: ‘MPI_COMM_TYPE_SHARED’ was not declared in this scope
MPI_Comm_split_type(mpi_comm, MPI_COMM_TYPE_SHARED, 0, MPI_INFO_NULL,
^
horovod/common/operations.cc:1252:34: error: ‘MPI_Comm_split_type’ was not declared in this scope
&local_comm);
^
error: command 'gcc' failed with exit status 1

ERROR: Failed building wheel for horovod
Running setup.py clean for horovod
Failed to build horovod
Installing collected packages: horovod, setuptools
Running setup.py install for horovod ... error
ERROR: Command errored out with exit status 1:
command: /home/scw4750/anaconda3/envs/lcx_glow/bin/python -u -c 'import sys, setuptools, tokenize; sys.argv[0] = '"'"'/tmp/pip-install-qjvktya2/horovod/setup.py'"'"'; file='"'"'/tmp/pip-install-qjvktya2/horovod/setup.py'"'"';f=getattr(tokenize, '"'"'open'"'"', open)(file);code=f.read().replace('"'"'\r\n'"'"', '"'"'\n'"'"');f.close();exec(compile(code, file, '"'"'exec'"'"'))' install --record /tmp/pip-record-zkq5olrq/install-record.txt --single-version-externally-managed --compile
cwd: /tmp/pip-install-qjvktya2/horovod/
Complete output (97 lines):
running install
running build
running build_py
creating build
creating build/lib.linux-x86_64-3.5
creating build/lib.linux-x86_64-3.5/horovod
copying horovod/init.py -> build/lib.linux-x86_64-3.5/horovod
creating build/lib.linux-x86_64-3.5/horovod/tensorflow
copying horovod/tensorflow/init.py -> build/lib.linux-x86_64-3.5/horovod/tensorflow
copying horovod/tensorflow/mpi_ops.py -> build/lib.linux-x86_64-3.5/horovod/tensorflow
creating build/lib.linux-x86_64-3.5/horovod/torch
copying horovod/torch/init.py -> build/lib.linux-x86_64-3.5/horovod/torch
copying horovod/torch/mpi_ops.py -> build/lib.linux-x86_64-3.5/horovod/torch
creating build/lib.linux-x86_64-3.5/horovod/keras
copying horovod/keras/callbacks.py -> build/lib.linux-x86_64-3.5/horovod/keras
copying horovod/keras/init.py -> build/lib.linux-x86_64-3.5/horovod/keras
creating build/lib.linux-x86_64-3.5/horovod/common
copying horovod/common/init.py -> build/lib.linux-x86_64-3.5/horovod/common
creating build/lib.linux-x86_64-3.5/horovod/torch/mpi_lib_impl
copying horovod/torch/mpi_lib_impl/init.py -> build/lib.linux-x86_64-3.5/horovod/torch/mpi_lib_impl
creating build/lib.linux-x86_64-3.5/horovod/torch/mpi_lib
copying horovod/torch/mpi_lib/init.py -> build/lib.linux-x86_64-3.5/horovod/torch/mpi_lib
running build_ext
gcc -pthread -Wsign-compare -DNDEBUG -g -fwrapv -O3 -Wall -Wstrict-prototypes -fPIC -std=c++11 -fPIC -O2 -I/home/scw4750/anaconda3/envs/lcx_glow/include/python3.5m -c build/temp.linux-x86_64-3.5/test_compile/test_cpp_flags.cc -o build/temp.linux-x86_64-3.5/test_compile/test_cpp_flags.o
cc1plus: warning: command line option ‘-Wstrict-prototypes’ is valid for C/ObjC but not for C++
gcc -pthread -shared -L/home/scw4750/anaconda3/envs/lcx_glow/lib -Wl,-rpath=/home/scw4750/anaconda3/envs/lcx_glow/lib,--no-as-needed build/temp.linux-x86_64-3.5/test_compile/test_cpp_flags.o -L/home/scw4750/anaconda3/envs/lcx_glow/lib -o build/temp.linux-x86_64-3.5/test_compile/test_cpp_flags.so
gcc -pthread -Wsign-compare -DNDEBUG -g -fwrapv -O3 -Wall -Wstrict-prototypes -fPIC -std=c++11 -fPIC -O2 -I/usr/lib/openmpi/include -I/usr/lib/openmpi/include/openmpi -pthread -L/usr//lib -L/usr/lib/openmpi/lib -lmpi_cxx -lmpi -ldl -lhwloc -I/home/scw4750/anaconda3/envs/lcx_glow/include/python3.5m -c build/temp.linux-x86_64-3.5/test_compile/test_tensorflow_libs.cc -o build/temp.linux-x86_64-3.5/test_compile/test_tensorflow_libs.o
cc1plus: warning: command line option ‘-Wstrict-prototypes’ is valid for C/ObjC but not for C++
gcc -pthread -shared -L/home/scw4750/anaconda3/envs/lcx_glow/lib -Wl,-rpath=/home/scw4750/anaconda3/envs/lcx_glow/lib,--no-as-needed build/temp.linux-x86_64-3.5/test_compile/test_tensorflow_libs.o -L/home/scw4750/anaconda3/envs/lcx_glow/lib/python3.5/site-packages/tensorflow/core -L/home/scw4750/anaconda3/envs/lcx_glow/lib -ltensorflow_framework -o build/temp.linux-x86_64-3.5/test_compile/test_tensorflow_libs.so
/usr/bin/ld: cannot find -ltensorflow_framework
collect2: error: ld returned 1 exit status
gcc -pthread -Wsign-compare -DNDEBUG -g -fwrapv -O3 -Wall -Wstrict-prototypes -fPIC -std=c++11 -fPIC -O2 -I/usr/lib/openmpi/include -I/usr/lib/openmpi/include/openmpi -pthread -L/usr//lib -L/usr/lib/openmpi/lib -lmpi_cxx -lmpi -ldl -lhwloc -I/home/scw4750/anaconda3/envs/lcx_glow/include/python3.5m -c build/temp.linux-x86_64-3.5/test_compile/test_tensorflow_libs.cc -o build/temp.linux-x86_64-3.5/test_compile/test_tensorflow_libs.o
cc1plus: warning: command line option ‘-Wstrict-prototypes’ is valid for C/ObjC but not for C++
gcc -pthread -shared -L/home/scw4750/anaconda3/envs/lcx_glow/lib -Wl,-rpath=/home/scw4750/anaconda3/envs/lcx_glow/lib,--no-as-needed build/temp.linux-x86_64-3.5/test_compile/test_tensorflow_libs.o -L/home/scw4750/anaconda3/envs/lcx_glow/lib/python3.5/site-packages/tensorflow/core -L/home/scw4750/anaconda3/envs/lcx_glow/lib -o build/temp.linux-x86_64-3.5/test_compile/test_tensorflow_libs.so
gcc -pthread -Wsign-compare -DNDEBUG -g -fwrapv -O3 -Wall -Wstrict-prototypes -fPIC -std=c++11 -fPIC -O2 -I/usr/lib/openmpi/include -I/usr/lib/openmpi/include/openmpi -pthread -L/usr//lib -L/usr/lib/openmpi/lib -lmpi_cxx -lmpi -ldl -lhwloc -D_GLIBCXX_USE_CXX11_ABI=0 -I/home/scw4750/anaconda3/envs/lcx_glow/lib/python3.5/site-packages/tensorflow/include -I/home/scw4750/anaconda3/envs/lcx_glow/lib/python3.5/site-packages/tensorflow/include/external/nsync/public -I/home/scw4750/anaconda3/envs/lcx_glow/include/python3.5m -c build/temp.linux-x86_64-3.5/test_compile/test_tensorflow_abi.cc -o build/temp.linux-x86_64-3.5/test_compile/test_tensorflow_abi.o
cc1plus: warning: command line option ‘-Wstrict-prototypes’ is valid for C/ObjC but not for C++
gcc -pthread -shared -L/home/scw4750/anaconda3/envs/lcx_glow/lib -Wl,-rpath=/home/scw4750/anaconda3/envs/lcx_glow/lib,--no-as-needed build/temp.linux-x86_64-3.5/test_compile/test_tensorflow_abi.o -L/home/scw4750/anaconda3/envs/lcx_glow/lib/python3.5/site-packages/tensorflow/core -L/home/scw4750/anaconda3/envs/lcx_glow/lib -o build/temp.linux-x86_64-3.5/test_compile/test_tensorflow_abi.so
building 'horovod.tensorflow.mpi_lib' extension
creating build/temp.linux-x86_64-3.5/horovod
creating build/temp.linux-x86_64-3.5/horovod/tensorflow
gcc -pthread -Wsign-compare -DNDEBUG -g -fwrapv -O3 -Wall -Wstrict-prototypes -fPIC -I/home/scw4750/anaconda3/envs/lcx_glow/include/python3.5m -c horovod/tensorflow/mpi_ops.cc -o build/temp.linux-x86_64-3.5/horovod/tensorflow/mpi_ops.o -std=c++11 -fPIC -O2 -I/usr/lib/openmpi/include -I/usr/lib/openmpi/include/openmpi -pthread -L/usr//lib -L/usr/lib/openmpi/lib -lmpi_cxx -lmpi -ldl -lhwloc -I/home/scw4750/anaconda3/envs/lcx_glow/lib/python3.5/site-packages/tensorflow/include -I/home/scw4750/anaconda3/envs/lcx_glow/lib/python3.5/site-packages/tensorflow/include/external/nsync/public -D_GLIBCXX_USE_CXX11_ABI=0
cc1plus: warning: command line option ‘-Wstrict-prototypes’ is valid for C/ObjC but not for C++
g++ -pthread -shared -L/home/scw4750/anaconda3/envs/lcx_glow/lib -Wl,-rpath=/home/scw4750/anaconda3/envs/lcx_glow/lib,--no-as-needed build/temp.linux-x86_64-3.5/horovod/tensorflow/mpi_ops.o -L/home/scw4750/anaconda3/envs/lcx_glow/lib -lpython3.5m -o build/lib.linux-x86_64-3.5/horovod/tensorflow/mpi_lib.cpython-35m-x86_64-linux-gnu.so -I/usr/lib/openmpi/include -I/usr/lib/openmpi/include/openmpi -pthread -L/usr//lib -L/usr/lib/openmpi/lib -lmpi_cxx -lmpi -ldl -lhwloc -L/home/scw4750/anaconda3/envs/lcx_glow/lib/python3.5/site-packages/tensorflow/core
INFO: Above error indicates that this PyTorch installation does not support CUDA.
INFO: Unable to build PyTorch plugin, will skip it.

Traceback (most recent call last):
  File "/tmp/pip-install-qjvktya2/horovod/setup.py", line 572, in build_extensions
    build_torch_extension(self, options, abi_compile_flags)
  File "/tmp/pip-install-qjvktya2/horovod/setup.py", line 509, in build_torch_extension
    from torch.utils.ffi import create_extension
  File "/home/scw4750/anaconda3/envs/lcx_glow/lib/python3.5/site-packages/torch/utils/ffi/__init__.py", line 1, in <module>
    raise ImportError("torch.utils.ffi is deprecated. Please use cpp extensions instead.")
ImportError: torch.utils.ffi is deprecated. Please use cpp extensions instead.

building 'horovod.common.mpi_lib' extension
creating build/temp.linux-x86_64-3.5/horovod/common
gcc -pthread -Wsign-compare -DNDEBUG -g -fwrapv -O3 -Wall -Wstrict-prototypes -fPIC -I/home/scw4750/anaconda3/envs/lcx_glow/include/python3.5m -c horovod/common/common.cc -o build/temp.linux-x86_64-3.5/horovod/common/common.o -std=c++11 -fPIC -O2 -I/usr/lib/openmpi/include -I/usr/lib/openmpi/include/openmpi -pthread -L/usr//lib -L/usr/lib/openmpi/lib -lmpi_cxx -lmpi -ldl -lhwloc -D_GLIBCXX_USE_CXX11_ABI=0
cc1plus: warning: command line option ‘-Wstrict-prototypes’ is valid for C/ObjC but not for C++
gcc -pthread -Wsign-compare -DNDEBUG -g -fwrapv -O3 -Wall -Wstrict-prototypes -fPIC -I/home/scw4750/anaconda3/envs/lcx_glow/include/python3.5m -c horovod/common/mpi_message.cc -o build/temp.linux-x86_64-3.5/horovod/common/mpi_message.o -std=c++11 -fPIC -O2 -I/usr/lib/openmpi/include -I/usr/lib/openmpi/include/openmpi -pthread -L/usr//lib -L/usr/lib/openmpi/lib -lmpi_cxx -lmpi -ldl -lhwloc -D_GLIBCXX_USE_CXX11_ABI=0
cc1plus: warning: command line option ‘-Wstrict-prototypes’ is valid for C/ObjC but not for C++
gcc -pthread -Wsign-compare -DNDEBUG -g -fwrapv -O3 -Wall -Wstrict-prototypes -fPIC -I/home/scw4750/anaconda3/envs/lcx_glow/include/python3.5m -c horovod/common/operations.cc -o build/temp.linux-x86_64-3.5/horovod/common/operations.o -std=c++11 -fPIC -O2 -I/usr/lib/openmpi/include -I/usr/lib/openmpi/include/openmpi -pthread -L/usr//lib -L/usr/lib/openmpi/lib -lmpi_cxx -lmpi -ldl -lhwloc -D_GLIBCXX_USE_CXX11_ABI=0
cc1plus: warning: command line option ‘-Wstrict-prototypes’ is valid for C/ObjC but not for C++
In file included from /usr/lib/openmpi/include/mpi.h:253:0,
                 from horovod/common/operations.cc:36:
/usr/lib/openmpi/include/mpi_portable_platform.h:374:34: warning: invalid suffix on literal; C++11 requires a space between literal and string macro [-Wliteral-suffix]
              _STRINGIFY(__GNUC__)"."_STRINGIFY(__GNUC_MINOR__)"."_STRINGIFY(__GNUC_PATCHLEVEL__)
                                  ^
/usr/lib/openmpi/include/mpi_portable_platform.h:374:63: warning: invalid suffix on literal; C++11 requires a space between literal and string macro [-Wliteral-suffix]
              _STRINGIFY(__GNUC__)"."_STRINGIFY(__GNUC_MINOR__)"."_STRINGIFY(__GNUC_PATCHLEVEL__)
                                                               ^
horovod/common/operations.cc: In function ‘void horovod::common::{anonymous}::PerformOperation(horovod::common::{anonymous}::TensorTable&, horovod::common::MPIResponse)’:
horovod/common/operations.cc:784:24: error: invalid conversion from ‘const void*’ to ‘void*’ [-fpermissive]
         e.tensor->data(), (int)e.tensor->shape().num_elements(),
                        ^
In file included from horovod/common/operations.cc:36:0:
/usr/lib/openmpi/include/mpi.h:1036:20: note: initializing argument 1 of ‘int MPI_Allgatherv(void*, int, MPI_Datatype, void*, int*, int*, MPI_Datatype, MPI_Comm)’
 OMPI_DECLSPEC  int MPI_Allgatherv(void *sendbuf, int sendcount, MPI_Datatype sendtype,
                    ^
horovod/common/operations.cc:1106:54: error: invalid conversion from ‘const void*’ to ‘void*’ [-fpermissive]
                               horovod_global.mpi_comm))
                                                      ^
horovod/common/operations.cc:530:24: note: in definition of macro ‘MPI_CHECK’
     auto mpi_result = (op);                                                    \
                        ^
In file included from horovod/common/operations.cc:36:0:
/usr/lib/openmpi/include/mpi.h:1041:20: note: initializing argument 1 of ‘int MPI_Allreduce(void*, void*, int, MPI_Datatype, MPI_Op, MPI_Comm)’
 OMPI_DECLSPEC  int MPI_Allreduce(void *sendbuf, void *recvbuf, int count,
                    ^
horovod/common/operations.cc: In function ‘void horovod::common::{anonymous}::BackgroundThreadLoop(horovod::common::{anonymous}::HorovodGlobalState&)’:
horovod/common/operations.cc:1251:33: error: ‘MPI_COMM_TYPE_SHARED’ was not declared in this scope
   MPI_Comm_split_type(mpi_comm, MPI_COMM_TYPE_SHARED, 0, MPI_INFO_NULL,
                                 ^
horovod/common/operations.cc:1252:34: error: ‘MPI_Comm_split_type’ was not declared in this scope
                       &local_comm);
                                  ^
error: command 'gcc' failed with exit status 1
----------------------------------------

ERROR: Command errored out with exit status 1: /home/scw4750/anaconda3/envs/lcx_glow/bin/python -u -c 'import sys, setuptools, tokenize; sys.argv[0] = '"'"'/tmp/pip-install-qjvktya2/horovod/setup.py'"'"'; file='"'"'/tmp/pip-install-qjvktya2/horovod/setup.py'"'"';f=getattr(tokenize, '"'"'open'"'"', open)(file);code=f.read().replace('"'"'\r\n'"'"', '"'"'\n'"'"');f.close();exec(compile(code, file, '"'"'exec'"'"'))' install --record /tmp/pip-record-zkq5olrq/install-record.txt --single-version-externally-managed --compile Check the logs for full command output.
Please give me some advice

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant