Skip to content
This repository has been archived by the owner on Nov 17, 2023. It is now read-only.

horovod cpu build segfault #18013

Open
eric-haibin-lin opened this issue Apr 10, 2020 · 5 comments
Open

horovod cpu build segfault #18013

eric-haibin-lin opened this issue Apr 10, 2020 · 5 comments

Comments

@eric-haibin-lin
Copy link
Member

eric-haibin-lin commented Apr 10, 2020

This works:

pip3 install https://repo.mxnet.io/dist/python/cpu/mxnet-2.0.0b20200306-py2.py3-none-manylinux1_x86_64.whl

pip3 uninstall horovod -y; pip3 install horovod --user --no-cache-dir
horovodrun -np 4 python3.6 test.py

test.py:

import horovod.mxnet as hvd
import mxnet as mx

hvd.init()
a = mx.nd.ones((1))
hvd.allreduce_(a)
print(a)

This fails:

pip3 install https://repo.mxnet.io/dist/python/cpu/mxnet-2.0.0b20200313-py2.py3-none-manylinux2014_x86_64.whl
@eric-haibin-lin
Copy link
Member Author

@apeforest FYI

@eric-haibin-lin
Copy link
Member Author

@TaoLv @PatricZhao FYI

@TaoLv
Copy link
Member

TaoLv commented Apr 11, 2020

@eric-haibin-lin @apeforest I cannot install mxnet and horovod through the command lines shared in the description. Besides, with pip install --pre mxnet -f https://dist.mxnet.io/python/cpu, I can only get https://repo.mxnet.io/dist/python/cpu/mxnet-2.0.0b20200312-py2.py3-none-manylinux1_x86_64.whl.

@eric-haibin-lin
Copy link
Member Author

@TaoLv can you try wget https://repo.mxnet.io/dist/python/cpu/mxnet-2.0.0b20200313-py2.py3-none-manylinux2014_x86_64.whl and then pip install mxnet-2.0.0b20200313-py2.py3-none-manylinux2014_x86_64.whl

@TaoLv
Copy link
Member

TaoLv commented Apr 12, 2020

@TaoLv can you try wget https://repo.mxnet.io/dist/python/cpu/mxnet-2.0.0b20200313-py2.py3-none-manylinux2014_x86_64.whl and then pip install mxnet-2.0.0b20200313-py2.py3-none-manylinux2014_x86_64.whl

Got:

(mxnet) [lvtao@mlt2-clx103 ~]$ pip install mxnet-2.0.0b20200313-py2.py3-none-manylinux2014_x86_64.whl
ERROR: mxnet-2.0.0b20200313-py2.py3-none-manylinux2014_x86_64.whl is not a supported wheel on this platform.

I'm using CentOS 7.2 and python 3.6.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Projects
None yet
Development

No branches or pull requests

2 participants