Skip to content
This repository has been archived by the owner on Aug 5, 2022. It is now read-only.

MLSL compiling error #96

Closed
Rpona opened this issue Jul 12, 2017 · 4 comments
Closed

MLSL compiling error #96

Rpona opened this issue Jul 12, 2017 · 4 comments

Comments

@Rpona
Copy link

Rpona commented Jul 12, 2017

I git clone the latest version of intel caffe today.

When I make with "USE_MLSL=1", it will show make error as below:

make: *** [.build_release/src/caffe/solvers/nesterov_solver.o] Error 1
In file included from ./include/caffe/syncedmem.hpp:51:0,
from ./include/caffe/blob.hpp:47,
from ./include/caffe/net.hpp:47,
from ./include/caffe/solver.hpp:44,
from ./include/caffe/sgd_solvers.hpp:44,
from src/caffe/solvers/sgd_solver.cpp:41:
./include/caffe/multinode/mlsl.hpp: In constructor ‘caffe::mn::Distribution::Distribution(int, int, int, int, int, int)’:
./include/caffe/multinode/mlsl.hpp:114:50: error: ‘class MLSL::Environment’ has no member named ‘CreateDistributionWithColors’
distrib_ = MLSL::Environment::GetEnv().CreateDistributionWithColors(dataColor, modelColor);

If I use default setting, it can be compiled well.
#USE_MLSL: 1
However, I found multi-node training are not synchronized.

@jgong5
Copy link
Contributor

jgong5 commented Jul 12, 2017

Please make sure you don't have MLSL_ROOT env that points to an old version of MLSL. Suggest you clear MLSL_ROOT and the latest Caffe would download MLSL binary package automatically.

@Rpona
Copy link
Author

Rpona commented Jul 12, 2017

@jgong5
Thanks for your reply.
I used "rpm -e intel-mlsl-devel-64-2017.1-014.x86_64" to remove the order version of mlsl.
Then , I reboot the server and I didn't export "MLSL_ROOT".
I still got the same error message.

@jgong5
Copy link
Contributor

jgong5 commented Jul 12, 2017

Could you please double check if there is MLSL_ROOT in your environment?

@Rpona
Copy link
Author

Rpona commented Jul 12, 2017

@jgong5
Hi, I fixed the issue. Thank you a lot :)

I found my intel caffe will download the order version "l_mlsl_p_2017.0.014 " of mlsl.
And there isn't "CreateDistributionWithColors" function in the header file "mlsl.hpp" of this version.
I install the latest version of mlsl from https://github.com/01org/MLSL/releases.
And followed these commands.
$ rpm --import PUBLIC_KEY.PUB
$ rpm -i intel-mlsl-devel-64-2017.1-016.x86_64.rpm
$source /opt/intel/mlsl_2017.1.016/intel64/bin/mlslvars.sh
then compile again.

@Rpona Rpona closed this as completed Jul 12, 2017
@Rpona Rpona mentioned this issue Jul 13, 2017
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants