run_example failed in MacOS #246

cuiwow · 2019-04-12T07:04:39Z

I installed a new version with the build.sh in MacOS (Xcode and CMake installed first).
When running the run_example.sh, it aborted with the error message:

MacOS: Mojave 10.14.4
xlearn: 0.43

(the pip installed one in version 0.40a works on my computer, but has a bug just fixed in latest version)

aksnzhy · 2019-04-12T07:13:28Z

@cuiwow I find the same error. You can solve the problem by doing this:

mkdir a build outside the xlearn source code and go into it:

cmake ../xlearn
make -j4

and then you can try the demo successfully.

I will figure out why this error come out. Thanks!

aksnzhy · 2019-04-12T07:58:17Z

@cuiwow I fixed the bug and you can use the latest code to test it. Thank you!

cuiwow · 2019-04-12T08:03:19Z

@aksnzhy Thanks, that works.

You have to agree with the xcode agreement before using /usr/bin/cc, otherwise it would throw an exception. I don't know whether it is the reason, and how i could install it before without /usr/bin/cc.

cuiwow · 2019-04-12T08:12:43Z

@aksnzhy Another case: failed with my own data, the same error message.

Line size is about 2.1MB,
csv file format: label,value0,value1... label is a int 0/1, while values are binary float 0.0/1.0
cmd: xlearn_train ./fm_train.csv -s 2 -v ./fm_eval.csv -x acc

It works when using just 10 values. But failed using real data with more than 500,000 columns, as the attached file.
I have checked the columns' number, and each element is an integer or float number.
fm_train.txt

aksnzhy · 2019-04-12T08:15:27Z

@etveritas Could you please help @cuiwow to solve this issue? Thanks!

etveritas · 2019-04-12T08:23:20Z

@aksnzhy okay, and @cuiwow, I'll try to solve this issue, please give me a minute.

etveritas · 2019-04-12T09:08:45Z

@aksnzhy There is another kMaxLineSize left unchanged before, and I have modified it. @cuiwow after change this constant, I test on WSL with your data, it passed, you can use the latest source code, btw, xLearn still skip zeros when read CSV, you should comment this line out if need.

cuiwow · 2019-04-12T10:13:44Z

@aksnzhy @etveritas
Thanks a lot. I have success in running a demo. Then encountered another problem...

I have a line size of 2MB. Industry recommendation datasets always have more than 1M lines, or 10M lines at least. The total data size would be larger than 2TB, while the original data size is only 50GB.

I want to train it on a single server, to build a baseline. But my disk is not large enough..

I think the data should be processed into the right format batch by batch during training, not whole data.

Thanks for your efforts.

etveritas · 2019-04-12T12:04:51Z

@cuiwow I'm not sure whether your mean is having no enough memory. If it is, for large dataset, xLearn have supported on-disk train if your machine have enough memory, you can find more in https://xlearn-doc.readthedocs.io/en/latest/large/index.html.

etveritas closed this as completed Apr 15, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

run_example failed in MacOS #246

run_example failed in MacOS #246

cuiwow commented Apr 12, 2019 •

edited

aksnzhy commented Apr 12, 2019 •

edited

aksnzhy commented Apr 12, 2019

cuiwow commented Apr 12, 2019

cuiwow commented Apr 12, 2019 •

edited

aksnzhy commented Apr 12, 2019

etveritas commented Apr 12, 2019

etveritas commented Apr 12, 2019 •

edited

cuiwow commented Apr 12, 2019 •

edited

etveritas commented Apr 12, 2019 •

edited

run_example failed in MacOS #246

run_example failed in MacOS #246

Comments

cuiwow commented Apr 12, 2019 • edited

aksnzhy commented Apr 12, 2019 • edited

aksnzhy commented Apr 12, 2019

cuiwow commented Apr 12, 2019

cuiwow commented Apr 12, 2019 • edited

aksnzhy commented Apr 12, 2019

etveritas commented Apr 12, 2019

etveritas commented Apr 12, 2019 • edited

cuiwow commented Apr 12, 2019 • edited

etveritas commented Apr 12, 2019 • edited

cuiwow commented Apr 12, 2019 •

edited

aksnzhy commented Apr 12, 2019 •

edited

cuiwow commented Apr 12, 2019 •

edited

etveritas commented Apr 12, 2019 •

edited

cuiwow commented Apr 12, 2019 •

edited

etveritas commented Apr 12, 2019 •

edited