-
Notifications
You must be signed in to change notification settings - Fork 1.1k
Illegal instruction on training image similarity model from example #20
Comments
Can you share some more information about your system for us to be able to debug this issue?
I have opened another issue #26 to track the issue related to memory. |
And if you happen to have a machine that is not x86_64, we don't support that and I've opened an issue to make sure future users will get an error earlier on #27. |
Yes, I am using x86_64. But, the machine is only have mediocre memory (8 gb). I also suspect something like the batch size (since the tutorial is not explicitly state the number). I have tried on much smaller dataset (<100 images) and it works just fine. |
Assigning to you @gustavla - but if this turns out to be a WSL-specific issue feel free to assign to me. Thanks! |
I have still not been able to reproduce this issue. @francisbitontistudio Thanks for adding another sample point to this investigation. I also see that you are using the object detector, which means it is not limited to image similarity. @taflahi @francisbitontistudio Can you please try the following? >>> import mxnet as mx
>>> a = mx.nd.random_uniform(shape=(2, 3), ctx=mx.cpu())
>>> b = mx.nd.random_uniform(shape=(3, 2), ctx=mx.cpu())
>>> mx.nd.dot(a, b).asnumpy() Can you also try this but replace |
@gustavla Thanks for reply. So with mx.gpu() it works, but same problem when using cpu for this script |
@francisbitontistudio Thanks, that is really useful! After further investigation, it turns out that the MXNet binaries for the 0.11.0 release were compiled with x86_64 extensions not available on all hardware. This seems to have been fixed in the 0.12.0 release, which unfortunately is not yet supported by Turi Create. We are working on this and consider it high priority (#17). Until then, I really wish I had a better work-around, but the only sure way to fix it is to compile MXNet yourself from source. This could get tricky (instructions), so I will let you know if I think of a better work-around. Sorry about this and thanks again for helping track this down. |
I may have found a work-around, although proceed with caution. MXNet puts up what looks to be nightly builds. I tried one of the 0.11 ones and this seems to resolve this issue. I do not know much about these versions, so I can't guarantee everything else will work correctly. Turi Create will also still complain that you have the wrong MXNet version. pip install -U mxnet==0.11.1b20170822 Another thing you can do is install version 0.12.1. I am actually working on support for that version right now and the only issue seems to be in the Activity Classifier. So, if you are not using that model, then it should work and you can ignore the warning. This is of course only a temporary work-around until official support has been added. |
@gustavla using 0.12.1 works for me. Thanks a lot! |
Models Store Train/Validation Accuracy/RMSE in Consistent Way
I am getting this error while training:
I am using ubuntu 16.04 on Windows 10 WSL.
The code is as following
The text was updated successfully, but these errors were encountered: