New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
versions (1.x.x) SIGSEGV in OSX ND4J CPU #8156
Comments
|
Please try again with a non-beta version of macOS.
|
|
Just updated my osX to 10.15 (Catalina). It is not a beta anymore. |
|
I see, it probably has something to do with some interaction between GCC/libstdc++ and Xcode's Clang/libc++. libnd4j has been relying on OpenMP, which Xcode doesn't provide, but @raver119 plans to move away from that. That would allow us to use Xcode only and probably work around these kinds of issues... |
@petrychenko Thats interesting, not sure how it would affect but its definitely worth to give a try. My java as seen in the previous log is |
|
I've upgraded to Catalina yesterday, so i'll check it out... But ye, as @saudet mentioned, we're working on replacement for OpenMP right now, so we hope to switch to Apple Clang for macOS really soon. |
|
Made the mistake of upgrading to 10.15 over the weekend and now I'm blocked because of this. I tried
with always the same result. :-( |
|
We are almost done with GCC replacement for macOS, just waiting for CI now. Once it's up - you'll be able to use snapshots.
The only alternative I can offer you right now is to build from sources
With best regards,
raver119
22 окт. 2019 г., 16:20 +0300, ff-will <notifications@github.com>, писал:
… Made the mistake of upgrading to 10.15 over the weekend and now I'm blocked because of this. I tried
• Oracle JDK 8
• OpenJDK 8
• OpenJDK 11
with always the same result. :-(
—
You are receiving this because you were assigned.
Reply to this email directly, view it on GitHub, or unsubscribe.
|
|
Latest AdoptOpenJDK 13 worked for me. |
|
Latest AdoptOpenJDK 13 did not fix it for me. |
|
Am facing a similar issue using nd4j 1.0.0-beta on Linux server where the server is crashing while doing inference. Any suggestions on what could be happening there ? |
|
@PMacho What does "worked for me" mean? According to my tests
|
Well, I didn’t check for correct function. I only tried the quick start example. It didn’t crash and it started printing log information, that it was training the network. |
|
@nimishatandon that is a separate problem. The problem here is specifically about macos. Please open a separate issue for your problem. |
|
Is there any update on this? I had to upgrade to Catalina over the weekend for work 🙄 and now my project isn't working. If there is a fix I can try out, I would be willing to help test possible solutions. |
|
Pull request merged, snapshots are getting build, so we'll be testing on random macs in next few days. |
|
PR was merged like a hour ago. |
|
AFAIK, issue is resolved. If, for some reason, you're still able to reproduce it - please reopen this issue or file a new one. |
|
That's great! Thanks @raver119 ! |
|
fix is already at eclipse master, and already available in daily snapshots. |
|
as for release: we hope release will be up early next week. |
|
@raver119 Thanks for that, and can I ask how to switch beta5 to snapshots in the dl4j example projects? Should I just change the parent |
|
http://deeplearning4j.org/docs/latest/deeplearning4j-config-snapshots |
|
@treo Thanks for that, it works and I forgot to add I hope this issue can be added to a release version soon. Updating macOS should be careful. Anyways, thanks again. |
|
problem still exists on 1.0.0-beta5, java 1.8 HotSpot. |
|
Sure. fix was applied in current master, hence it's available in snapshot builds.
With best regards,
raver119
21 нояб. 2019 г., 16:11 +0300, yexingzhe <notifications@github.com>, писал:
… problem still exists on 1.0.0-beta5, java 1.8 HotSpot.
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub, or unsubscribe.
|
|
No it is not working in 1.0.0-beta5, java 1.8. |
|
Sure it doesn't. Fix was applied in snapshots only, and will also be available in upcoming 1.0.0-beta6. http://deeplearning4j.org/docs/latest/deeplearning4j-config-snapshots |
|
I have tried with SNAPSHOT versions and it doesnt work. In #927 i have posted the log. Ok, i am adding repository tag. Testing it now. This is the pom.xml This is the stacktrace after running clean package -Djavacpp.platform=macosx-x86_64 |
|
Great. Show your pom.xml and output (or crash log) you've got there please. |
|
I have updated the above comment with pom.xml and stacktrace. Thank you @raver119 |
|
@alonsoir I don't see the crashlog in the examples issue or here. |
|
Ok, got it. |
|
@alonsoir you're definitely still using 1.0.0-beta5, not snapshots. |
This particular library comes from gcc, so you're definitely still using 1.0.0-beta5. Current macOS snapshots (and upcoming release) is built with clang. Same with these libraries: They are not used anymore. |
|
Ok, i thought that changing parent`s pom did the change. I will change every pom.xml in the related projects, i mean, dl4j-examples and shared-utilities. There are compiling errors. I changed shared-utilities pom.xml to use SNAPSHOT version and dl4j-examples pom.xml file. |
|
TFGraphMapper.getInstance method is not longer present in nd4j-api-1.0.0-SNAPSHOT |
Just use the static methods - TFGraphMapper.importGraph etc |
|
Thank you @AlexDBlack , i will try to do the fix, in the meantime, i will expect to the release of beta6. Do you have any roadmap with the release of the stable version? Thank you. |
|
Full 1.0.0 should be the next release after the upcoming 1.0.0-beta6 release. |
|
any other quickfix @AlexDBlack ? |
|
I am having the same issue, even with the snapshot version. here is my pom.xml I think the problem that iam still running the beta5 version but I don't how. I have changed the version and added the repository tag. but with looking at the log file I found the following entries.
|
|
No, you don't: |
I'm not following. Do you mean: is there any fix other than switching to snapshots, waiting for the next release (coming very soon) or not using Mac? |
|
I'm on a Mac and I installed Catalina. :-( I got a clean compile via mvn and then started getting the errors others have found above. I have tried switching my pom.xml from 1.0.0-beta5 to 1.0.0-SNAPSHOT. Now I no longer can get a clean compile. I run into: deeplearning4j/examples/modelimport/tensorflow/LoadTensorFlowMNISTMLP.java:[67,39] cannot find symbol . ( and a couple more like this in LoadTensorFlowMNISTMLP.java) IntelliJ doesn't show any error in the editor. I am using java version "1.8.0_231". Anybody see anything like this? I'm scared to try to revert my Catalina to Mohave. |
|
Did you also add I have been able to run on Catalina with the |
|
Yea, I added that block in the pom.xml files where Maven complained. I also added this to the main pom.xml: I tried : mvn clean install , tried: mvn package -U outside of IntelliJ and tried the maven compile inside of IntelliJ. Pretty sure I am running with Java 1.8 everywhere. It makes no sense to me that IntelliJ doesn't mark that line with an error but the mvn compile does. |
|
I am also facing the same in Catalina. I haven't found any solution yet.
…On Thu, Dec 5, 2019 at 2:29 AM Jim Passmore ***@***.***> wrote:
Yea, I added that block in the pom.xml files where Maven complained. I
also added this to the main pom.xml:
<java.version>1.8</java.version>
<nd4j.version>1.0.0-SNAPSHOT</nd4j.version>
<dl4j.version>1.0.0-SNAPSHOT</dl4j.version>
<datavec.version>1.0.0-SNAPSHOT</datavec.version>
<arbiter.version>1.0.0-SNAPSHOT</arbiter.version>
<rl4j.version>1.0.0-SNAPSHOT</rl4j.version>
I tried : mvn clean install , tried: mvn package -U outside of IntelliJ
and tried the maven compile inside of IntelliJ. Pretty sure I am running
with Java 1.8 everywhere.
It makes no sense to me that IntelliJ doesn't mark that line with an error
but the mvn compile does.
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#8156>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ALMGASL5AGVPV3XQVYF7ROLQXAK3BANCNFSM4IQO52AQ>
.
|
|
show full pom.xml please as gist - https://gist.github.com/ |
|
Does 1.0.0-beta6 fix this issue? I cant find anything related to this issue in beta6 release notes. |
|
Yes, this should be fixed in 1.0.0-beta6. |
|
The 1.0.0-beta6 release fixed the issue for me. Thanks to all involved, I really didn't want to rollback Catalina from my Mac. |
Issue Description
SIGSEGV in OSX with all versions above 0.9.1
Version Information
Affects all the versions after 0.9.1
1.0.0-beta4
1.0.0-beta3
1.0.0-beta2
1.0.0-beta
1.0.0-alpha
MacOS Catalina 10.15 Beta (19A487l)
MacBook Pro (13-inch, 2018, Four Thunderbolt 3 Ports)
2,3 GHz Intel Core i5
16GB
Additional Information
The text was updated successfully, but these errors were encountered: