-
Notifications
You must be signed in to change notification settings - Fork 171
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
CaffeOnSpark #96
Comments
Great question! @anfeng, @junshi15, Mridul Jain, and @cypof are doing some really great work with CaffeOnSpark. Long term, this will depend on adoption. In the meantime, there are enough differences in design decisions and implementation choices between the two projects that it makes sense to develop both. |
...And also dl4j. Are there any plan to enter in spark-ml? Kudos to @saudet for his Caffe and Tensorflow java-cpp preset. |
Deeplearning4j is a very different and interesting approach to deep networks on Spark. On the one hand, I would expect it to work more seamlessly since they build everything themselves and so have to worry less about integrating with existing open source projects and native code. On the other hand, compatibility with existing standards (Caffe models and TensorFlow models) is harder. It's a good idea and very worthwhile, but it doesn't change our immediate plans. As for Spark-ML. We are open to the idea, but haven't started working on that. Indeed kudos to @saudet. He's doing an amazing job with JavaCPP. |
Should have cced me ;) thanks @saudet. We just added a computation graph api as well. We intend on adding other neural nets via javacpp from here on out as well. FWIW, we are starting to work with IBM on some of their integration /cc @cfregly if that helps at all. Adding backends to layers is one of our next priorities which will mean different kinds of integrations. That will play well with the computation graph. I think there's plenty of integration points if you guys are open for discussion. FWIW, we are almost done with our internal rewrite for c++ internals which will allow us to integrate a bit smoother with different hardware. I'm happy to talk about adding other c++ in there as well. Next will be adding a parameter server since there hasn't been interest on the spark side. FWIW I'm based in the bay area if an in person meeting is required. Good job on spark net! |
Hi @agibsonccc, and thanks for commenting! Sounds like you guys have a lot of great stuff in the pipeline. We're also based in the bay area and looking forward to discussing more! |
@robertnishihara Are you in contact with @thunterdb? Is there also an independent roadmap for Spark-Tensorflow by Databricks? I really hope the there are enough energies to not let fragmentation overcome. |
We've talked, but haven't coordinated. Thanks for bringing this up! @thunterdb, we're also interested in your Spark-Tensorflow roadmap. |
I suggest to take a look also to https://github.com/maxpumperla/elephas. Elephas rely on Keras that already interface Theano and Tensorflow backends via Python. I know that Keras has an easy path to integrate than Java but could be interesting to know other multi backend approach now that Google started to released the "official" distributed version of Tensorflow. |
Thanks for the link! |
I'll reach out.
|
/cc @maxpumperla |
What will be the impact of Yahoo CaffeOnSpark release for SparkNet?
The text was updated successfully, but these errors were encountered: