New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
MXNet backend for Keras #8697
Comments
Hello Keras Community, We have completed the majority of operators required for training a Keras MLP and CNNs with MXNet backend. (CPU, 1 GPU, multi-GPU). I am currently working on fixing broken test cases and preparing to create a PR here with keras-team. I am hoping to create the first PR in next 2 weeks. In the meantime, here is a quick guide to try out Keras with MXNet backend on the work in progress repository. https://github.com/deep-learning-tools/keras/wiki/Installation-Guide---Keras-with-MXNet-backend Thanks, |
Submitted first of the PR for adding MXNet backend support. |
Great job for supporting mxnet as Keras backend! Thanks! |
Hello @dkasper26,
|
Very interesting. I really look forward to having the mxnet backend in order to have better rnn training speed (TF is still slow with lstms) |
Hello Keras team,
@jiajiechen and myself have started working on adding MXNet backend for Keras 2. With the option to use MXNet as Keras backend, Keras users can benefit from MXNet’s performance and scalability, while still using their Keras skillset and models.
We are extending the work done by (@piiswrong, @kevinthesun, @yajiedesign, @howard0su) at https://github.com/dmlc/keras for Keras 1.2 with MXNet backend.
Here is the repo we are currently working on - https://github.com/deep-learning-tools/keras/tree/keras2_mxnet_backend (Note this is just a development fork for us to work initially, we will create PRs with keras from this repo). Till now we have added subset of operators in the MXNet backend.
To get focussed early feedback and continuously push out the work done for the users, we plan to create incremental PRs. First we plan to create PR with MXNet backend implementing basic operators(variable manipulations, Linear Algebra operations, element wise operations, shape operations and more), then we will create PR supporting CNN architectures to train, save, load and inference. Followed by support to Distributed multi-machine training capability for Keras users with MXNet backend, RNNs, Sparse and more.
Contributions are very welcome. Please participate in code reviews, early testing and development activities. You can create issues and PRs https://github.com/deep-learning-tools/keras/tree/keras2_mxnet_backend till we get a good stable code merged into fchollet/keras.
Thanks,
The text was updated successfully, but these errors were encountered: