deprecation of genericLSTM #17

Closed
apbard opened this issue Jun 12, 2015 · 10 comments

Comments

@apbard
Contributor

apbard commented Jun 12, 2015

I have seen that in the latest version genericLSTM has been "softly" deprecated. Could you please explain why this choice was made?

@tmbdev
Owner

tmbdev commented Jun 12, 2015

I've run a lot of benchmarks that have shown that (1) peephole optimizations don't help (and usually hurt), and (2) any choice of non-linearity other than the default one performs considerably worse.

I'll publish that as an arXiv tech report soon.
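
For readers unfamiliar with the two knobs mentioned above, here is a minimal sketch of a single LSTM step in which both are configurable. It is not taken from this project's code: the function `lstm_step`, its parameters, and the weight layout are hypothetical, it handles a single scalar unit only, and it assumes the common peephole formulation in which the input and forget gates see the previous cell state and the output gate sees the new one.

```python
import math


def sigmoid(x):
    """Logistic function used for the three gates."""
    return 1.0 / (1.0 + math.exp(-x))


def lstm_step(x, h_prev, c_prev, w, peephole=None,
              g=math.tanh, h_act=math.tanh):
    """One step of a single-unit LSTM cell (scalars only, illustration).

    w        -- dict mapping "i", "f", "o", "c" to (w_x, w_h, bias)
    peephole -- optional dict with peephole weights for "i", "f", "o";
                None means no peephole connections
    g, h_act -- non-linearities for the cell input and the cell output
    """
    p = peephole or {"i": 0.0, "f": 0.0, "o": 0.0}

    def pre(gate):
        # Weighted sum of the current input and the previous output.
        w_x, w_h, b = w[gate]
        return w_x * x + w_h * h_prev + b

    # Input and forget gates may peek at the previous cell state.
    i = sigmoid(pre("i") + p["i"] * c_prev)
    f = sigmoid(pre("f") + p["f"] * c_prev)
    # New cell state: gated old state plus gated candidate input.
    c = f * c_prev + i * g(pre("c"))
    # Output gate may peek at the new cell state.
    o = sigmoid(pre("o") + p["o"] * c)
    h = o * h_act(c)
    return h, c


# Example: same weights, with and without a forget-gate peephole.
weights = {k: (0.0, 0.0, 0.0) for k in ("i", "f", "o", "c")}
plain = lstm_step(1.0, 0.0, 1.0, weights)
peep = lstm_step(1.0, 0.0, 1.0, weights,
                 peephole={"i": 0.0, "f": 1.0, "o": 0.0})
```

Passing `peephole=None` and leaving `g` and `h_act` at `math.tanh` gives the plain cell; the other arguments correspond to the kinds of variations the benchmarks compared.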

@BestSonny

Is the arXiv link to your tech report available now?

@tmbdev
Owner

tmbdev commented Aug 5, 2015

I got publication clearance, but it's not quite out yet. I'll try to push it in the next few days.

@BestSonny

Thank you! Have you ever tried to push a GPU version of the LSTM, since the community seems to be increasingly interested in the RNN field?

@tmbdev
Owner

tmbdev commented Aug 6, 2015

I'm planning on GPU support via Eigen; however, it will still be a few months before the necessary dependencies are out.

@amitdo
Contributor

amitdo commented Aug 17, 2015

@BestSonny, here it is:

Benchmarking of LSTM Networks
Thomas M. Breuel

http://arxiv.org/abs/1508.02774

@BestSonny

Thank you!

@futurely

@tmbdev, mshadow may be a strong alternative to Eigen. Both cxxnet and mxnet are built upon it.

@tmbdev
Owner

tmbdev commented Oct 4, 2015

Thanks. Looks like Eigen and mshadow have similar goals. Is there any performance data comparing the two?

tmbdev closed this as completed Oct 4, 2015

@futurely

futurely commented Oct 7, 2015

@tmbdev, mshadow has very detailed tutorials and other documentation, including for multi-GPU support, while Eigen documents almost nothing related to GPUs on its official website.

More importantly, mshadow exploits the powerful NVIDIA cuDNN library. It's impossible for Eigen to catch up with mshadow in performance on GPUs.
