create a C benchmark with cublas #1

Merged
MatthieuCourbariaux merged 2 commits into MatthieuCourbariaux:master from ppwwyyxx:master on Feb 12, 2016

Contributor

ppwwyyxx commented Feb 12, 2016

Hi,
Your work is great! I ran some benchmarks in C++, and they show that the xnor kernel is 23x faster than your baseline gemm and 3.3x faster than NVIDIA cublas gemm on a GTX 980.

Also, I think comparing with Theano doesn't make a lot of sense, because everything seems to be slowed down in Theano. cublas would be a good baseline to compare against, since it is the fastest available gemm implementation.
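For readers unfamiliar with the term, here is a minimal CPU sketch of the XNOR-and-popcount idea that binary gemm kernels of this kind are generally built on. It is only an illustration and not the GPU kernel benchmarked in this pull request. The function names (`pack_signs`, `popcount32`, `xnor_dot`, `ref_dot`) are hypothetical, and the sketch assumes vector lengths that are a multiple of 32.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Pack a vector of +1/-1 values into 32-bit words.
// Bit = 1 encodes +1, bit = 0 encodes -1.
// Assumption: v.size() is a multiple of 32.
std::vector<uint32_t> pack_signs(const std::vector<int>& v) {
    std::vector<uint32_t> out(v.size() / 32, 0u);
    for (std::size_t i = 0; i < v.size(); ++i) {
        if (v[i] > 0) out[i / 32] |= (1u << (i % 32));
    }
    return out;
}

// Portable population count (number of set bits).
int popcount32(uint32_t x) {
    int count = 0;
    while (x) {
        x &= x - 1;  // clear the lowest set bit
        ++count;
    }
    return count;
}

// Dot product of two packed +1/-1 vectors of equal length.
// XNOR marks positions where the signs agree, so
// dot = (#agree) - (#disagree) = 2 * (#agree) - n.
int xnor_dot(const std::vector<uint32_t>& a, const std::vector<uint32_t>& b) {
    int agree = 0;
    for (std::size_t i = 0; i < a.size(); ++i) {
        agree += popcount32(~(a[i] ^ b[i]));
    }
    const int n = 32 * static_cast<int>(a.size());
    return 2 * agree - n;
}

// Plain reference dot product on the unpacked +1/-1 values.
int ref_dot(const std::vector<int>& a, const std::vector<int>& b) {
    int sum = 0;
    for (std::size_t i = 0; i < a.size(); ++i) sum += a[i] * b[i];
    return sum;
}
```

Each 32-bit word handles 32 multiply-accumulates with one XOR, one NOT, and one popcount, which is the general reason a binary kernel can outrun a float gemm. The speedups quoted above are measurements from the benchmark, not something this sketch reproduces.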

MatthieuCourbariaux added a commit that referenced this pull request Feb 12, 2016

Merge pull request #1 from ppwwyyxx/master
create a C benchmark with cublas

MatthieuCourbariaux merged commit 2e80a30 into MatthieuCourbariaux:master on Feb 12, 2016

Owner

MatthieuCourbariaux commented Feb 12, 2016

Hi Yuxin,

I tried your C benchmark and it works great, thank you very much!
I think you are perfectly right: it makes much more sense to compare our kernel with cublas.
You just earned a place in our article's acknowledgements :)

ppwwyyxx referenced this pull request on Jul 29, 2016

Closed

Binary ops #1592
