Hi, first of all I want to thank you for all you've done!
I've decided to use ILGPU in my deep learning library project. I hope it is as fast as C++ CUDA. Aside from compilation time, I believe it doesn't have any latency when accessing the GPU, which was the issue I was most afraid of. I ran a couple of tests to confirm this.
I am still unsure whether I should use ILGPU, because without some basic kernels provided by @m4rs-mt I don't know how to improve my code. I want to know how to improve the performance of SGEMM. Can you provide at least a simple matrix multiplication kernel? I need to benchmark against cuBLAS and try to improve the performance.
@faruknane Thank you very much for your feedback. Your project looks quite interesting.
I am still unsure whether I should use ILGPU, because without some basic kernels provided by @m4rs-mt I don't know how to improve my code
I'm afraid I don't fully understand your point here. Have you taken a look at the samples repository or the documentation? There are several basic kernels and use cases that show how to use the library appropriately and get started. If you want to write a matrix multiplication kernel, you can refer to a reference implementation and port it to the ILGPU world based on the kernels in the samples repository. However, I totally agree that in the near future we should add such a simple kernel to the samples repository to make "getting used to the library" easier.