[JIT] Use __ldg for CUDA kernels in fuser #18540

t-vi · 2019-03-27T21:01:14Z

While benchmarking a kernel with broadcasted inputs, I noticed
that is was much slower than a hand-coded kernel for the smae task.

The kernel in question computed a * b + c for a of shape
32 x 32 x 10240 and b and c of shape 1 x 32 x 1.

This patch accellerates said kernel from 450us to 250us on my GTX1080Ti.

I didn't change half because there doesn't seem to be __ldg for
half.

An alternative could be to sprinkle const and restrict.

While benchmarking a kernel with broadcasted inputs, I noticed that is was much slower than a hand-coded kernel for the smae task. The kernel in question computed a * b + c for a of shape 32 x 32 x 10240 and b and c of shape 1 x 32 x 1. This patch accellerates said kernel from 450us to 250us on my GTX1080Ti. I didn't change half because there doesn't seem to be __ldg for half. An alternative could be to sprinkle const and restrict.

facebook-github-bot

@soumith is landing this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

facebook-github-bot · 2019-03-28T04:03:17Z

@soumith merged this pull request in 9696f06.

facebook-github-bot added the oncall: jit Add this issue/PR to JIT oncall triage queue label Mar 27, 2019

t-vi force-pushed the fuser_ldg branch from c39f8e3 to b233145 Compare March 27, 2019 21:18

apaszke approved these changes Mar 27, 2019

View reviewed changes

facebook-github-bot reviewed Mar 28, 2019

View reviewed changes

facebook-github-bot closed this in 9696f06 Mar 28, 2019

facebook-github-bot added the merged label Mar 28, 2019

ezyang added the open source label Jun 24, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[JIT] Use __ldg for CUDA kernels in fuser #18540

[JIT] Use __ldg for CUDA kernels in fuser #18540

t-vi commented Mar 27, 2019

facebook-github-bot left a comment

facebook-github-bot commented Mar 28, 2019

[JIT] Use __ldg for CUDA kernels in fuser #18540

[JIT] Use __ldg for CUDA kernels in fuser #18540

Conversation

t-vi commented Mar 27, 2019

facebook-github-bot left a comment

Choose a reason for hiding this comment

facebook-github-bot commented Mar 28, 2019