[webgl] use shader loop instead of expanding with js to reduce shader size for packed Depthwise #5714

pyu10055 · 2021-10-11T19:43:24Z

This is a simple change to move the outermost loop of filter height from js code expansion to shader loop, which will greatly reduce the shader size for depthwise (factor of 3-5x depends on the filter height).
Tested on mobilenet and effientdet models, it will improve the model initialization time close to the same as unpack version.

fix #5343
To see the logs from the Cloud Build CI, please join either our discussion or announcement mailing list.

This change is

qjia7 · 2021-10-12T01:21:06Z

This is amazing! Didn't expect the loop unrolling will bring so such overhead for compiling time.
Will it affect much if we also move for (let c = 0; c < filterWidth; c++) to shader loop?

pyu10055

thanks, the line for (let c = 0; c < filterWidth; c++) is just to initiate the variables for the loop, it cannot be unrolled.

Reviewable status: 0 of 1 approvals obtained (waiting on @lina128)

lina128

Hi Ping, will this affect inference performance? I see that in the unpacked version, Daniel has a TODO to flatten the for loop, I don't know whether it's because of performance. https://github.com/tensorflow/tfjs/blob/master/tfjs-backend-webgl/src/conv_gpu_depthwise.ts#L96

If inference performance doesn't change significantly, then the change looks good. Thank you!

Reviewable status: 0 of 1 approvals obtained (waiting on @jinjingforever)

pyu10055

The performance is the same for mobilenet and efficientdet, maybe a slightly better for some reason. My understanding of the comment in unpacked shader is aiming to use vec4 to group summation, which is already there for packed version.

Reviewable status: 0 of 1 approvals obtained (waiting on @jinjingforever)

lina128

Got it, great. Thank you! LGTM.

Reviewable status: complete! 1 of 1 approvals obtained (waiting on @jinjingforever)

use shader loop instead of expand with js to reduce shader size

8a3546e

google-cla bot added the cla: yes label Oct 11, 2021

pyu10055 requested a review from lina128 October 11, 2021 19:43

Merge branch 'master' into pack_depthwise

f8707f1

pyu10055 changed the title ~~[webgl] use shader loop instead of expand with js to reduce shader size for packed Depthwise~~ [webgl] use shader loop instead of expanding with js to reduce shader size for packed Depthwise Oct 12, 2021

pyu10055 commented Oct 12, 2021

View reviewed changes

pyu10055 requested a review from jinjingforever October 12, 2021 17:01

lina128 reviewed Oct 12, 2021

View reviewed changes

pyu10055 commented Oct 12, 2021

View reviewed changes

Merge branch 'master' into pack_depthwise

ea81d8c

lina128 approved these changes Oct 12, 2021

View reviewed changes

pyu10055 merged commit 5d57453 into master Oct 12, 2021

pyu10055 deleted the pack_depthwise branch October 12, 2021 20:01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[webgl] use shader loop instead of expanding with js to reduce shader size for packed Depthwise #5714

[webgl] use shader loop instead of expanding with js to reduce shader size for packed Depthwise #5714

pyu10055 commented Oct 11, 2021 •

edited by nsthorat

qjia7 commented Oct 12, 2021

pyu10055 left a comment

lina128 left a comment

pyu10055 left a comment

lina128 left a comment

[webgl] use shader loop instead of expanding with js to reduce shader size for packed Depthwise #5714

[webgl] use shader loop instead of expanding with js to reduce shader size for packed Depthwise #5714

Conversation

pyu10055 commented Oct 11, 2021 • edited by nsthorat

qjia7 commented Oct 12, 2021

pyu10055 left a comment

Choose a reason for hiding this comment

lina128 left a comment

Choose a reason for hiding this comment

pyu10055 left a comment

Choose a reason for hiding this comment

lina128 left a comment

Choose a reason for hiding this comment

pyu10055 commented Oct 11, 2021 •

edited by nsthorat