webgpu: support LRNGrad kernel #7196

xhcao · 2022-12-22T05:25:28Z

To see the logs from the Cloud Build CI, please join either our discussion or announcement mailing list.

This change is

gyagp

LGTM with one nit

gyagp · 2022-12-23T06:03:50Z

tfjs-backend-webgpu/src/lrn_grad_webgpu.ts

+            }
+            else if (k >= depthBegin && k < depthEnd){
+              var dyi = -2.0 * uniforms.alpha * uniforms.beta
+                * getInputImage(b ,r ,c, k) * getOutputImage(b, r, c, d) / norm;


Nit

Suggested change

* getInputImage(b ,r ,c, k) * getOutputImage(b, r, c, d) / norm;

* getInputImage(b, r, c, k) * getOutputImage(b, r, c, d) / norm;

You may also fix the WebGL code.

qjia7 · 2023-01-11T09:21:24Z

tfjs-backend-webgpu/src/lrn_grad_webgpu.ts

+  dispatch: [number, number, number];
+  variableNames = ['inputImage', 'outputImage', 'dy'];
+  uniforms =
+      'depth : i32, depthRadius : i32, bias : f32, alpha : f32, beta : f32,';


It seems that depth is not necessary as a uniform. You can use let MAX_DEPTH_END = uniforms.outShape[3]; in shader.

qjia7 · 2023-01-11T09:25:04Z

tfjs-backend-webgpu/src/lrn_grad_webgpu.ts

+          let depthEnd = min(uniforms.depth, d + uniforms.depthRadius + 1);
+
+          let MIN_DEPTH_BEGIN = 0;
+          let MAX_DEPTH_END = uniforms.depth;


Move L54-L55 out of for (var d = 0; d < uniforms.depth; d++). It seems that you can also use MAX_DEPTH_END for the outermost for.

qjia7

Two more comments. Sorry missed them in last review.

qjia7 · 2023-01-17T05:26:38Z

tfjs-backend-webgpu/src/lrn_grad_webgpu.ts

+              continue;
+            }
+            else if (k >= depthBegin && k < depthEnd) {
+              norm += getInputImage(b, r, c, k) * getInputImage(b, r, c, k);


It seems getInputImage(b, r, c, k) is called twice. Is it better to cache it like below:

let inputValue = getInputImage(b, r, c, k); norm += inputValue * inputValue ;

It may cost one more register here, let compiler optimizes the code, Is it OK?

qjia7 · 2023-01-17T05:30:00Z

tfjs-backend-webgpu/src/lrn_grad_webgpu.ts

+            }
+            else if (k >= depthBegin && k < depthEnd){
+              var dyi = -2.0 * uniforms.alpha * uniforms.beta
+                * getInputImage(b, r, c, k) * getOutputImage(b, r, c, d) / norm;


getOutputImage(b, r, c, d) should be put out of for(var k = MIN_DEPTH_BEGIN; k < MAX_DEPTH_END; k++) since the arguments are never changed.

Because getOutputImage(b, r, c, d) is in if-else branch, other branches may does not need to access memory, it is not necessary to put the IO access out of for-loop.

qjia7

LGTM, thanks.

gyagp approved these changes Dec 23, 2022

View reviewed changes

xhcao force-pushed the LRNGrad branch from c028030 to c7f55ca Compare December 23, 2022 07:11

xhcao force-pushed the LRNGrad branch from c7f55ca to 450a6ad Compare January 9, 2023 08:13

xhcao requested a review from qjia7 January 9, 2023 08:16

qjia7 reviewed Jan 11, 2023

View reviewed changes

webgpu: support LRNGrad kernel

2bca515

xhcao force-pushed the LRNGrad branch from 450a6ad to 8ca1333 Compare January 17, 2023 04:50

qjia7 reviewed Jan 17, 2023

View reviewed changes

Address Jiajia's comments

8c44348

xhcao force-pushed the LRNGrad branch from 8ca1333 to 8c44348 Compare January 17, 2023 06:10

qjia7 approved these changes Jan 17, 2023

View reviewed changes

xhcao merged commit 8fa597b into tensorflow:master Jan 17, 2023

	* getInputImage(b ,r ,c, k) * getOutputImage(b, r, c, d) / norm;
	* getInputImage(b, r, c, k) * getOutputImage(b, r, c, d) / norm;

webgpu: support LRNGrad kernel #7196

webgpu: support LRNGrad kernel #7196

Uh oh!

Conversation

xhcao commented Dec 22, 2022 • edited by dsmilkov Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gyagp left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

qjia7 left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

qjia7 left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

xhcao commented Dec 22, 2022 •

edited by dsmilkov

Loading