Contributor

@qjia7 qjia7 commented Feb 13, 2023

Conv2DBackpropInput time in the ArPortraitDepth model:
26.32ms -> 14.17ms on CFL
6.91ms -> 4.70ms on TGL


@qjia7 qjia7 marked this pull request as ready for review February 14, 2023 03:05
@qjia7 qjia7 requested review from Linchenn, gyagp and xhcao February 14, 2023 03:05
Contributor Author

qjia7 commented Feb 14, 2023

With the second commit, a734dd8, the Conv2DBackpropInput time is further reduced on CFL: 26.32ms -> 14.17ms -> 10.27ms.

Contributor

@xhcao xhcao left a comment

LGTM. Please take care of the formatting of the WGSL code.

Collaborator

@Linchenn Linchenn left a comment

LGTM, great finding!

Reviewable status: :shipit: complete! 2 of 1 approvals obtained (waiting on @gyagp and @qjia7)


tfjs-backend-webgpu/src/conv_backprop_webgpu.ts line 142 at r2 (raw file):

              dotProd[0] = dotProd[0] + tmpval;
            }
          } else if (bDyCVal2) {

We may not need if (bDyCVal2) because we have checked (bDyCVal == false && bDyCVal2 == false) before. Or we could remove if (bDyCVal == false && bDyCVal2 == false) above.
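To illustrate the redundancy being pointed out, here is a minimal TypeScript sketch rather than the actual WGSL from conv_backprop_webgpu.ts. The surrounding branch structure, the `Branch` labels, and the function names are assumptions made for illustration; only the `bDyCVal` and `bDyCVal2` names come from the snippet above.

```typescript
// Hypothetical labels for which branch of the shader loop would run.
type Branch = "skip" | "both" | "first" | "second";

// Assumed shape before the review: an early check rules out the case where
// both flags are false, yet the last branch still re-tests bDyCVal2.
function branchBefore(bDyCVal: boolean, bDyCVal2: boolean): Branch {
  if (!bDyCVal && !bDyCVal2) {
    return "skip";
  }
  if (bDyCVal && bDyCVal2) {
    return "both";
  } else if (bDyCVal) {
    return "first";
  } else if (bDyCVal2) { // always true here, given the early check
    return "second";
  }
  return "skip"; // never reached
}

// First suggestion: keep the early check and use a plain `else`.
function branchPlainElse(bDyCVal: boolean, bDyCVal2: boolean): Branch {
  if (!bDyCVal && !bDyCVal2) {
    return "skip";
  }
  if (bDyCVal && bDyCVal2) {
    return "both";
  } else if (bDyCVal) {
    return "first";
  } else {
    return "second";
  }
}

// Second suggestion: drop the early check and keep `else if (bDyCVal2)`.
function branchNoEarlyCheck(bDyCVal: boolean, bDyCVal2: boolean): Branch {
  if (bDyCVal && bDyCVal2) {
    return "both";
  } else if (bDyCVal) {
    return "first";
  } else if (bDyCVal2) {
    return "second";
  }
  return "skip"; // both flags false: nothing to accumulate
}
```

All three functions select the same branch for each of the four flag combinations, so either suggestion removes one comparison without changing behavior.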

Contributor Author

@qjia7 qjia7 left a comment

Reviewable status: :shipit: complete! 2 of 1 approvals obtained (waiting on @gyagp and @Linchenn)


tfjs-backend-webgpu/src/conv_backprop_webgpu.ts line 142 at r2 (raw file):

Previously, Linchenn wrote…

We may not need if (bDyCVal2) because we have checked (bDyCVal == false && bDyCVal2 == false) before. Or we could remove if (bDyCVal == false && bDyCVal2 == false) above.

Done.

@qjia7 qjia7 merged commit b8ba2df into tensorflow:master Feb 17, 2023
@qjia7 qjia7 deleted the conv2d_transpose_opt branch February 17, 2023 08:40