Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Unpack input in ArgMin/Max WebGL kernel #5437

Merged
merged 2 commits into from Aug 13, 2021

Conversation

ahmedsabie
Copy link
Contributor

@ahmedsabie ahmedsabie commented Aug 4, 2021

This prevents each shader from having to unpack the input again due to lazy unpacking

To see the logs from the Cloud Build CI, please join either our discussion or announcement mailing list.


This change is Reviewable

@google-cla google-cla bot added the cla: yes label Aug 4, 2021
Copy link
Collaborator

@lina128 lina128 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you Ahmed, have you benchmarked the performance change? How much time does it reduce?

Reviewable status: 0 of 1 approvals obtained

@ahmedsabie ahmedsabie marked this pull request as ready for review August 13, 2021 19:01
@ahmedsabie
Copy link
Contributor Author

Unpack is pretty fast so the saving is mostly having less shader calls, I see around 0.2ms per call and there's around log(log(N)) calls to it where N Is last dimension of input.

Copy link
Collaborator

@lina128 lina128 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for the info.

Reviewable status: :shipit: complete! 1 of 1 approvals obtained (waiting on @pyu10055)

@lina128 lina128 merged commit fab8f0f into tensorflow:master Aug 13, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants