[WASM] Transpose the filter of the convolution before calling xnnpack. #2344
Conversation
All (the pull request submitter and all commit authors) CLAs are signed, but one or more commits were authored or co-authored by someone other than the pull request submitter. We need to confirm that all authors are ok with their commits being contributed to this project. Please have them confirm that by leaving a comment that contains only "@googlebot I consent." Note to project maintainer: There may be cases where the author cannot leave a comment, or the comment is not properly detected as consent. In those cases, you can manually confirm consent of the commit author(s) and set the CLA label. ℹ️ Googlers: Go here for more info.
@googlebot I consent.
Maratyszcza
left a comment
The code is more complicated than I expected. It would be sufficient to always do just a 2D transpose [M, N] -> [N, M], where M = kernel height * kernel width * input channels and N = output channels.
Reviewable status: 0 of 1 approvals obtained (waiting on @annxingyuan, @dsmilkov, and @Maratyszcza)
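In concrete terms, that suggestion amounts to viewing the HWIO filter as a row-major [M, N] matrix and emitting its transpose. A minimal sketch under that assumption (illustrative names, not the PR's actual code):

```cpp
#include <cstddef>
#include <vector>

// Transpose a row-major [m, n] matrix into a row-major [n, m] matrix.
// For a conv filter, m = kernel_height * kernel_width * input_channels
// and n = output_channels, so this moves output channels to the
// outermost dimension.
std::vector<float> Transpose2D(const float* in, std::size_t m, std::size_t n) {
  std::vector<float> out(m * n);
  for (std::size_t row = 0; row < m; ++row) {
    for (std::size_t col = 0; col < n; ++col) {
      out[col * m + row] = in[row * n + col];
    }
  }
  return out;
}
```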
Most of the complexity is actually just moving transpose to a shared implementation. However, you are right that a direct 2D transpose is possible here (we do this internally anyway), so I made it 2D.
Maratyszcza
left a comment
Reviewable status: 0 of 1 approvals obtained (waiting on @annxingyuan, @dsmilkov, and @nsthorat)
tfjs-backend-wasm/src/cc/kernels/Conv2D.cc, line 112 at r4 (raw file):
// This can be transposed with a 2d transpose to move output_channels to the
// outer most dimension.
float* transposed_filter = new float[filter_info.size]();
It is better to avoid directly allocating memory via new as it can lead to memory leaks. A safer way would be to create an std::vector<float> transposed_filter(filter_info.size()). The memory for the vector will be automatically released when it goes out of scope.
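For illustration, a brief sketch of the two allocation styles contrasted above (placeholder names, not the PR's code):

```cpp
#include <cstddef>
#include <vector>

void AllocationSketch(std::size_t filter_size) {
  // Raw allocation: must be paired with delete[] on every exit path,
  // otherwise the buffer leaks (e.g. if an early return is added later).
  float* raw = new float[filter_size]();
  // ... use raw ...
  delete[] raw;

  // RAII alternative: storage is zero-initialized and released
  // automatically when the vector goes out of scope.
  std::vector<float> transposed_filter(filter_size);
  // ... use transposed_filter.data() ...
}
```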
Maratyszcza
left a comment
Reviewable status: 0 of 1 approvals obtained (waiting on @annxingyuan, @dsmilkov, and @nsthorat)
nsthorat
left a comment
Reviewable status: 0 of 1 approvals obtained (waiting on @annxingyuan, @dsmilkov, and @Maratyszcza)
tfjs-backend-wasm/src/cc/kernels/Conv2D.cc, line 112 at r4 (raw file):
Previously, Maratyszcza (Marat Dukhan) wrote…
It is better to avoid directly allocating memory via new as it can lead to memory leaks. A safer way would be to create an std::vector<float> transposed_filter(filter_info.size()). The memory for the vector will be automatically released when it goes out of scope.
Thanks! Done!
dsmilkov
left a comment
Great! That transpose should be pretty fast and it's only a one-time cost.
Reviewed 4 of 11 files at r1, 1 of 6 files at r3, 2 of 3 files at r4, 1 of 1 files at r5.
Reviewable status: complete! 1 of 1 approvals obtained (waiting on @annxingyuan, @dsmilkov, and @Maratyszcza)
A Googler has manually verified that the CLAs look good. (Googler, please make sure the reason for overriding the CLA status is clearly documented in these comments.) ℹ️ Googlers: Go here for more info.
XNNPack expects kernels in the following format:
[output channels, kernel height, kernel width, input channels]
TensorFlow and TensorFlow.js use the following format:
[kernel height, kernel width, input channels, output channels]
This PR transposes the filter when an XNNPack kernel is created. Since XNNPack keeps a copy of the filter, we transpose, call XNNPack, and immediately throw out the transposed kernel.
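For reference, a hedged sketch of that layout conversion (the dimension names and the helper are illustrative assumptions, not the PR's actual code):

```cpp
#include <cstddef>
#include <vector>

// Convert a filter stored as [H, W, I, O] (the TensorFlow.js layout) into
// [O, H, W, I] (the layout XNNPack expects). XNNPack copies the weights
// when the convolution operator is created, so the returned vector can be
// discarded right after the call.
std::vector<float> FilterHWIOToOHWI(const float* hwio, std::size_t h,
                                    std::size_t w, std::size_t i,
                                    std::size_t o) {
  std::vector<float> ohwi(h * w * i * o);
  for (std::size_t y = 0; y < h; ++y) {
    for (std::size_t x = 0; x < w; ++x) {
      for (std::size_t c = 0; c < i; ++c) {
        for (std::size_t k = 0; k < o; ++k) {
          ohwi[((k * h + y) * w + x) * i + c] =
              hwio[((y * w + x) * i + c) * o + k];
        }
      }
    }
  }
  return ohwi;
}
```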
To share the transpose implementation, I moved the body of Transpose.cc into a separate transpose_impl.cc/h (they must be named differently beyond capitalization or Bazel gets confused).