New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[PyTorch Edge][QNNPack] Depthwise Conv3d mp8x27 (per channel) Neon Kernel #69313
Conversation
…rnel Allows for depthwise conv3d with 3x3x3 kernel Implementation based heavily off of [mp8x25-neon-per-channel.c](https://www.internalfb.com/code/fbsource/[679135d62c0a64e3d0fa0c830aa062ac28f292b8]/fbcode/caffe2/aten/src/ATen/native/quantized/cpu/qnnpack/src/q8dwconv/mp8x25-neon-per-channel.c) (depthwise conv2d with 5x5 kernel) This supports per-channel convolution, but it works for non per-channel too Generated files (caffe2/aten/src/ATen/native/quantized/cpu/qnnpack/wrappers/q8dwconv/*) made with - cd caffe2/aten/src/ATen/native/quantized/cpu/qnnpack - python3 generate-wrapper.py Differential Revision: [D32074096](https://our.internmc.facebook.com/intern/diff/D32074096/) **NOTE FOR REVIEWERS**: This PR has internal Facebook specific changes or comments, please review them on [Phabricator](https://our.internmc.facebook.com/intern/diff/D32074096/)! [ghstack-poisoned]
CI Flow Status⚛️ CI FlowRuleset - Version:
You can add a comment to the PR and tag @pytorchbot with the following commands: # ciflow rerun, "ciflow/default" will always be added automatically
@pytorchbot ciflow rerun
# ciflow rerun with additional labels "-l <ciflow/label_name>", which is equivalent to adding these labels manually and trigger the rerun
@pytorchbot ciflow rerun -l ciflow/scheduled -l ciflow/slow For more information, please take a look at the CI Flow Wiki. |
🔗 Helpful links
💊 CI failures summary and remediationsAs of commit 4e77eba (more details on the Dr. CI page): 💚 💚 Looks good so far! There are no failures yet. 💚 💚 This comment was automatically generated by Dr. CI (expand for details).Please report bugs/suggestions to the (internal) Dr. CI Users group. |
…el) Neon Kernel" Allows for depthwise conv3d with 3x3x3 kernel Implementation based heavily off of [mp8x25-neon-per-channel.c](https://www.internalfb.com/code/fbsource/[679135d62c0a64e3d0fa0c830aa062ac28f292b8]/fbcode/caffe2/aten/src/ATen/native/quantized/cpu/qnnpack/src/q8dwconv/mp8x25-neon-per-channel.c) (depthwise conv2d with 5x5 kernel) This supports per-channel convolution, but it works for non per-channel too Generated files (caffe2/aten/src/ATen/native/quantized/cpu/qnnpack/wrappers/q8dwconv/*) made with - cd caffe2/aten/src/ATen/native/quantized/cpu/qnnpack - python3 generate-wrapper.py Differential Revision: [D32074096](https://our.internmc.facebook.com/intern/diff/D32074096/) **NOTE FOR REVIEWERS**: This PR has internal Facebook specific changes or comments, please review them on [Phabricator](https://our.internmc.facebook.com/intern/diff/D32074096/)! [ghstack-poisoned]
…el) Neon Kernel" Allows for depthwise conv3d with 3x3x3 kernel Implementation based heavily off of [mp8x25-neon-per-channel.c](https://www.internalfb.com/code/fbsource/[679135d62c0a64e3d0fa0c830aa062ac28f292b8]/fbcode/caffe2/aten/src/ATen/native/quantized/cpu/qnnpack/src/q8dwconv/mp8x25-neon-per-channel.c) (depthwise conv2d with 5x5 kernel) This supports per-channel convolution, but it works for non per-channel too Generated files (caffe2/aten/src/ATen/native/quantized/cpu/qnnpack/wrappers/q8dwconv/*) made with - cd caffe2/aten/src/ATen/native/quantized/cpu/qnnpack - python3 generate-wrapper.py Differential Revision: [D32074096](https://our.internmc.facebook.com/intern/diff/D32074096/) **NOTE FOR REVIEWERS**: This PR has internal Facebook specific changes or comments, please review them on [Phabricator](https://our.internmc.facebook.com/intern/diff/D32074096/)! [ghstack-poisoned]
…el) Neon Kernel" Allows for depthwise conv3d with 3x3x3 kernel Implementation based heavily off of [mp8x25-neon-per-channel.c](https://www.internalfb.com/code/fbsource/[679135d62c0a64e3d0fa0c830aa062ac28f292b8]/fbcode/caffe2/aten/src/ATen/native/quantized/cpu/qnnpack/src/q8dwconv/mp8x25-neon-per-channel.c) (depthwise conv2d with 5x5 kernel) This supports per-channel convolution, but it works for non per-channel too Generated files (caffe2/aten/src/ATen/native/quantized/cpu/qnnpack/wrappers/q8dwconv/*) made with - cd caffe2/aten/src/ATen/native/quantized/cpu/qnnpack - python3 generate-wrapper.py Differential Revision: [D32074096](https://our.internmc.facebook.com/intern/diff/D32074096/) **NOTE FOR REVIEWERS**: This PR has internal Facebook specific changes or comments, please review them on [Phabricator](https://our.internmc.facebook.com/intern/diff/D32074096/)! [ghstack-poisoned]
…el) Neon Kernel" Allows for depthwise conv3d with 3x3x3 kernel Implementation based heavily off of [mp8x25-neon-per-channel.c](https://www.internalfb.com/code/fbsource/[679135d62c0a64e3d0fa0c830aa062ac28f292b8]/fbcode/caffe2/aten/src/ATen/native/quantized/cpu/qnnpack/src/q8dwconv/mp8x25-neon-per-channel.c) (depthwise conv2d with 5x5 kernel) This supports per-channel convolution, but it works for non per-channel too Generated files (caffe2/aten/src/ATen/native/quantized/cpu/qnnpack/wrappers/q8dwconv/*) made with - cd caffe2/aten/src/ATen/native/quantized/cpu/qnnpack - python3 generate-wrapper.py Differential Revision: [D32074096](https://our.internmc.facebook.com/intern/diff/D32074096/) **NOTE FOR REVIEWERS**: This PR has internal Facebook specific changes or comments, please review them on [Phabricator](https://our.internmc.facebook.com/intern/diff/D32074096/)! [ghstack-poisoned]
…el) Neon Kernel" Allows for depthwise conv3d with 3x3x3 kernel Implementation based heavily off of [mp8x25-neon-per-channel.c](https://www.internalfb.com/code/fbsource/[679135d62c0a64e3d0fa0c830aa062ac28f292b8]/fbcode/caffe2/aten/src/ATen/native/quantized/cpu/qnnpack/src/q8dwconv/mp8x25-neon-per-channel.c) (depthwise conv2d with 5x5 kernel) This supports per-channel convolution, but it works for non per-channel too Generated files (caffe2/aten/src/ATen/native/quantized/cpu/qnnpack/wrappers/q8dwconv/*) made with - cd caffe2/aten/src/ATen/native/quantized/cpu/qnnpack - python3 generate-wrapper.py Differential Revision: [D32074096](https://our.internmc.facebook.com/intern/diff/D32074096/) **NOTE FOR REVIEWERS**: This PR has internal Facebook specific changes or comments, please review them on [Phabricator](https://our.internmc.facebook.com/intern/diff/D32074096/)! [ghstack-poisoned]
…el) Neon Kernel" Allows for depthwise conv3d with 3x3x3 kernel Implementation based heavily off of [mp8x25-neon-per-channel.c](https://www.internalfb.com/code/fbsource/[679135d62c0a64e3d0fa0c830aa062ac28f292b8]/fbcode/caffe2/aten/src/ATen/native/quantized/cpu/qnnpack/src/q8dwconv/mp8x25-neon-per-channel.c) (depthwise conv2d with 5x5 kernel) This supports per-channel convolution, but it works for non per-channel too Generated files (caffe2/aten/src/ATen/native/quantized/cpu/qnnpack/wrappers/q8dwconv/*) made with - cd caffe2/aten/src/ATen/native/quantized/cpu/qnnpack - python3 generate-wrapper.py Differential Revision: [D32074096](https://our.internmc.facebook.com/intern/diff/D32074096/) **NOTE FOR REVIEWERS**: This PR has internal Facebook specific changes or comments, please review them on [Phabricator](https://our.internmc.facebook.com/intern/diff/D32074096/)! [ghstack-poisoned]
…el) Neon Kernel" Allows for depthwise conv3d with 3x3x3 kernel Implementation based heavily off of [mp8x25-neon-per-channel.c](https://www.internalfb.com/code/fbsource/[679135d62c0a64e3d0fa0c830aa062ac28f292b8]/fbcode/caffe2/aten/src/ATen/native/quantized/cpu/qnnpack/src/q8dwconv/mp8x25-neon-per-channel.c) (depthwise conv2d with 5x5 kernel) This supports per-channel convolution, but it works for non per-channel too Generated files (caffe2/aten/src/ATen/native/quantized/cpu/qnnpack/wrappers/q8dwconv/*) made with - cd caffe2/aten/src/ATen/native/quantized/cpu/qnnpack - python3 generate-wrapper.py Differential Revision: [D32074096](https://our.internmc.facebook.com/intern/diff/D32074096/) **NOTE FOR REVIEWERS**: This PR has internal Facebook specific changes or comments, please review them on [Phabricator](https://our.internmc.facebook.com/intern/diff/D32074096/)! [ghstack-poisoned]
…el) Neon Kernel" Allows for depthwise conv3d with 3x3x3 kernel Implementation based heavily off of [mp8x25-neon-per-channel.c](https://www.internalfb.com/code/fbsource/[679135d62c0a64e3d0fa0c830aa062ac28f292b8]/fbcode/caffe2/aten/src/ATen/native/quantized/cpu/qnnpack/src/q8dwconv/mp8x25-neon-per-channel.c) (depthwise conv2d with 5x5 kernel) This supports per-channel convolution, but it works for non per-channel too Generated files (caffe2/aten/src/ATen/native/quantized/cpu/qnnpack/wrappers/q8dwconv/*) made with - cd caffe2/aten/src/ATen/native/quantized/cpu/qnnpack - python3 generate-wrapper.py Differential Revision: [D32074096](https://our.internmc.facebook.com/intern/diff/D32074096/) **NOTE FOR REVIEWERS**: This PR has internal Facebook specific changes or comments, please review them on [Phabricator](https://our.internmc.facebook.com/intern/diff/D32074096/)! [ghstack-poisoned]
…el) Neon Kernel" Allows for depthwise conv3d with 3x3x3 kernel Implementation based heavily off of [mp8x25-neon-per-channel.c](https://www.internalfb.com/code/fbsource/[679135d62c0a64e3d0fa0c830aa062ac28f292b8]/fbcode/caffe2/aten/src/ATen/native/quantized/cpu/qnnpack/src/q8dwconv/mp8x25-neon-per-channel.c) (depthwise conv2d with 5x5 kernel) This supports per-channel convolution, but it works for non per-channel too Generated files (caffe2/aten/src/ATen/native/quantized/cpu/qnnpack/wrappers/q8dwconv/*) made with - cd caffe2/aten/src/ATen/native/quantized/cpu/qnnpack - python3 generate-wrapper.py Differential Revision: [D32074096](https://our.internmc.facebook.com/intern/diff/D32074096/) **NOTE FOR REVIEWERS**: This PR has internal Facebook specific changes or comments, please review them on [Phabricator](https://our.internmc.facebook.com/intern/diff/D32074096/)! [ghstack-poisoned]
…el) Neon Kernel" Allows for depthwise conv3d with 3x3x3 kernel Implementation based heavily off of [mp8x25-neon-per-channel.c](https://www.internalfb.com/code/fbsource/[679135d62c0a64e3d0fa0c830aa062ac28f292b8]/fbcode/caffe2/aten/src/ATen/native/quantized/cpu/qnnpack/src/q8dwconv/mp8x25-neon-per-channel.c) (depthwise conv2d with 5x5 kernel) This supports per-channel convolution, but it works for non per-channel too Generated files (caffe2/aten/src/ATen/native/quantized/cpu/qnnpack/wrappers/q8dwconv/*) made with - cd caffe2/aten/src/ATen/native/quantized/cpu/qnnpack - python3 generate-wrapper.py Differential Revision: [D32074096](https://our.internmc.facebook.com/intern/diff/D32074096/) **NOTE FOR REVIEWERS**: This PR has internal Facebook specific changes or comments, please review them on [Phabricator](https://our.internmc.facebook.com/intern/diff/D32074096/)! [ghstack-poisoned]
…rnel (#69313) Summary: Pull Request resolved: #69313 Allows for depthwise conv3d with 3x3x3 kernel Implementation based heavily off of [mp8x25-neon-per-channel.c](https://www.internalfb.com/code/fbsource/[679135d62c0a64e3d0fa0c830aa062ac28f292b8]/fbcode/caffe2/aten/src/ATen/native/quantized/cpu/qnnpack/src/q8dwconv/mp8x25-neon-per-channel.c) (depthwise conv2d with 5x5 kernel) This supports per-channel convolution, but it works for non per-channel too Generated files (caffe2/aten/src/ATen/native/quantized/cpu/qnnpack/wrappers/q8dwconv/*) made with - cd caffe2/aten/src/ATen/native/quantized/cpu/qnnpack - python3 generate-wrapper.py ghstack-source-id: 146346785 Test Plan: Test when used in depthwise conv3d later in this diff stack (D31966574) Reviewed By: kimishpatel Differential Revision: D32074096 fbshipit-source-id: 8111926df6ecb89d88ca810deeab87b1c072f55a
Stack from ghstack:
Allows for depthwise conv3d with 3x3x3 kernel
Implementation based heavily off of mp8x25-neon-per-channel.c (depthwise conv2d with 5x5 kernel)
This supports per-channel convolution, but it works for non per-channel too
Generated files (caffe2/aten/src/ATen/native/quantized/cpu/qnnpack/wrappers/q8dwconv/*) made with
Differential Revision: D32074096
NOTE FOR REVIEWERS: This PR has internal Facebook specific changes or comments, please review them on Phabricator!