[QNNPACK, Sparsity] Sparse kernel with 4x8 blocking #50590

Closed

wants to merge 18 commits
Conversation

@kimishpatel (Contributor) commented on Jan 15, 2021

Stack from ghstack:

Summary:
The larger blocking across the M dimension (mr = 8 in the previous PR) likely
introduces wasted compute on the shapes being benchmarked. Here we introduce
4x8 (mr x nr) blocking. This helps because 1) it packs less data for small
values of M, and 2) the compute kernel writes the same number of bytes but
more contiguously. The second effect is not certain, but it likely helps.
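
For intuition, below is a minimal dense reference sketch of how an
mr x nr = 4x8 output tiling is structured. This is an illustration only, not
the PR's kernel: the actual micro-kernel operates on quantized uint8 data with
a block-sparse weight format, and every name here is hypothetical.

```c
#include <stddef.h>
#include <stdint.h>

#define MR 4 /* rows of the output tile: blocking across M */
#define NR 8 /* columns of the output tile: blocking across N */

/* Dense reference of an MR x NR tiled GEMM: C (m x n) = A (m x k) * B (k x n),
 * all row-major. */
static void gemm_tiled_4x8(size_t m, size_t n, size_t k,
                           const int8_t* a, const int8_t* b, int32_t* c) {
  for (size_t mb = 0; mb < m; mb += MR) {
    const size_t mr = (m - mb < MR) ? (m - mb) : MR; /* M % 4 edge tile */
    for (size_t nb = 0; nb < n; nb += NR) {
      const size_t nr = (n - nb < NR) ? (n - nb) : NR; /* N % 8 edge tile */
      /* One micro-kernel invocation fills an mr x nr tile of C. With mr = 4
       * the tile spans fewer rows than an 8-wide M block, so less of A must
       * be packed when M is small; and because each tile row is 8 outputs
       * wide rather than 4, the same total bytes are written in longer
       * contiguous runs. */
      for (size_t i = 0; i < mr; i++) {
        for (size_t j = 0; j < nr; j++) {
          int32_t acc = 0;
          for (size_t p = 0; p < k; p++) {
            acc += (int32_t)a[(mb + i) * k + p] * (int32_t)b[p * n + nb + j];
          }
          c[(mb + i) * n + nb + j] = acc;
        }
      }
    }
  }
}
```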

Test Plan:
q8gemm-sparse-test
fully-connected-sparse-test

Reviewers:

Subscribers:

Tasks:

Tags:

Differential Revision: [D25925499](https://our.internmc.facebook.com/intern/diff/D25925499)

@facebook-github-bot (Contributor) commented:

This pull request has been merged in 70830b5.

@facebook-github-bot facebook-github-bot deleted the gh/kimishpatel/39/head branch February 9, 2021 15:14