
Conversation

@adrianlizarraga (Contributor) commented Apr 29, 2024

Description

  • Adds support for float32/float16 HardSigmoid on the HTP backend. Decomposes `HardSigmoid(X)` into `max(0, min(1, alpha * X + beta))`.
  • Fuses the sequence `X * HardSigmoid<alpha=1/6, beta=0.5>(X)` into a single `HardSwish(X)`. Only applies to non-quantized HardSigmoid/Mul. A numerical sketch of this equivalence follows below.
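
For illustration only, here is a small NumPy sketch of the decomposition and of the fusion's numerical equivalence. It is not the EP's actual implementation; the helper names `hard_sigmoid` and `hard_swish` are hypothetical.

```python
import numpy as np

def hard_sigmoid(x, alpha=1.0 / 6.0, beta=0.5):
    # HardSigmoid decomposed as max(0, min(1, alpha * X + beta)),
    # mirroring the decomposition described above.
    return np.maximum(0.0, np.minimum(1.0, alpha * x + beta))

def hard_swish(x):
    # Reference HardSwish: X * HardSigmoid<alpha=1/6, beta=0.5>(X).
    return x * hard_sigmoid(x, alpha=1.0 / 6.0, beta=0.5)

x = np.linspace(-4.0, 4.0, 9, dtype=np.float32)
unfused = x * hard_sigmoid(x)  # Mul(X, HardSigmoid(X)) sequence before fusion
fused = hard_swish(x)          # single HardSwish node after fusion
assert np.allclose(unfused, fused)
```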

Motivation and Context

QNN does not natively support HardSigmoid. These changes expand model support on QNN EP.

@jywu-msft added the ep:QNN (issues related to QNN execution provider) label Apr 30, 2024
@adrianlizarraga changed the title from [QNN EP] Fuse HardSigmoid sequence to HardSwish to [QNN EP] Support HardSigmoid May 1, 2024
@adrianlizarraga marked this pull request as ready for review May 1, 2024 05:46
jywu-msft previously approved these changes May 1, 2024
@adrianlizarraga requested a review from jywu-msft May 2, 2024 07:56
@HectorSVC (Contributor) left a comment

:shipit:

@adrianlizarraga merged commit 7211eab into main May 2, 2024
@adrianlizarraga deleted the adrianl/qnn-hardsigmoid-to-hardswish-fusion branch May 2, 2024 22:36
@sophies927 added the triage:approved (Approved for cherrypicks for release) label May 3, 2024
@yihonglyu added the cherry-picked (Cherry-picked for a cherrypicks branch) label May 4, 2024
yihonglyu pushed a commit that referenced this pull request May 4, 2024
TedThemistokleous pushed a commit to TedThemistokleous/onnxruntime that referenced this pull request May 7, 2024
@yihonglyu added the rel-merged (Cherrypicks merged into release) label May 8, 2024

Labels

  • cherry-picked: Cherry-picked for a cherrypicks branch
  • ep:QNN: issues related to QNN execution provider
  • rel-merged: Cherrypicks merged into release
  • release:1.18.0
  • triage:approved: Approved for cherrypicks for release

6 participants