[CoreML EP] Implement Unary & Reduce operators by ShukantPal · Pull Request #15532 · microsoft/onnxruntime

ShukantPal · 2023-04-16T18:21:19Z

Description

This change is a follow-up to #15327. It adds Unary operators (Sqrt, Reciprocal) and Reduce operators (ReduceSum, ReduceMean). I've tried to follow existing patterns in the code :-)

Motivation and Context

This reduces fragmentation across EPs when using CoreML on macOS, thereby speeding up execution.

## Description Implements support for LeakyReLU in ActivationOpBuilder for CoreML's EP. This speeds up inference on macOS significantly for models using LeakyReLU.

…duceMean ## Description Implements support for mentioned operators in ActivationOpBuilder for CoreML's EP.

onnxruntime/core/providers/coreml/builders/helper.cc

edgchen1

Thanks for your contribution!

onnxruntime/core/providers/coreml/builders/impl/reduction_op_builder.cc

onnxruntime/core/providers/coreml/builders/impl/unary_op_builder.cc

onnxruntime/core/providers/coreml/builders/op_builder_factory.cc

ShukantPal · 2023-04-23T02:43:47Z

Thanks for your review @edgchen1 ! I've followed up on all of your comments/suggestions. Please let me know how it looks now.

onnxruntime/core/providers/coreml/builders/impl/reduction_op_builder.cc

edgchen1 · 2023-04-25T18:14:40Z

/azp run MacOS CI Pipeline

azure-pipelines · 2023-04-25T18:14:51Z

Azure Pipelines successfully started running 1 pipeline(s).

…uilder.cc Co-authored-by: Edward Chen <18449977+edgchen1@users.noreply.github.com>

natke · 2023-04-26T01:10:45Z

Hi @ShukantPal, thank you for this contribution! We're interested in learning more about your use case. Can you tell us a little bit about how you are using ONNX Runtime? Or if you would prefer you can reach me at nakersha@microsoft.com

onnxruntime/core/providers/coreml/builders/impl/reduction_op_builder.cc

onnxruntime/core/providers/coreml/builders/impl/unary_op_builder.cc

ShukantPal · 2023-04-26T01:20:21Z

Hi @natke, I'm developing a goofy macOS virtual camera that uses different video filters like FaceMesh, CenterFace, DFL, etc. To get real-time frame rates, executing on CoreML / ANE is necessary on M1 MacBooks. I wanted to keep my application cross-platform to run on Intel Macs (with discrete GPUs) and Windows in the future. That's why I chose ONNX runtime, but having full CoreML support is still necessary for satisfactory performance on M1.

natke · 2023-04-26T01:30:33Z

Sounds pretty awesome. Is it published anywhere? I'd love to see a demo

ShukantPal · 2023-04-26T01:33:17Z

@natke Haha, not yet − I've been working on it. Can send you a beta build when ready :-)

azure-pipelines · 2023-05-19T06:44:20Z

Azure Pipelines successfully started running 5 pipeline(s).

azure-pipelines · 2023-05-19T06:44:33Z

Azure Pipelines successfully started running 10 pipeline(s).

… the old ones

ShukantPal · 2023-05-20T19:03:56Z

I ran the linter on Windows and fixed the linting issues.
It seems like TensorRT, OpenVINO, and DNN don't support ReduceMean opset 18 with axes input, so I disabled those providers in the new tests (copying that from similar tests for ReduceMean).

But I need help understanding why the test_layer_normalization* tests are failing on CoreML/macOS. The ReduceMean op-tests pass, but still there's some mismatch with downstream "Reshape" layers in the test models.

I don't have this issue when testing the ReduceMean layers in my own models.

tools/ci_build/github/apple/coreml_supported_ops.md

ShukantPal · 2023-05-20T19:30:23Z

@skottmckay

Given the testing complications, I've commented out the line registering ReduceMean. Hopefully, rest of the PR provides enough value for it to be merged :-)

skottmckay · 2023-05-22T09:27:41Z

Can you clarify which tests are failing?

I pulled your changes, uncommented the ReduceMean line in onnxruntime/core/providers/coreml/builders/op_builder_factory.cc and did a build on a mac with CoreML enabled and the unit tests passed.

skottmckay · 2023-05-22T09:28:50Z

/azp run Windows CPU CI Pipeline,Windows GPU CI Pipeline,Windows GPU TensorRT CI Pipeline,ONNX Runtime Web CI Pipeline,Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline,Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline,Linux OpenVINO CI Pipeline,MacOS CI Pipeline

skottmckay · 2023-05-22T09:28:52Z

/azp run orttraining-amd-gpu-ci-pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,orttraining-ortmodule-distributed,onnxruntime-python-checks-ci-pipeline,onnxruntime-binary-size-checks-ci-pipeline

azure-pipelines · 2023-05-22T09:29:15Z

Azure Pipelines successfully started running 5 pipeline(s).

azure-pipelines · 2023-05-22T09:29:33Z

Azure Pipelines successfully started running 10 pipeline(s).

ShukantPal · 2023-05-22T20:26:02Z

@skottmckay Here were the failing tests: https://dev.azure.com/onnxruntime/onnxruntime/_build/results?buildId=1012534&view=logs&j=07136875-e4b3-5a15-ff50-026d13f57d01&t=b16e8f5d-fb27-5dd3-278d-cb1fb09fb81a

They come from the --build_wheel option.

skottmckay · 2023-05-22T23:14:17Z

Definitely want to get it added, but I think we need to figure out the root cause given we're calling AddReductionParams the same way for both ReduceMean and ReduceSum, so if there's an issue I would expect it potentially applies to both operator types and it's possibly that there's no test showing the issue for ReduceSum.

The failing tests are ONNX test cases. We create a binary called onnx_test_runner that can be used to execute/debug. The ONNX tests are in ./cmake/external/onnx/onnx/backend/test/data/node. Specify the EP with '-e'. CPU passes, CoreML fails.

e.g. ./build/MacOS/Debug/onnx_test_runner -e coreml ./cmake/external/onnx/onnx/backend/test/data/node/test_layer_normalization_2d_axis0_expanded

I don't think this is the root cause, but the last param for the CoreML parms is reduceAll which might not be 1:1 with noop_with_empty_axes.

https://apple.github.io/coremltools/mlmodel/Format/NeuralNetwork.html#reducemeanlayerparams

CoreML reduceAll says to ignore the axes parameter and reduce all.

So if axes is empty, reduceAll is true, but if noop_with_empty_axes is also true the ONNX spec says do nothing which doesn't seem to have an equivalent in CoreML. In that case we could probably drop the node in CoreML or insert an Identity node to do the value rename given it's turned into a no-op.

skottmckay · 2023-05-23T00:13:17Z

Sorry - I overlooked the early exit for noop_with_empty_axes. It would be good to clarify the implementation in ReductionOpBuilder::AddToModelBuilderImpl though and set a reduceAll value instead of passing noop_with_empty_axes into the AddReductionParms call.

skottmckay · 2023-05-23T01:32:06Z

Might be an issue with Flatten. There's not a lot of places in this particular test where this combination of shapes is possible to get wrong. One path is the Flatten of the model input, which should result in (1, 4) not (4, 1)

2023-05-19T06:58:33.0585720Z ERROR: test_layer_normalization_2d_axis1_expanded_cpu

2023-05-19T06:58:33.0593450Z onnxruntime.capi.onnxruntime_pybind11_state.Fail: [ONNXRuntimeError] : 1 : FAIL : Error compiling model compiler error: Espresso exception: "Invalid blob shape": generic_elementwise_kernel: cannot broadcast:
2023-05-19T06:58:33.0593910Z (3, 4)
2023-05-19T06:58:33.0594210Z (4, 1)

I ran a couple of tools to a) add shape info to all the nodes, and b) run the 'basic' level optimizations like constant folding so we can see what nodes are around when the CoreML EP is creating the model.

python -m onnxruntime.tools.symbolic_shape_infer --input D:\src\github\ort\cmake\external\onnx\onnx\backend\test\data\node\test_layer_normalization_2d_axis1_expanded\model.onnx --output D:\src\github\ort\cmake\external\onnx\onnx\backend\test\data\node\test_layer_normalization_2d_axis1_expanded\model.ssi.onnx
python -m onnxruntime.tools.optimize_onnx_model --opt_level basic D:\src\github\ort\cmake\external\onnx\onnx\backend\test\data\node\test_layer_normalization_2d_axis1_expanded\model.ssi.onnx D:\src\github\ort\cmake\external\onnx\onnx\backend\test\data\node\test_layer_normalization_2d_axis1_expanded\model.ssi.basic.onnx

If I comment out Flatten in op_builder_factory.cc the tests pass.

skottmckay · 2023-05-23T01:45:06Z

And this is the issue:

onnxruntime/onnxruntime/core/providers/coreml/builders/impl/flatten_op_builder.cc

Line 44 in 684e900

const int64_t axis = helper.Get("axis ", 1);

Rogue space in the attribute name so it wasn't reading the actual value from the node of 0.

I'll add a test and create a separate PR for that fix, but please test out your changes with just the fix ("axis " -> "axis") so we can check all CIs pass with that.

skottmckay · 2023-05-23T06:49:47Z

Should be addressed by #16046.

ShukantPal · 2023-05-23T12:46:34Z

I can confirm that patching the rogue space in Flatten fixes this! Amazing.

skottmckay · 2023-05-23T23:51:39Z

/azp run Windows CPU CI Pipeline,Windows GPU CI Pipeline,Windows GPU TensorRT CI Pipeline,ONNX Runtime Web CI Pipeline,Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline,Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline,Linux OpenVINO CI Pipeline,MacOS CI Pipeline

skottmckay · 2023-05-23T23:51:42Z

/azp run orttraining-amd-gpu-ci-pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,orttraining-ortmodule-distributed,onnxruntime-python-checks-ci-pipeline,onnxruntime-binary-size-checks-ci-pipeline

azure-pipelines · 2023-05-23T23:52:05Z

Azure Pipelines successfully started running 5 pipeline(s).

azure-pipelines · 2023-05-23T23:52:20Z

Azure Pipelines successfully started running 10 pipeline(s).

skottmckay · 2023-05-24T05:23:18Z

/azp run Windows ARM64 QNN CI Pipeline,Linux QNN CI Pipeline

azure-pipelines · 2023-05-24T05:23:29Z

Azure Pipelines successfully started running 2 pipeline(s).

ShukantPal and others added 7 commits April 2, 2023 15:10

[CoreML EP] Add support for LeakyReLU activation layers

3d12f5d

## Description Implements support for LeakyReLU in ActivationOpBuilder for CoreML's EP. This speeds up inference on macOS significantly for models using LeakyReLU.

Follow up on https://github.com/YUNQIUGUO's suggestion

198de7c

Merge branch 'microsoft:main' into main

f8de047

[CoreML EP] Add support for Mul, Pow, Sqrt, Reciprocal, ReduceSum, Re…

fc0cb0e

…duceMean ## Description Implements support for mentioned operators in ActivationOpBuilder for CoreML's EP.

Merge remote-tracking branch 'origin/main'

1605d47

hMerge branch 'main' of https://github.com/ShukantPal/onnxruntime

b425c69

Merge branch 'microsoft:main' into main

b365684

ShukantPal commented Apr 17, 2023

View reviewed changes

onnxruntime/core/providers/coreml/builders/helper.cc Outdated Show resolved Hide resolved

Update onnxruntime/core/providers/coreml/builders/helper.cc

8d5f394

edgchen1 reviewed Apr 18, 2023

View reviewed changes

ShukantPal added 8 commits April 22, 2023 22:02

Merge branch 'microsoft:main' into main

9ffd91f

Merge remote-tracking branch 'origin/main'

94002ef

Merge branch 'main' of https://github.com/ShukantPal/onnxruntime

89335a1

Remove [[nodiscard]] from AddToModelBuilderImpl

2ad308c

Handle noop_with_empty_axes for ReduceSum and ReduceMean

c118612

Handle latest opset for reduction, with axes in constant initializers

d81db0c

Remove [[nodiscard]] in UnaryOpBuilder

833157b

Update documentation

da916e4

ShukantPal requested a review from edgchen1 April 23, 2023 02:43

edgchen1 reviewed Apr 25, 2023

View reviewed changes

ShukantPal and others added 2 commits April 25, 2023 15:48

Update onnxruntime/core/providers/coreml/builders/impl/reduction_op_b…

7f360bb

…uilder.cc Co-authored-by: Edward Chen <18449977+edgchen1@users.noreply.github.com>

Update onnxruntime/core/providers/coreml/builders/impl/reduction_op_b…

c4762e3

…uilder.cc Co-authored-by: Edward Chen <18449977+edgchen1@users.noreply.github.com>

github-advanced-security bot found potential problems Apr 26, 2023

View reviewed changes

onnxruntime/core/providers/coreml/builders/impl/reduction_op_builder.cc Fixed Show fixed Hide fixed

onnxruntime/core/providers/coreml/builders/impl/unary_op_builder.cc Fixed Show fixed Hide fixed

ShukantPal and others added 2 commits May 19, 2023 23:20

Supress DNNL, TensorRT, and OpenVINO EPs in new ReduceMean tests like…

8fc0172

… the old ones

Run lintrunner on Windows

44dbebd

Disable ReduceMean to pass tests

26da6f1

ShukantPal commented May 20, 2023

View reviewed changes

tools/ci_build/github/apple/coreml_supported_ops.md Outdated Show resolved Hide resolved

Update tools/ci_build/github/apple/coreml_supported_ops.md

87d720c

Patch flatten_op_builder axis retrieval and re-enable ReduceMean

51d2e35

skottmckay approved these changes May 24, 2023

View reviewed changes

skottmckay merged commit f316bc5 into microsoft:main May 24, 2023

Conversation

ShukantPal commented Apr 16, 2023

Description

Motivation and Context

Uh oh!

Uh oh!

edgchen1 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ShukantPal commented Apr 23, 2023

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

edgchen1 commented Apr 25, 2023

Uh oh!

azure-pipelines bot commented Apr 25, 2023

Uh oh!

natke commented Apr 26, 2023

Uh oh!

Uh oh!

Uh oh!

ShukantPal commented Apr 26, 2023

Uh oh!

natke commented Apr 26, 2023

Uh oh!

ShukantPal commented Apr 26, 2023

Uh oh!

azure-pipelines bot commented May 19, 2023

Uh oh!

azure-pipelines bot commented May 19, 2023

Uh oh!

ShukantPal commented May 20, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

ShukantPal commented May 20, 2023

Uh oh!

skottmckay commented May 22, 2023

Uh oh!

skottmckay commented May 22, 2023

Uh oh!

skottmckay commented May 22, 2023

Uh oh!

azure-pipelines bot commented May 22, 2023

Uh oh!

azure-pipelines bot commented May 22, 2023

Uh oh!

ShukantPal commented May 22, 2023

Uh oh!

skottmckay commented May 22, 2023

Uh oh!

skottmckay commented May 23, 2023

Uh oh!

skottmckay commented May 23, 2023

Uh oh!

skottmckay commented May 23, 2023

Uh oh!

skottmckay commented May 23, 2023

Uh oh!

ShukantPal commented May 23, 2023 via email

Uh oh!

skottmckay commented May 23, 2023

Uh oh!

skottmckay commented May 23, 2023

Uh oh!

azure-pipelines bot commented May 23, 2023

Uh oh!

azure-pipelines bot commented May 23, 2023

Uh oh!

skottmckay commented May 24, 2023

Uh oh!

ShukantPal commented May 20, 2023 •

edited

Loading