[CoreML EP] Add Where and And builders by maxwbuckley · Pull Request #28597 · microsoft/onnxruntime

maxwbuckley · 2026-05-20T17:56:19Z

Summary

Two new ML Program op builders, both produced by transformer attention-mask
graphs:

Where → ML Program select. WhereOpBuilder gates the X/Y branches to
float / float16 and requires cond to be bool.
And → ML Program logical_and, via a new LogicalOpBuilder. Inputs must
be bool.

Both are ML-Program-only; IsOpSupportedImpl rejects them on the NeuralNetwork
format so such nodes fall back to CPU.

Depends on the bool-Cast PR

And's inputs and output are all bool, and a CoreML partition cannot have bool
I/O, so a meaningful And test sandwiches it between int ↔ bool casts (the bool
stays internal). This branch is therefore stacked on coreml-cast-bool — the
cb43b7c75f commit in this PR is the bool-Cast PR and will drop from this diff
once that one merges (via git merge main). Where needs no such scaffolding:
its cond can be a constant initializer and X/Y/output are float.

Tests (`coreml_basic_test.cc`)

Where_MLProgram — Where with a constant bool cond runs on CoreML, matches CPU.
WhereNeuralNetworkNotSupported — Where falls back on the NeuralNetwork format.
WhereNonFloatBranchesNotSupported — an int32 Where falls back to CPU.
And_MLProgram — a Cast → And → Cast chain runs fully on CoreML, matches CPU.
AndNeuralNetworkNotSupported — the chain falls back on the NeuralNetwork format.

Doc: coreml_supported_mlprogram_ops.md lists And and Where.

Series — CoreML EP coverage for transformer / diffusion graphs

[CoreML EP] Support bool Cast in ML Program #28595 — Support bool Cast in ML Program (prerequisite)
[CoreML EP] Add Sin and Cos unary ops #28596 — Add Sin and Cos unary ops (independent)
[CoreML EP] Add Where and And builders #28597 — Add Where and And builders (this PR — depends on [CoreML EP] Support bool Cast in ML Program #28595)
[CoreML EP] Add GatherND builder #28598 — Add GatherND builder (depends on [CoreML EP] Support bool Cast in ML Program #28595)

Together with #28278 (scalar-Gather), the series takes BERT / GPT-2 / ViT /
diffusion-UNet graphs — tiny and full-size — from 2 CoreML partitions to 1, with
zero graph breaks.

Two changes to the ML Program Cast builder: 1. Accept BOOL as a source and target dtype in HasSupportedInputsImpl. The ML Program `cast` op already handles bool, and AddToModelBuilderImpl already maps `to == BOOL`; only the input/output type gate omitted it. This lets int64<->bool<->float casts (transformer attention-mask graphs) stay on CoreML. 2. Move the "no preceding node" check after the ML Program early-return. It was legacy gating for the NeuralNetwork ArgMax-only path (which dereferences InputEdgesBegin()); on the ML Program path a Cast fed directly by a graph input is fine, and rejecting it forced needless CPU fallback. Tests (coreml_basic_test.cc): - CastBoolRoundTrip_MLProgram: an int64->bool->float cast chain runs fully on CoreML and matches the CPU reference. The bool tensor is internal (a CoreML partition cannot have bool I/O) and the first Cast is graph-input fed. - CastNonArgMaxNeuralNetworkNotSupported: the same chain falls back to CPU on the NeuralNetwork format, guarding the IsOpSupportedImpl reordering. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

Two new ML Program op builders, both produced by transformer attention-mask graphs: - Where -> ML Program `select`. WhereOpBuilder gates X/Y to float / float16 (CoreML `select` operates on float branches) and requires the `cond` input to be bool. - And -> ML Program `logical_and`, via a new LogicalOpBuilder. Inputs must be bool. Both are ML-Program-only; IsOpSupportedImpl rejects them on the NeuralNetwork format so such nodes fall back to CPU. Stacked on the bool-Cast branch: `And`'s inputs and output are bool, and a CoreML partition cannot have bool I/O, so a meaningful `And` test sandwiches it between int<->bool casts (the bool stays internal). `Where` needs no such scaffolding -- its `cond` can be a constant initializer and X/Y/output are float. Tests (coreml_basic_test.cc): - Where_MLProgram: Where with a constant bool cond runs on CoreML, matches CPU. - WhereNeuralNetworkNotSupported: Where falls back on the NeuralNetwork format. - WhereNonFloatBranchesNotSupported: an int32 Where falls back to CPU. - And_MLProgram: a Cast->And->Cast chain runs fully on CoreML, matches CPU. - AndNeuralNetworkNotSupported: the chain falls back on the NeuralNetwork format. Doc: coreml_supported_mlprogram_ops.md lists And and Where. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

CastBoolRoundTrip_MLProgram exercised int64 -> Cast(bool) -> Cast(float). CoreML's compiler fuses the two back-to-back `cast` ops and drops the bool clamp (cast(cast(x,bool),fp32) collapses to cast(x,fp32)), so the round-trip produces the raw input value instead of 0/1 -- the test can't be numerically verified standalone. The bool-Cast support itself is correct: it is exercised end to end by the dependent PRs, where a non-Cast op sits between the int<->bool casts so no fusion occurs -- Cast->And->Cast (Where/And PR) and Cast->GatherND->Cast (GatherND PR), both numerically verified against the CPU EP. CastNonArgMaxNeuralNetworkNotSupported (the NeuralNetwork-format negative test) is kept; it guards the IsOpSupportedImpl reordering. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

# Conflicts: # onnxruntime/core/providers/coreml/builders/op_builder_factory.h

maxwbuckley and others added 2 commits May 20, 2026 16:18

This was referenced May 20, 2026

[CoreML EP] Support bool Cast in ML Program #28595

Open

[CoreML EP] Add Sin and Cos unary ops #28596

Draft

[CoreML EP] Add GatherND builder #28598

Draft

maxwbuckley and others added 4 commits May 21, 2026 09:34

Merge coreml-cast-bool: drop standalone bool-Cast round-trip test

22a08f7

Merge remote-tracking branch 'origin/main' into coreml-cast-bool

56ce3ca

Merge branch 'coreml-cast-bool' into coreml-where-and

33a3df5

# Conflicts: # onnxruntime/core/providers/coreml/builders/op_builder_factory.h

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[CoreML EP] Add Where and And builders#28597

[CoreML EP] Add Where and And builders#28597
maxwbuckley wants to merge 6 commits into
microsoft:mainfrom
maxwbuckley:coreml-where-and

maxwbuckley commented May 20, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

maxwbuckley commented May 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Depends on the bool-Cast PR

Tests (coreml_basic_test.cc)

Series — CoreML EP coverage for transformer / diffusion graphs

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

maxwbuckley commented May 20, 2026 •

edited

Loading

Tests (`coreml_basic_test.cc`)