Update ORT to handle explicit OpSchemaRegisterOnce API in ONNX >= 1.18.0 for fluent chaining #24561

titaiwangms · 2025-04-25T23:17:21Z

ORT was leveraging on implicit api ONNX_NAMESPACE::OpSchemaRegistry::OpSchemaRegisterOnce https://github.com/microsoft/onnxruntime/blob/fe97f8b608bbd9131a8de25b0d6b57a9ee30c388/onnxruntime/core/graph/contrib_ops/contrib_defs.h#L39C10-L39C64 to do fluent chaining style as following

onnxruntime/onnxruntime/core/graph/contrib_ops/collective_defs.cc

Lines 17 to 28 in fe97f8b

    
           ONNX_CONTRIB_OPERATOR_SCHEMA(AllReduce) 
        
               .SetDomain(kMSDomain) 
        
               .SinceVersion(1) 
        
               .Input(0, "input", "tensors to be reduced", "T", OpSchema::Variadic) 
        
               .Output(0, "output", "reduced tensors", "T", OpSchema::Variadic) 
        
               .TypeConstraint( 
        
                   "T", 
        
                   {"tensor(float16)", "tensor(float)", "tensor(double)"}, 
        
                   "Constrain to float, float16 and double tensors.") 
        
               .TypeAndShapeInferenceFunction([](ONNX_NAMESPACE::InferenceContext& ctx) { 
        
                 propagateShapeAndTypeFromFirstInput(ctx); 
        
               });

However, the api ONNX_NAMESPACE::OpSchemaRegistry::OpSchemaRegisterOnce will be explicit starting from ONNX==1.18.0.

The PR: onnx/onnx#6378
The change: https://github.com/onnx/onnx/blob/9e379bdd51be054c89b29386f17a2fd731ce190f/onnx/defs/schema.h#L984

It's currently avoid from cmake/patches/onnx.patch within #24449

The text was updated successfully, but these errors were encountered:

### Description  The PR adds CPU support by following release logics in https://github.com/onnx/onnx/wiki/Logistics-for-ONNX-Release-1.18.0. The goal is to do the minimal changes needed to ensure ONNXRUNTIME works fine with ONNX 1.18.0 ### Motivation and Context  Essentially, incoming ONNX 1.18.0 provides the following (1) Introduce opset 23 (included in this PR) (2) Support Attention, RMSNormalization, and RotaryEmbedding (**NOT** included in this PR) (3) Support float4e2m1 (**NOT** included in this PR) ### Remaining Issues 1. onnx.patch * ONNXRUNTIME is using static functions (shape inference) from ONNX (#24558) * GroupNormalization-18 is deprecated because its spec was wrong (#24560) * Contrib op registration api from ONNX: OpSchemaRegisterOnce is changed to explicit, and ONNXRUNTIME was leveraging it to do fluent-chaining style. (#24561) 2. Support float4e2m1 (#24553) 3. Support Attention(#24554), RMSNormalization(#24555), and RotaryEmbedding(#24556) 4. Disable QNN tests

### Description  The PR adds CPU support by following release logics in https://github.com/onnx/onnx/wiki/Logistics-for-ONNX-Release-1.18.0. The goal is to do the minimal changes needed to ensure ONNXRUNTIME works fine with ONNX 1.18.0 ### Motivation and Context  Essentially, incoming ONNX 1.18.0 provides the following (1) Introduce opset 23 (included in this PR) (2) Support Attention, RMSNormalization, and RotaryEmbedding (**NOT** included in this PR) (3) Support float4e2m1 (**NOT** included in this PR) ### Remaining Issues 1. onnx.patch * ONNXRUNTIME is using static functions (shape inference) from ONNX (microsoft#24558) * GroupNormalization-18 is deprecated because its spec was wrong (microsoft#24560) * Contrib op registration api from ONNX: OpSchemaRegisterOnce is changed to explicit, and ONNXRUNTIME was leveraging it to do fluent-chaining style. (microsoft#24561) 2. Support float4e2m1 (microsoft#24553) 3. Support Attention(microsoft#24554), RMSNormalization(microsoft#24555), and RotaryEmbedding(microsoft#24556) 4. Disable QNN tests

titaiwangms mentioned this issue Apr 25, 2025

Integration with ONNX rel-1.18.0 #24449

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update ORT to handle explicit OpSchemaRegisterOnce API in ONNX >= 1.18.0 for fluent chaining #24561

Update ORT to handle explicit OpSchemaRegisterOnce API in ONNX >= 1.18.0 for fluent chaining #24561

titaiwangms commented Apr 25, 2025

Update ORT to handle explicit OpSchemaRegisterOnce API in ONNX >= 1.18.0 for fluent chaining #24561

Update ORT to handle explicit OpSchemaRegisterOnce API in ONNX >= 1.18.0 for fluent chaining #24561

Comments

titaiwangms commented Apr 25, 2025