Skip to content

Fill CUDA EP opset gaps for Round and Equal operators#27754

Merged
tianleiwu merged 3 commits into
mainfrom
copilot/update-round-equal-onnx-operators
May 6, 2026
Merged

Fill CUDA EP opset gaps for Round and Equal operators#27754
tianleiwu merged 3 commits into
mainfrom
copilot/update-round-equal-onnx-operators

Conversation

Copy link
Copy Markdown
Contributor

Copilot AI commented Mar 19, 2026

Description

Caps existing non-versioned CUDA kernel registrations and adds new registrations at the latest ONNX opset:

  • Round: opset 11 (non-versioned) → versioned 11–21 + new opset 22
  • Equal: opset 13 (non-versioned) → versioned 13–18 + new opset 19

Changes across three files:

  • unary_elementwise_ops.ccUNARY_OP_HFD(Round, 11)UNARY_OP_VERSIONED_HFD + UNARY_OP_HFD
  • binary_elementwise_ops.ccBINARY_LOGICALOP_REGISTER_UZILHFD(Equal, 13) → versioned 13–18 + new 19 (same for bool typed registration)
  • cuda_execution_provider.cc — corresponding forward declarations and BuildKernelCreateInfo entries

No type changes; both operators retain their existing CUDA type support at the new opsets.

Motivation and Context

Tracks with the ongoing effort to close ONNX opset coverage gaps in the CUDA execution provider (#27729). Without these registrations, models targeting opset 19+ (Equal) or 22+ (Round) fall back from CUDA to CPU.

Round: version 11 → versioned 11-21 + new opset 22
Equal: version 13 → versioned 13-18 + new opset 19

Co-authored-by: tianleiwu <30328909+tianleiwu@users.noreply.github.com>
@tianleiwu tianleiwu marked this pull request as ready for review May 5, 2026 20:09
@tianleiwu tianleiwu enabled auto-merge (squash) May 6, 2026 05:16
@tianleiwu tianleiwu merged commit 673c332 into main May 6, 2026
99 of 101 checks passed
@tianleiwu tianleiwu deleted the copilot/update-round-equal-onnx-operators branch May 6, 2026 06:19
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants