Skip to content

Pull requests: microsoft/onnxruntime

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[webgpu] Enable per-run control for graph capture
#25367 opened Jul 11, 2025 by qjia7 Loading… updated Jul 11, 2025
Enable CUDA Graph in nv_tensorrt_rtx EP
#25368 opened Jul 11, 2025 by umangb-09 Loading… updated Jul 11, 2025
Attention Operator (CPU) release:1.23.0
#25156 opened Jun 24, 2025 by xadupre Loading… updated Jul 11, 2025
Use env. allocators for initializers (#25108)
#25281 opened Jul 3, 2025 by AndreyOrb Loading… updated Jul 11, 2025
[webgpu] Apply template to MatMulNBitsWideTile
#25353 opened Jul 10, 2025 by daijh Loading… updated Jul 11, 2025
[EP ABI] Get EP compiled model compatibility
#25331 opened Jul 8, 2025 by adrianlizarraga Draft updated Jul 11, 2025
Add vendor id to OrtEpFactory
#25365 opened Jul 11, 2025 by skottmckay Loading… updated Jul 11, 2025
[CPU] GQA supports attention scores output
#25319 opened Jul 7, 2025 by derdeljan-msft Loading… updated Jul 11, 2025
Plugin EP data transfer and Stream support.
#25254 opened Jul 2, 2025 by skottmckay Loading… updated Jul 11, 2025
[JSEP] Fix inputShape index OOB in slice.ts
#25364 opened Jul 11, 2025 by jchen10 Loading… updated Jul 11, 2025
[EP ABI] Update to use Node_GetEpName
#25363 opened Jul 11, 2025 by chilo-ms Loading… updated Jul 11, 2025
Upgrade xnnpack to latest ep:Xnnpack issues related to XNNPACK EP
#25275 opened Jul 3, 2025 by fanchenkong1 Loading… updated Jul 11, 2025
[QNN EP] Add EP-aware Reshape handler for Transpose optimization. ep:QNN issues related to QNN exeution provider
#25344 opened Jul 9, 2025 by minfhong-quic Loading… updated Jul 11, 2025
[QNN_EP] Implement Efficient Mode API ep:QNN issues related to QNN exeution provider
#25146 opened Jun 24, 2025 by quic-calvnguy Loading… updated Jul 11, 2025
Fix SigLIP casual mask bug
#25360 opened Jul 10, 2025 by nenad1002 Draft updated Jul 11, 2025
KleidiAI SGEMM/IGEMM/Quantized MatMul - Modular MLAS API Changes for KleidiAI
#25187 opened Jun 26, 2025 by damdoo01-arm Loading… updated Jul 11, 2025
[CUDA] Update Flash Attention to support head_sink for smooth softmax in GQA
#25358 opened Jul 10, 2025 by tianleiwu Draft updated Jul 10, 2025
2 of 3 tasks
[MIGraphx EP] Sync AMD changes upstream
#25338 opened Jul 9, 2025 by TedThemistokleous Loading… updated Jul 10, 2025
[WebGPU EP] extend concat to handle large number of inputs ep:WebGPU ort-web webgpu provider
#25177 opened Jun 25, 2025 by prathikr Draft updated Jul 10, 2025
Add Compile API to set the location for the context binary file
#25356 opened Jul 10, 2025 by HectorSVC Loading… updated Jul 10, 2025
Remove arm 32 references api:CSharp issues related to the C# API
#25341 opened Jul 9, 2025 by ispysoftware Loading… updated Jul 10, 2025
[ARM CPU] SVE support for Elementwise kernels
#25238 opened Jul 1, 2025 by sanketkaleoss Loading… updated Jul 10, 2025
Add a new operator attribute type ORT_OP_ATTR_BYTES to the ORT C API
#25300 opened Jul 7, 2025 by wcy123 Loading… updated Jul 10, 2025
Convert Initializers to OrtValues Phase 2
#25320 opened Jul 8, 2025 by yuslepukhin Draft updated Jul 10, 2025
ProTip! Filter pull requests by the default branch with base:main.