microsoft / onnxruntime Public

Pull requests: microsoft/onnxruntime

572 Open 15,628 Closed

Pull requests list

Support mixed precision in quantization for RTN

#24401 opened Apr 12, 2025 by jiafatom

[webgpu] move comments out from WGSL in FlashAttention impl

#24400 opened Apr 12, 2025 by fs-eire

[Native WebGPU] Support shared memory version of ReduceOps

Label: ep:WebGPU (ort-web webgpu provider)

#24399 opened Apr 11, 2025 by satyajandhyala

Add LSX support for S8S8 and S8U8 GEMM kernels

#24397 opened Apr 11, 2025 by wszqkzqk

ORT-OVEP Doc update

#24395 opened Apr 11, 2025 by preetha-intel

ONNXRuntime OpenVINO - Release 1.22

#24394 opened Apr 11, 2025 by preetha-intel

No float from chars on gcc9

#24393 opened Apr 11, 2025 by BengtGustafsson

Replace gsl::narrow with narrow in xnnpack code

#24392 opened Apr 11, 2025 by cdliang11

[DML EP] Support in-memory external data TensorProto

#24391 opened Apr 11, 2025 by huningxin

[webgpu] Supports batch and zero points in MatMulNBits WideTileProgram

Label: ep:WebGPU (ort-web webgpu provider)

#24390 opened Apr 11, 2025 by daijh • Draft

[MacOS] Add MLProgram Gather op for CoreML EP

#24387 opened Apr 10, 2025 by carzh

Add Resize cubic mode without antialias (scales = [1, ≥1, ≥1, 1])

#24385 opened Apr 10, 2025 by yihonglyu

[CPU] Add 8bit support to matmulnbits quantizer

#24384 opened Apr 10, 2025 by fajin-corp

Update whisper transformer module to 4.48.0

#24382 opened Apr 10, 2025 by jchen351 • Draft

[WIP] Support export of Llama with DynamicCache and transformers>=4.51

#24379 opened Apr 10, 2025 by xadupre

Fix MatmulTransposeFusion when input A and B are the same

#24373 opened Apr 10, 2025 by fs-eire

Implement experimental intermediate cross CPU EP allocation

#24371 opened Apr 9, 2025 by yuslepukhin • Draft

[ort-build] Pass ORT_EXTRA_INTERFACE_FLAGS to onnxruntime_session

#24368 opened Apr 9, 2025 by karim-vad

[nodejs] support Node.js binding in multi env

#24366 opened Apr 9, 2025 by fs-eire

Fix Windows_CI_GPU_DML_Dev_x86 and Windows_CI_GPU_DML_Dev_arm64 pipeline steps

#24365 opened Apr 9, 2025 by amarin16

[VitisAI] enable weights sharing

#24359 opened Apr 9, 2025 by mingyueliuh • Draft

[WebGPU EP] Add EINSUM implementation

Label: ep:WebGPU (ort-web webgpu provider)

#24358 opened Apr 9, 2025 by feich-ms • Draft

Enable SME for sgemm and sbgemm through KleidiAI

#24346 opened Apr 8, 2025 by MichaelTylerArm

Add GQA fusion for CUDA EP

#24335 opened Apr 7, 2025 by nenad1002

Update protobuf-java to 3.25.5

#24333 opened Apr 7, 2025 by jchen351

1 of 2 tasks
