Pull requests: microsoft/onnxruntime
Pull requests list
[webgpu] move comments out from WGSL in FlashAttention impl
#24400
opened Apr 12, 2025 by
fs-eire
[Native WebGPU] Support shared memory version of ReduceOps
ep:WebGPU
ort-web webgpu provider
#24399
opened Apr 11, 2025 by
satyajandhyala
[DML EP] Support in-memory external data TensorProto
#24391
opened Apr 11, 2025 by
huningxin
[webgpu] Supports batch and zero points in MatMulNBits WideTileProgram
ep:WebGPU
ort-web webgpu provider
Add Resize cubic mode without antialias (scales = [1, ≥1, ≥1, 1])
#24385
opened Apr 10, 2025 by
yihonglyu
[WIP] Support export of Llama with DynamicCache and transformers>=4.51
#24379
opened Apr 10, 2025 by
xadupre
Fix MatmulTransposeFusion when input A and B are the same
#24373
opened Apr 10, 2025 by
fs-eire
Implement experimental intermediate cross CPU EP allocation
#24371
opened Apr 9, 2025 by
yuslepukhin
Draft
[ort-build] Pass ORT_EXTRA_INTERFACE_FLAGS to onnxruntime_session
#24368
opened Apr 9, 2025 by
karim-vad
Fix Windows_CI_GPU_DML_Dev_x86 and Windows_CI_GPU_DML_Dev_arm64 pipeline steps
#24365
opened Apr 9, 2025 by
amarin16
Enable SME for sgemm and sbgemm through KleidiAI
#24346
opened Apr 8, 2025 by
MichaelTylerArm