-
Notifications
You must be signed in to change notification settings - Fork 3.1k
Insights: microsoft/onnxruntime
Overview
Could not load contribution data
Please try again later
15 Pull requests merged by 9 people
-
replace usage of gsl::narrow and gsl::narrow_cast in WebGPU EP
#23926 merged
Mar 7, 2025 -
Fix license in example test code.
#23936 merged
Mar 7, 2025 -
Create a packaging pipeline for a custom nuget package
#23918 merged
Mar 7, 2025 -
[AIX] External data handling
#23859 merged
Mar 7, 2025 -
Updated ov version in pipeline (#595)
#23882 merged
Mar 7, 2025 -
Fix ConvInteger handling of optional inputs.
#23935 merged
Mar 7, 2025 -
Updated run_CIs_for_external_pr.py to support the Windows OpenVINO CI pipeline
#23931 merged
Mar 7, 2025 -
fix binplace file in web pipeline
#23930 merged
Mar 7, 2025 -
Enabling L2+ Optimizations for EPs
#23517 merged
Mar 7, 2025 -
Example custom op with output type inferencing
#23916 merged
Mar 7, 2025 -
Support all block sizes that are multiples of 32 for DP4A
#23907 merged
Mar 7, 2025 -
Exclude MAUI projects from GPU C# packaging builds
#23923 merged
Mar 7, 2025 -
[WebGPU EP] SoftMax Implementation
#23538 merged
Mar 7, 2025 -
Adding OpenVINO Windows CI Pipeline
#23919 merged
Mar 7, 2025 -
enable WebGPU EP in WebAssembly build
#23913 merged
Mar 6, 2025
11 Pull requests opened by 10 people
-
Extend CMAKE_CUDA_FLAGS with all Blackwell compute capacity
#23928 opened
Mar 7, 2025 -
[WIP] DepthToSpace for WebGPU EP
#23929 opened
Mar 7, 2025 -
update transformer version to 4.48.0
#23932 opened
Mar 7, 2025 -
VCPKG improvement: set VCPKG_OSX_DEPLOYMENT_TARGET
#23933 opened
Mar 7, 2025 -
[Native WebGPU] Added ReduceMax and ReduceSum
#23934 opened
Mar 7, 2025 -
[js] Add API for accessing metadata of a model's input/output
#23937 opened
Mar 7, 2025 -
[Fix] Dependencies find_package Eigen error
#23939 opened
Mar 7, 2025 -
Add support for custom position ids and attention mask to GQA CPU operator
#23944 opened
Mar 7, 2025 -
Qnn weight sharing improvement
#23945 opened
Mar 7, 2025 -
Allow using a different version of flatbuffers when building with vcpkg
#23946 opened
Mar 7, 2025
2 Issues closed by 2 people
-
[Build] Released asset for v1.20.1 doesn't work on macOS Sequoia
#23922 closed
Mar 7, 2025 -
What's the right way to construct custom ops with the same name but different output types?
#23891 closed
Mar 7, 2025
7 Issues opened by 7 people
-
[Web] WASM sigmoid producing numbers below 0 or above 1
#23943 opened
Mar 7, 2025 -
Error when I use cuda_runtime.h and OpenVINO EP at the same time
#23941 opened
Mar 7, 2025 -
[Feature Request] Add more options to load models at InferenceSession constructor
#23940 opened
Mar 7, 2025 -
Bad Allocation Error in ONNX Runtime on Windows x86 CPU When Processing Multiple Images Sequentially
#23938 opened
Mar 7, 2025 -
ConvInteger segfaults when x_zero_point is the empty string
#23927 opened
Mar 7, 2025 -
[Feature Request] Multi-Head Latent Attention(DeepSeek) support on CPU/NPU
#23925 opened
Mar 6, 2025 -
[Web] Facing this error in WebGPU: Model warmup failed: Error: input 'detection' is missing in 'feeds'.
#23921 opened
Mar 6, 2025
26 Unresolved conversations
Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.
-
[WebGPU EP] Implements Gelu, BiasSplitGelu, and QuickGelu
#23920 commented on
Mar 7, 2025 • 11 new comments -
Whisper Redesigned Solution
#23549 commented on
Mar 6, 2025 • 8 new comments -
[webgpu] Optimize MatMulNBits for f16 Block32 prefill performance
#23908 commented on
Mar 7, 2025 • 7 new comments -
Integrate KleidiAI for MatMulNBits via MlasQNBitGemm
#23627 commented on
Mar 7, 2025 • 5 new comments -
[WebGPU] Direct CPU->GPU buffer upload for UMA
#23910 commented on
Mar 7, 2025 • 2 new comments -
[VitisAI EP] export InferShapes to VitisAIEP
#23881 commented on
Mar 7, 2025 • 0 new comments -
Synchronize patch files, fix resource compiler invocations in some situations
#23855 commented on
Mar 6, 2025 • 0 new comments -
[OpenVINO]Session Options Appended After AppendExecutionProvider
#23852 commented on
Mar 7, 2025 • 0 new comments -
[WebNN] Better int64 integration
#23831 commented on
Mar 7, 2025 • 0 new comments -
Make Cuda packaging pipeline 1ES compliant
#23806 commented on
Mar 7, 2025 • 0 new comments -
[WIP] migrate WebGPU EP to WebAssembly to replace JSEP
#23697 commented on
Mar 6, 2025 • 0 new comments -
Migrate Zip-Nuget Package Pipeline to 1ES
#23609 commented on
Mar 7, 2025 • 0 new comments -
[WebGPU/JSEP] Support group query attention do_rotary attribute
#23524 commented on
Mar 7, 2025 • 0 new comments -
[WebNN EP] Support GroupQueryAttention(GQA)
#23416 commented on
Mar 7, 2025 • 0 new comments -
[VitisAI] Add vaip Integration Using FetchContent
#22038 commented on
Mar 7, 2025 • 0 new comments -
Broken multithreading inference session Onnxruntime-directml >= 1.18
#20713 commented on
Mar 7, 2025 • 0 new comments -
[Performance] using onnxruntime with ray and also fix for memory footprint too high
#16793 commented on
Mar 7, 2025 • 0 new comments -
[DO NOT UNPIN] ORT 1.21.0 Release Candidates available for testing
#23885 commented on
Mar 7, 2025 • 0 new comments -
Mixed Precision ValueError: validation failed for model with all nodes in node_block_list
#14235 commented on
Mar 7, 2025 • 0 new comments -
[Web] BiRefNet_T not working on webgpu
#21968 commented on
Mar 7, 2025 • 0 new comments -
[Build] memory leaked
#23915 commented on
Mar 7, 2025 • 0 new comments -
[C++, Linux] Segmentation fault when run OrtApi::Run
#23897 commented on
Mar 7, 2025 • 0 new comments -
Public and open source contains header references to "confidential and proprietary" Microsoft code.
#23917 commented on
Mar 7, 2025 • 0 new comments -
When using the int8 quantization model to convert to onnx, an error occurs during runtime
#23879 commented on
Mar 7, 2025 • 0 new comments -
[Build] Openvino fails to build with AUTO:GPU,CPU
#23866 commented on
Mar 6, 2025 • 0 new comments -
Abs node runs into error with bf16 tensor
#23875 commented on
Mar 6, 2025 • 0 new comments