Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
23 commits
Select commit Hold shift + click to select a range
0482251
Bump @babel/helpers from 7.25.6 to 7.26.10 in /js/react_native/e2e (#…
dependabot[bot] Aug 25, 2025
a7b3b66
Fix a typo in ORT_API_CALL macro (_stdcall) (#25834)
SavaLione Aug 25, 2025
8da3e40
[build] allow custom CMAKE_C_STANDARD and CMAKE_CXX_STANDARD (#25782)
fs-eire Aug 25, 2025
5b021b2
[webgpu] Add support for Q2 in matmulnbits (#25763)
sushraja-msft Aug 26, 2025
cf05366
[webgpu ] Optimize flash attention for Nvidia (#25777)
qjia7 Aug 26, 2025
17ede5a
safeint.h: quelch gcc's -Wreturn-type (#25655)
bonktree Aug 26, 2025
a6c92bd
Fix focus contrast ratios for accessibility compliance (WCAG 2.1 AA) …
Copilot Aug 26, 2025
ce90609
Fix keyboard navigation accessibility for DocFX tab controls (#25819)
Copilot Aug 26, 2025
db4b0f4
[CPU] Improve QMoE kernel (#25822)
apsonawane Aug 26, 2025
0de1c01
Enable ABSL_FLAGS flag registration for onnxruntime_perf_test for mob…
chilo-ms Aug 26, 2025
f58f7eb
[webgpu] Expand Unsqueeze version to 23 (#25858)
qjia7 Aug 26, 2025
7e3174b
[webgpu] Optimize dp4 prefill shader for Qualcomm (#25578)
qjia7 Aug 26, 2025
16ae99e
Add cuda graph implementation for NV TRT RTX EP (#25787)
umangb-09 Aug 27, 2025
c9c23b0
[QNN-EP] Enable einsum with QK equations for QNN. (#25861)
quic-muchhsu Aug 27, 2025
568ad20
Fix a long standing bug on file memory mapping on windows. (#25833)
yuslepukhin Aug 27, 2025
c9ec1da
Add default constructor to Ort::Status. (#25860)
yuslepukhin Aug 27, 2025
0ba4d29
Bump actions/setup-java from 4 to 5 (#25840)
dependabot[bot] Aug 27, 2025
3cadbdb
Add API for precompiled model compatibility check using just the comp…
adrastogi Aug 27, 2025
7af42b8
[WebNN] Fix the op support limit for batchNormalization (#25856)
shiyi9801 Aug 28, 2025
1d07e94
[WebNN] Support Round op (#25810)
Honry Aug 28, 2025
d4e31dc
Improve SimplifiedLayerNorm by using same techniques as SkipSimplifie…
sushraja-msft Aug 28, 2025
0b15200
DequantizeLinear should support non-zero zero_point when input type i…
wangw-1991 Aug 28, 2025
676a4d2
Merge branch 'master' into sync_msft_28082025
Jaswanth51 Aug 28, 2025
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
4 changes: 2 additions & 2 deletions .github/workflows/android.yml
Original file line number Diff line number Diff line change
Expand Up @@ -124,7 +124,7 @@ jobs:
- uses: actions/checkout@v5

- name: Use jdk 17
uses: actions/setup-java@v4
uses: actions/setup-java@v5
with:
distribution: 'temurin'
java-version: '17'
Expand Down Expand Up @@ -206,7 +206,7 @@ jobs:
- uses: actions/checkout@v5

- name: Use jdk 17
uses: actions/setup-java@v4
uses: actions/setup-java@v5
with:
distribution: 'temurin'
java-version: '17'
Expand Down
2 changes: 1 addition & 1 deletion .github/workflows/codeql.yml
Original file line number Diff line number Diff line change
Expand Up @@ -55,7 +55,7 @@ jobs:
# Setup Java to use a version that is not too old for the project
- if: ${{ matrix.language == 'java' }}
name: Setup Java 11
uses: actions/setup-java@v4
uses: actions/setup-java@v5
with:
java-version: '11'
distribution: 'microsoft'
Expand Down
2 changes: 1 addition & 1 deletion .github/workflows/publish-java-apidocs.yml
Original file line number Diff line number Diff line change
Expand Up @@ -25,7 +25,7 @@ jobs:
steps:
- uses: actions/checkout@v5
- name: Set up JDK 11
uses: actions/setup-java@v4
uses: actions/setup-java@v5
with:
java-version: '11'
distribution: 'adopt'
Expand Down
4 changes: 2 additions & 2 deletions .github/workflows/windows_cuda.yml
Original file line number Diff line number Diff line change
Expand Up @@ -60,7 +60,7 @@ jobs:
with:
node-version: '20.x'

- uses: actions/setup-java@v4
- uses: actions/setup-java@v5
with:
distribution: 'temurin'
java-version: '17'
Expand Down Expand Up @@ -172,7 +172,7 @@ jobs:
with:
node-version: '20.x'

- uses: actions/setup-java@v4
- uses: actions/setup-java@v5
with:
distribution: 'temurin'
java-version: '17'
Expand Down
2 changes: 1 addition & 1 deletion .github/workflows/windows_dml.yml
Original file line number Diff line number Diff line change
Expand Up @@ -51,7 +51,7 @@ jobs:
with:
node-version: '20.x'

- uses: actions/setup-java@v4
- uses: actions/setup-java@v5
with:
distribution: 'temurin'
java-version: '17'
Expand Down
4 changes: 2 additions & 2 deletions .github/workflows/windows_tensorrt.yml
Original file line number Diff line number Diff line change
Expand Up @@ -65,7 +65,7 @@ jobs:
with:
node-version: '20.x'

- uses: actions/setup-java@v4
- uses: actions/setup-java@v5
with:
distribution: 'temurin'
java-version: '17'
Expand Down Expand Up @@ -177,7 +177,7 @@ jobs:
with:
node-version: '20.x'

- uses: actions/setup-java@v4
- uses: actions/setup-java@v5
with:
distribution: 'temurin'
java-version: '17'
Expand Down
4 changes: 2 additions & 2 deletions .github/workflows/windows_webgpu.yml
Original file line number Diff line number Diff line change
Expand Up @@ -61,7 +61,7 @@ jobs:
node-version: "20.x"

- name: Setup Java
uses: actions/setup-java@v4
uses: actions/setup-java@v5
with:
distribution: "temurin"
java-version: "17"
Expand Down Expand Up @@ -249,7 +249,7 @@ jobs:
node-version: "20.x"

- name: Setup Java
uses: actions/setup-java@v4
uses: actions/setup-java@v5
with:
distribution: "temurin"
java-version: "17"
Expand Down
2 changes: 1 addition & 1 deletion .github/workflows/windows_x64_debug_build_x64_debug.yml
Original file line number Diff line number Diff line change
Expand Up @@ -43,7 +43,7 @@ jobs:
node-version: '20.x'

- name: Setup Java
uses: actions/setup-java@v4
uses: actions/setup-java@v5
with:
distribution: 'temurin'
java-version: '17'
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -43,7 +43,7 @@ jobs:
node-version: '20.x'

- name: Setup Java
uses: actions/setup-java@v4
uses: actions/setup-java@v5
with:
distribution: 'temurin'
java-version: '17'
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -43,7 +43,7 @@ jobs:
node-version: '20.x'

- name: Setup Java
uses: actions/setup-java@v4
uses: actions/setup-java@v5
with:
distribution: 'temurin'
java-version: '17'
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -43,7 +43,7 @@ jobs:
node-version: '20.x'

- name: Setup Java
uses: actions/setup-java@v4
uses: actions/setup-java@v5
with:
distribution: 'temurin'
java-version: '17'
Expand Down
2 changes: 1 addition & 1 deletion .github/workflows/windows_x64_release_xnnpack.yml
Original file line number Diff line number Diff line change
Expand Up @@ -43,7 +43,7 @@ jobs:
node-version: '20.x'

- name: Setup Java
uses: actions/setup-java@v4
uses: actions/setup-java@v5
with:
distribution: 'temurin'
java-version: '17'
Expand Down
2 changes: 1 addition & 1 deletion .github/workflows/windows_x86.yml
Original file line number Diff line number Diff line change
Expand Up @@ -44,7 +44,7 @@ jobs:
architecture: x86 #Add architecture

- name: Setup Java
uses: actions/setup-java@v4
uses: actions/setup-java@v5
with:
distribution: 'temurin'
java-version: '17'
Expand Down
16 changes: 10 additions & 6 deletions cmake/CMakeLists.txt
Original file line number Diff line number Diff line change
Expand Up @@ -22,7 +22,9 @@ if("${CMAKE_CXX_COMPILER_ID}" MATCHES "IntelLLVM")
endif()

# Needed for Java
set(CMAKE_C_STANDARD 99)
if (NOT CMAKE_CXX_STANDARD)
set(CMAKE_C_STANDARD 99)
endif()

include(CheckCXXCompilerFlag)
include(CheckLanguage)
Expand All @@ -32,11 +34,13 @@ include(CheckFunctionExists)
include(CheckSymbolExists)
include(GNUInstallDirs) # onnxruntime_providers_* require CMAKE_INSTALL_* variables

# TODO: update this once all system adapt c++20
if(CMAKE_SYSTEM_NAME STREQUAL "Darwin")
set(CMAKE_CXX_STANDARD 20)
else()
set(CMAKE_CXX_STANDARD 17)
if (NOT CMAKE_CXX_STANDARD)
# TODO: update this once all system adapt c++20
if (CMAKE_SYSTEM_NAME STREQUAL "Darwin")
set(CMAKE_CXX_STANDARD 20)
else()
set(CMAKE_CXX_STANDARD 17)
endif()
endif()

if (MSVC)
Expand Down
6 changes: 6 additions & 0 deletions cmake/onnxruntime_unittests.cmake
Original file line number Diff line number Diff line change
Expand Up @@ -1232,6 +1232,12 @@ if (NOT onnxruntime_ENABLE_TRAINING_TORCH_INTEROP)
${onnxruntime_perf_test_src_patterns}
)
onnxruntime_add_executable(onnxruntime_perf_test ${onnxruntime_perf_test_src} ${ONNXRUNTIME_ROOT}/core/platform/path_lib.cc)

# ABSL_FLAGS_STRIP_NAMES is set to 1 by default to disable flag registration when building for Android, iPhone, and "embedded devices".
# See the issue: https://github.com/abseil/abseil-cpp/issues/1875
# We set it to 0 for all builds to be able to use ABSL flags for onnxruntime_perf_test.
target_compile_definitions(onnxruntime_perf_test PRIVATE ABSL_FLAGS_STRIP_NAMES=0)

if(MSVC)
target_compile_options(onnxruntime_perf_test PRIVATE "$<$<COMPILE_LANGUAGE:CUDA>:SHELL:--compiler-options /utf-8>"
"$<$<NOT:$<COMPILE_LANGUAGE:CUDA>>:/utf-8>")
Expand Down
4 changes: 4 additions & 0 deletions csharp/ApiDocs/_exported_templates/default/styles/docfx.css
Original file line number Diff line number Diff line change
Expand Up @@ -323,6 +323,8 @@ article section {
}
.docs-search > .search-query:focus {
outline: 0;
border: 2px solid #0050C5;
background-color: #f8f9fa;
}
.search-results-frame {
clear: both;
Expand Down Expand Up @@ -597,6 +599,8 @@ body .toc{
}
.toc-filter > input:focus {
outline: 0;
border: 2px solid #0050C5;
background-color: #f8f9fa;
}
.toc-filter > .filter-icon {
position: absolute;
Expand Down
87 changes: 87 additions & 0 deletions csharp/ApiDocs/_exported_templates/default/styles/docfx.js
Original file line number Diff line number Diff line change
Expand Up @@ -802,6 +802,7 @@ $(function () {
}
}
container.addEventListener('click', function (event) { return handleClick(event, state); });
container.addEventListener('keydown', function (event) { return handleKeyDown(event, state); });
if (state.groups.length === 0) {
return state;
}
Expand All @@ -820,6 +821,7 @@ $(function () {
while (li) {
var a = li.firstElementChild;
a.setAttribute(contentAttrs.name, 'tab');
a.setAttribute('role', 'tab');
var dataTab = a.getAttribute('data-tab').replace(/\+/g, ' ');
a.setAttribute('data-tab', dataTab);
var section = element.querySelector("[id=\"" + a.getAttribute('aria-controls') + "\"]");
Expand Down Expand Up @@ -915,6 +917,91 @@ $(function () {
}
}

function handleKeyDown(event, state) {
var info = getTabInfoFromEvent(event);
if (info === null) {
return;
}

var handled = false;
var tabGroup = info.group;
var currentTabIndex = tabGroup.tabs.findIndex(function(tab) { return tab.a === info.anchor; });

switch (event.key) {
case 'ArrowLeft':
case 'ArrowUp':
// Move to previous tab
handled = true;
var prevIndex = currentTabIndex - 1;
if (prevIndex < 0) {
prevIndex = tabGroup.tabs.length - 1;
}
while (prevIndex !== currentTabIndex && !tabGroup.tabs[prevIndex].visible) {
prevIndex--;
if (prevIndex < 0) {
prevIndex = tabGroup.tabs.length - 1;
}
}
if (tabGroup.tabs[prevIndex].visible) {
tabGroup.tabs[prevIndex].focus();
}
break;

case 'ArrowRight':
case 'ArrowDown':
// Move to next tab
handled = true;
var nextIndex = currentTabIndex + 1;
if (nextIndex >= tabGroup.tabs.length) {
nextIndex = 0;
}
while (nextIndex !== currentTabIndex && !tabGroup.tabs[nextIndex].visible) {
nextIndex++;
if (nextIndex >= tabGroup.tabs.length) {
nextIndex = 0;
}
}
if (tabGroup.tabs[nextIndex].visible) {
tabGroup.tabs[nextIndex].focus();
}
break;

case 'Home':
// Move to first visible tab
handled = true;
for (var i = 0; i < tabGroup.tabs.length; i++) {
if (tabGroup.tabs[i].visible) {
tabGroup.tabs[i].focus();
break;
}
}
break;

case 'End':
// Move to last visible tab
handled = true;
for (var i = tabGroup.tabs.length - 1; i >= 0; i--) {
if (tabGroup.tabs[i].visible) {
tabGroup.tabs[i].focus();
break;
}
}
break;

case 'Enter':
case ' ': // Space key
// Activate the current tab
handled = true;
handleClick(event, state);
break;
}

if (handled) {
event.preventDefault();
event.stopPropagation();
}
}

function selectTabs(tabIds) {
for (var _i = 0, tabIds_1 = tabIds; _i < tabIds_1.length; _i++) {
var tabId = tabIds_1[_i];
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -1220,7 +1220,7 @@ to{background-position:0 0}
.list-group-item.active .list-group-item-text,.list-group-item.active:focus .list-group-item-text,.list-group-item.active:hover .list-group-item-text{color:#c7ddef}
a.list-group-item,button.list-group-item{color:#555}
a.list-group-item .list-group-item-heading,button.list-group-item .list-group-item-heading{color:#333}
a.list-group-item:focus,a.list-group-item:hover,button.list-group-item:focus,button.list-group-item:hover{color:#555;text-decoration:none;background-color:#f5f5f5}
a.list-group-item:focus,a.list-group-item:hover,button.list-group-item:focus,button.list-group-item:hover{color:#333;text-decoration:none;background-color:#f5f5f5}
button.list-group-item{width:100%;text-align:left}
.list-group-item-success{color:#3c763d;background-color:#dff0d8}
a.list-group-item-success,button.list-group-item-success{color:#3c763d}
Expand All @@ -1230,7 +1230,7 @@ a.list-group-item-success.active,a.list-group-item-success.active:focus,a.list-g
.list-group-item-info{color:#31708f;background-color:#d9edf7}
a.list-group-item-info,button.list-group-item-info{color:#31708f}
a.list-group-item-info .list-group-item-heading,button.list-group-item-info .list-group-item-heading{color:inherit}
a.list-group-item-info:focus,a.list-group-item-info:hover,button.list-group-item-info:focus,button.list-group-item-info:hover{color:#31708f;background-color:#c4e3f3}
a.list-group-item-info:focus,a.list-group-item-info:hover,button.list-group-item-info:focus,button.list-group-item-info:hover{color:#1e4a5f;background-color:#c4e3f3}
a.list-group-item-info.active,a.list-group-item-info.active:focus,a.list-group-item-info.active:hover,button.list-group-item-info.active,button.list-group-item-info.active:focus,button.list-group-item-info.active:hover{color:#fff;background-color:#31708f;border-color:#31708f}
.list-group-item-warning{color:#8a6d3b;background-color:#fcf8e3}
a.list-group-item-warning,button.list-group-item-warning{color:#8a6d3b}
Expand Down
26 changes: 10 additions & 16 deletions include/onnxruntime/core/graph/model_saving_options.h
Original file line number Diff line number Diff line change
Expand Up @@ -9,36 +9,30 @@ class PrepackedWeightsForGraph;

// These options affect how the model initializers are written to the external file.
// This includes options to align external initializer offset.
// For models running on CPU, ORT will try to use mmap to load external
// initializers. To use mmap, external initializer need to be offset aligned.
// ORT will try to use mmap to load external initializers.
//
// ORT saves external initializers into single data file, each initializer is
// accessed with offset(start position of initializer) and length(byte length of
// initializer) of the data file. To use mmap, each offset need to be aligned
// which means offset need to divisible by allocation granularity(64KB for
// windows and 4K for other OSes). With align_offset to true, ORT will align
// offset for large initializer when save ONNX model with external data file.
// initializer) of the data file. With align_offset to true, ORT will align
// offset for large initializer (larger than align_threshold)
// when save ONNX model with external data file. It will align then to
// on_disk_alignment value.
struct ModelSavingOptions {
explicit ModelSavingOptions(size_t size_threshold)
: initializer_size_threshold(size_threshold) {}

// Minimal initializer size in bytes to be externalized on disk
size_t initializer_size_threshold;
// Offset will always be page aligned and allocation granularity aligned for
// mmap support. This is done by padding previous tensor data with zeros
// keeping same length.
// Offset will always be aligned for mmap support.
// This is done by padding previous tensor data with zeros keeping same length.
bool align_offset = false;
// Alignment threshold for size of data.
// Having a low threshold will waste file space for small initializers.
// Only when tensor's data size is > the page_align_threshold it will be force
// aligned. Default to 1MB.
int64_t align_threshold = 1048576;
// The allocation Granularity for mmap() support.
// Typically 64KB for Windows & 4KB for other OSes. Default to 64KB.
#ifdef _WIN32
int64_t allocation_granularity = 65536;
#else
int64_t allocation_granularity = 4096;
#endif
// Alignment factor for big tensors (bigger than align_threshold). Defaults to 4K.
int64_t on_disk_alignment = 4096;
// Force embed all external initializer into the Onnx file
// Used for EPContext model generation while some nodes fallback on CPU which has external data dependency
bool force_embed_external_ini = false;
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -31,7 +31,7 @@ constexpr const char* kDetailedBuildLog = "nv_detailed_build_log";
constexpr const char* kProfilesMinShapes = "nv_profile_min_shapes";
constexpr const char* kProfilesMaxShapes = "nv_profile_max_shapes";
constexpr const char* kProfilesOptShapes = "nv_profile_opt_shapes";
constexpr const char* kCudaGraphEnable = "nv_cuda_graph_enable";
constexpr const char* kCudaGraphEnable = "enable_cuda_graph";
constexpr const char* kMultiProfileEnable = "nv_multi_profile_enable";
constexpr const char* kUseExternalDataInitializer = "nv_use_external_data_initializer";

Expand Down
Loading
Loading