Bump onnxruntime from 1.13.1 to 1.16.2 #214

dependabot · 2023-11-09T17:36:21Z

Bumps onnxruntime from 1.13.1 to 1.16.2.
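
For context, a bump from 1.13.1 to 1.16.2 leaves the major component unchanged and changes the minor one, so it counts as a semver-minor update. The sketch below shows that comparison; `parse_version` and `classify_bump` are hypothetical helper names used only for illustration, and this is not Dependabot's actual logic. It assumes plain `MAJOR.MINOR.PATCH` version strings.

```python
from typing import Tuple


def parse_version(version: str) -> Tuple[int, int, int]:
    """Split a plain 'MAJOR.MINOR.PATCH' string into integers (assumed format)."""
    major, minor, patch = (int(part) for part in version.split("."))
    return major, minor, patch


def classify_bump(old: str, new: str) -> str:
    """Return which semver component differs first between two versions."""
    old_v, new_v = parse_version(old), parse_version(new)
    if new_v[0] != old_v[0]:
        return "major"
    if new_v[1] != old_v[1]:
        return "minor"
    if new_v[2] != old_v[2]:
        return "patch"
    return "none"


if __name__ == "__main__":
    print(classify_bump("1.13.1", "1.16.2"))
```

A minor bump that spans three releases (1.14, 1.15, 1.16) can still carry behavior changes, so the release notes below are worth reading before merging.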

Release notes

ONNX Runtime v1.16.2

The patch release fixed some issues.

ONNX Runtime v1.16.1

Patch release for 1.16

Fix type of weights and activations in the ONNX quantizer

Fix quantization bug in historic quantizer #17619

Enable session option access in nodejs API

Update nodejs to v18

Align ONNX Runtime extensions inclusion in source and build

Limit per thread context to 1 in the TensorRT EP to avoid error caused by input shape changes

ONNX Runtime v1.16.0

General

Support for serialization of models >=2GB

APIs

New session option to disable default CPU EP fallback: session.disable_cpu_ep_fallback
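
As a rough illustration of the option above, the following sketch sets the config entry from Python through `SessionOptions.add_session_config_entry`. The key string comes from the release note; the value `"1"`, the helper name `make_strict_session_options`, and the model path and provider shown in the comment are assumptions for illustration only.

```python
# Config key as named in the release note above.
DISABLE_CPU_EP_FALLBACK_KEY = "session.disable_cpu_ep_fallback"


def make_strict_session_options():
    """Build SessionOptions with the default CPU EP fallback disabled.

    onnxruntime is imported lazily so this module can be loaded even where
    the package is not installed.
    """
    import onnxruntime as ort

    options = ort.SessionOptions()
    # Session config entries are string key/value pairs; "1" is assumed to enable the option.
    options.add_session_config_entry(DISABLE_CPU_EP_FALLBACK_KEY, "1")
    return options


# Hypothetical usage (model path and provider are placeholders):
#   import onnxruntime as ort
#   session = ort.InferenceSession(
#       "model.onnx",
#       sess_options=make_strict_session_options(),
#       providers=["CUDAExecutionProvider"],
#   )
```

With the fallback disabled, nodes the requested execution provider cannot run should cause an error at session creation rather than silently running on the CPU, which makes unexpected CPU placement easier to spot.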

Java

Support for fp16 and bf16 tensors as inputs and outputs, along with utilities to convert between these and fp32 data. On JDK 20 and newer the fp16 conversion methods use the JDK's Float.float16ToFloat and Float.floatToFloat16 methods which can be hardware accelerated and vectorized on some platforms.

Support for external initializers so that large models can be instantiated without filesystem access
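
To make the fp16/bf16 conversion mentioned in the Java notes above concrete, here is a small standard-library sketch of what such conversions do at the bit level. It is not the ONNX Runtime Java API; all function names are hypothetical, and the bf16 conversion truncates rather than rounding to nearest even, which is a simplification.

```python
import struct


def fp32_to_fp16_bits(value: float) -> int:
    """Encode a float as IEEE 754 half precision and return the 16 raw bits.

    struct raises OverflowError for values outside the fp16 range.
    """
    return struct.unpack("<H", struct.pack("<e", value))[0]


def fp16_bits_to_fp32(bits: int) -> float:
    """Decode 16 raw half-precision bits back into a Python float."""
    return struct.unpack("<e", struct.pack("<H", bits))[0]


def fp32_to_bf16_bits(value: float) -> int:
    """Encode a float as bfloat16 by keeping the top 16 bits of its fp32 form.

    This truncates the low mantissa bits (simplification: no rounding).
    """
    (fp32_bits,) = struct.unpack("<I", struct.pack("<f", value))
    return fp32_bits >> 16


def bf16_bits_to_fp32(bits: int) -> float:
    """Decode bfloat16 bits by placing them in the top half of an fp32 word."""
    return struct.unpack("<f", struct.pack("<I", bits << 16))[0]


if __name__ == "__main__":
    print(hex(fp32_to_fp16_bits(1.0)))   # 0x3c00
    print(hex(fp32_to_bf16_bits(1.0)))   # 0x3f80
```

The two formats trade off differently: fp16 keeps more mantissa bits but has a narrow exponent range, while bf16 keeps the fp32 exponent range with fewer mantissa bits, which is why bf16 conversion can be a plain shift.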

C#

Expose the OrtValue API as the new preferred API for running inference in C#. This reduces garbage and exposes direct native memory access via Slice-like interfaces.

Make Float16 and BFloat16 full-featured fp16 interfaces that support conversion and expose floating-point properties (e.g. IsNaN, IsInfinity, etc.)

C++

Make Float16_t and BFloat16_t full-featured fp16 interfaces that support conversion and expose floating-point properties (e.g. IsNaN, IsInfinity, etc.)

Performance

Improve LLM quantization accuracy with smoothquant

Support 4-bit quantization on CPU

Optimize BeamScore to improve BeamSearch performance

Add FlashAttention v2 support for Attention, MultiHeadAttention and PackedMultiHeadAttention ops

Execution Providers

CUDA EP

Initial fp8 support (QDQ, Cast, MatMul)

Relax CUDA Graph constraints to allow more models to use it

Allow CUDA allocator to be registered with ONNX Runtime externally

TensorRT EP

CUDA Graph support

Support user provided cuda compute stream

Misc bug fixes and improvements

OpenVINO EP

Support OpenVINO 2023.1

... (truncated)

Commits

0c5b95f Cherry-pick LLaMA GQA mask to rel-1.16.2 (round 4) (#18350)
8f06330 Cherry pick LLaMA or SDXL to 1.16.2 release (round 3) (#18323)
0ccca88 Update eigen version (#18308)
ad7cecb Update eigen's URL (#18301)
95c20d0 Cherry-pick two pipeline changes for the 1.16.2 patch release (#18249)
27b0910 cherry pick resize grad pr (#18255)
70b8cda Cherry pick LLaMA to rel-1.16.2 (round 2) (#18245)
2f57f1e Some cherry-picks for the 1.16.2 release (#18218)
bc533a6 [DML EP] Add dynamic graph compilation (#18199)
c273f7a Cherry-pick LLaMA/SDXL to rel-1.16.2 (#18202)
Additional commits viewable in compare view

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.

Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

@dependabot rebase will rebase this PR
@dependabot recreate will recreate this PR, overwriting any edits that have been made to it
@dependabot merge will merge this PR after your CI passes on it
@dependabot squash and merge will squash and merge this PR after your CI passes on it
@dependabot cancel merge will cancel a previously requested merge and block automerging
@dependabot reopen will reopen this PR if it is closed
@dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
@dependabot show <dependency name> ignore conditions will show all of the ignore conditions of the specified dependency
@dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
@dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
@dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

Bumps [onnxruntime](https://github.com/microsoft/onnxruntime) from 1.13.1 to 1.16.2.
- [Release notes](https://github.com/microsoft/onnxruntime/releases)
- [Changelog](https://github.com/microsoft/onnxruntime/blob/main/docs/ReleaseManagement.md)
- [Commits](microsoft/onnxruntime@v1.13.1...v1.16.2)

---
updated-dependencies:
- dependency-name: onnxruntime
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>

dependabot · 2023-11-20T21:47:47Z

OK, I won't notify you again about this release, but will get in touch when a new version is available. If you'd rather skip all updates until the next major or minor version, let me know by commenting @dependabot ignore this major version or @dependabot ignore this minor version. You can also ignore all major, minor, or patch releases for a dependency by adding an ignore condition with the desired update_types to your config file.

If you change your mind, just re-open this PR and I'll resolve any conflicts on it.

dependabot bot added the dependencies label (Pull requests that update a dependency file) Nov 9, 2023

wdika closed this Nov 20, 2023

dependabot bot deleted the dependabot/pip/onnxruntime-1.16.2 branch November 20, 2023 21:47
