v15.13.ort: latest ONNX Runtime libraries

Pre-release

Pre-release

github-actions released this 31 Aug 12:11

· 53 commits to master since this release

f4f1278

This is a pre-release for the latest onnx runtime library.

ORT

Upgraded to ONNX Runtime 1.23.0 (4754a1d) and added support for Nvidia RTX 50-series GPUs.
- Support for attention operations in ONNX Runtime for LLMs is disabled.
- Support for 900 and 10-series GPUs are dropped from ORT_CUDA.

General

Upgraded to cuDNN 9.12.0.

vsmlrt.py

Added optional support for fp16 conversion using TensorRT model optimizer for TRT_RTX.

Community contributions

TRT_RTX improvements by @abihf in #140

Known issues

fp16 inference for RIFE v2 and SAFA models, as well as fp32/fp16 inference for some SwinIR models, are not currently working in TRT_RTX.
The old cudnn v8 installation should be removed; otherwise, DLL loading may not work.

Full Changelog: v15.13.rtx...v15.13.ort

Contributors

abihf

Assets 19