v15.13.ort: latest ONNX Runtime libraries
Pre-release
Pre-release
·
53 commits
to master
since this release
This is a pre-release for the latest onnx runtime library.
ORT
- Upgraded to ONNX Runtime 1.23.0 (
4754a1d) and added support for Nvidia RTX 50-series GPUs.- Support for attention operations in ONNX Runtime for LLMs is disabled.
- Support for 900 and 10-series GPUs are dropped from
ORT_CUDA.
General
- Upgraded to cuDNN 9.12.0.
vsmlrt.py
- Added optional support for fp16 conversion using TensorRT model optimizer for
TRT_RTX.
Community contributions
Known issues
- fp16 inference for RIFE v2 and SAFA models, as well as fp32/fp16 inference for some SwinIR models, are not currently working in
TRT_RTX. - The old cudnn v8 installation should be removed; otherwise, DLL loading may not work.
Full Changelog: v15.13.rtx...v15.13.ort