Skip to content

v13.2: latest ort library, DirectML backend

Compare
Choose a tag to compare
@github-actions github-actions released this 28 May 15:01
· 157 commits to master since this release
  • Added support for DirectML backend through ONNX Runtime. It is available for all dx12 devices and can be accessed through backend=Backend.ORT_DML(). waifu2x swin_unet models may not be supported on this backend, and RIFE models may be poorly supported.
  • Asset vsmlrt-windows-x64-vk.*.7z is renamed to vsmlrt-windows-x64-generic-gpu.*.7z and includes backends OV_CPU, OV_GPU, ORT_CPU, ORT_DML, NCNN_VK. cuda asset continue to include all backends in this release.
  • Update onnxruntime to microsoft/onnxruntime@73584f9.

Note

  • Backend OV_GPU may produces reduced precision output. This is under investigation.

benchmark 1

NVIDIA GeForce RTX 3090, 10496 shaders @ 1695 MHz, driver 532.03, Windows 10 21H2 LTSC (19044.1415), Python 3.11.3, vapoursynth-classic R57.A8

Measurements: (1080p, fp16) FPS / Device Memory (MB)

model ORT_CUDA ORT_DML
dpir 4.25 / 2573.3 7.01 / 2371.0
dpir (2 streams) 4.58 / 5506.2 8.85 / 4643.1
waifu2x upconv7 9.10 / 5248.1 9.65 / 4503.1
waifu2x upconv7 (2 streams) 11.15 / 2966.9 18.52 / 8911.2
waifu2x cunet / cugan 4.06 / 7875.7 6.36 / 8973.7
waifu2x cunet / cugan (2 streams) N/A 9.51 / 17849.1
waifu2x swin_unet_art N/A N/A
realesrgan 7.52 / 1901.7 8.54 / 1352.4
realesrgan (2 streams) 11.15 / 2966.9 15.58 / 2608.7
rife 34.30 / 1109.1 2.12 / 1417.8
rife (2 streams) 61.45 / 2051.4 4.27 / 2740.9

benchmark 2

AMD Radeon Pro V620 MxGPU, 4608 shaders @ 2390 MHz, Adrenalin 21.20.02.13, Windows Server 2019, Python 3.11.3, vapoursynth-classic R57.A8

Measurements: (1080p, fp16) FPS / Device Memory (MB)

model NCNN_VK ORT_DML
dpir 1.70 / 3248.4 4.75 / 2308.1
dpir (2 streams) 1.74 / 6099.5 4.86 / 4584.6
waifu2x upconv7 5.18 / 6872.3 14.51 / 4448.5
waifu2x upconv7 (2 streams) 6.14 / 13701 15.98 / 8861.2
waifu2x cunet / cugan (2x2 tiles) 1.07 / 3159.8 5.57 / 2196.7
waifu2x cunet / cugan (2x2 tiles, 2 streams) 1.07 / 3159.8 6.08 / 4357.8
waifu2x swin_unet_art N/A N/A
realesrgan 3.86 / 2699.7 9.59 / 1290.4
realesrgan (2 streams) 4.43 / 5355.8 10.58 / 2545.3
rife N/A 2.68 / 1353.5
rife (2 streams) N/A 4.44 / 2673.3