You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Describe the bug
Hello! Found the performance issue in new ORT 1.8.1
New ORT 1.8.1 is slower than 1.8.0 ~5-6 times.
Urgency
Not urgent, but
we use C# version of ORT for production environment. Unfortunately we can't use C# ORT 1.8.0 version because of this bug #8052. Old 1.7.0 has performance issues too. Current 1.8.1 has significant performance degradation. Proof on Python below.
System information
OS Platform and Distribution: Linux Ubuntu 18.04: Linux x86_64
ONNX Runtime installed from (source or binary): binary
ONNX Runtime version: onnxruntime-1.8.0 onnxruntime-1.8.1
Python version: Python 3.8.5
Pytorch version: 1.8.1
CUDA/cuDNN version: CUDA Version: 11.1
GPU model and memory: Tesla V100 (32G)
This issue has been automatically marked as stale due to inactivity and will be closed in 7 days if no further activity occurs. If further support is needed, please provide an update and/or more details.
stalebot
added
the
stale
issues that have not been addressed in a while; categorized by a bot
label
Apr 19, 2022
Describe the bug
Hello! Found the performance issue in new ORT 1.8.1
New ORT 1.8.1 is slower than 1.8.0 ~5-6 times.
Urgency
Not urgent, but
we use C# version of ORT for production environment. Unfortunately we can't use C# ORT 1.8.0 version because of this bug #8052. Old 1.7.0 has performance issues too. Current 1.8.1 has significant performance degradation. Proof on Python below.
System information
OS Platform and Distribution: Linux Ubuntu 18.04: Linux x86_64
ONNX Runtime installed from (source or binary): binary
ONNX Runtime version: onnxruntime-1.8.0 onnxruntime-1.8.1
Python version: Python 3.8.5
Pytorch version: 1.8.1
CUDA/cuDNN version: CUDA Version: 11.1
GPU model and memory: Tesla V100 (32G)
To Reproduce
save this as benchmark_repro.py
install ORT 1.8.1
reproduce 1.8.1
install ORT 1.8.1
reproduce 1.8.0
And compare metrics
ORT 1.8.1:
average load+fwd 61.56 msec
vs
ORT 1.8.0:
average load+fwd 11.77 msec
Expected behavior
ORT 1.8.1 works as fast as 1.8.0
Additional context
Maybe this old issue would help: #7212
Thanks!
The text was updated successfully, but these errors were encountered: