You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Can I get inference run time speed gain using quantization in WinMLTools.
Can I get inference run time speed gain after I pruned my model.(weight prune, in other words, there are more 0 in weight.)
envs:
OS: windows server 2019
processor: Intel(R) Xeon(R) CPU E5-2673 v4 @2.30GHz 2.29GHz
python env: a anaconda virtual env.
Thanks.