-
Notifications
You must be signed in to change notification settings - Fork 1.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
同样的模型结构一个快一个慢 #786
Comments
而且静态编译的MNN比动态的MNN慢,满了接近8倍 |
Sort by node name ! |
上面是速度快的,下面是速度慢的,二者的结构是一模一样得,profile显示mflops也一模一样, 但是慢的这个卷积慢了很多。 Sort by node name ! |
什么backend?编译选项? |
有谁能解释下么,不太明白为什么,还是说卷积层的参数数值分布也会对速度造成影响? |
数值的大小是有可能对计算速度有影响的,加上 |
加了测试依旧老样子...可以提供模型给你们分析分析么 |
opencv/opencv#17259 I have asked OpenCV for help, they have reproduced it, and they found something, maybe it helps |
在Linux下测试一下,优化项改成 |
跑起来好像并没有差别:
|
linux上吗? |
我目前是在windows上测出来的 |
linux上我测试动态库没差异,静态库比动态库慢 |
opencv/opencv#17259 opencv已经找到了原因,是因为有denormal float,我按照Opencv的改动在转模型时做了类似的改动 |
看msvc等效的应该是 |
一模一样得模型结构,只是不同时间训练的,其中一个耗时600ms,而另一个要2s左右。什么原因呢?
The text was updated successfully, but these errors were encountered: