New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Performance far below result with Raspberry Pi 3 B+ #238
Comments
兄弟,你好,我的树莓派系统找不到OpenMP,而你的却能找到,这是什么原因?能给我讲讲吗 |
你是不是没装opencv? |
你看我报的错 |
OpenMP和opencv没关系吧 |
arm上肯定不能开avx2啊,你可以改一下CMakeLists.txt,把avx2关掉,neon打开。 |
你好 |
你好,我使用Raspberry Pi 3 B+调试,编译的打印如下: 想请教一下,这些错误是什么原因呢? |
加到CMakeLists.txt里面 |
CMakeLists.txt里加一个编译选项: |
多谢了,兄弟,已经解决!!! |
恭喜! |
g++编译代码时加-O3了吗? |
There are 4 threads, 4 processors. |
应该是有加的,我后面为做验证,又在CMakeLists.txt里面新加了 set(CMAKE_C_FLAGS "${CMAKE_C_FLAGS} -O3") 编译后效果还是一样,性能没有变化。 |
单核时间如何?
…On Wed, Apr 1, 2020 at 5:05 PM ohohohh ***@***.***> wrote:
于老师您好!
我在用Raspberry Pi 3
B+做调试,看到项目README的结论,这款设备在320x240的图片下,单核性能是8.1fps,多核性能是23.74fps。但是我这边在320x240的实际测试结果,多核只有4.2fps,远远低于前面给出的结论。方便的话,请帮我找找原因,谢谢!
g++编译代码时加-O3了吗?
应该是有加的,我后面为做验证,又在CMakeLists.txt里面新加了
set(CMAKE_C_FLAGS "${CMAKE_C_FLAGS} -O3")
set(CMAKE_CXX_FLAGS "${CMAKE_CXX_FLAGS} -O3")
编译后效果还是一样,性能没有变化。
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
<#238 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ABWR4HLDTR7ZJS2DSQGL5ODRKL7W5ANCNFSM4LXVFREA>
.
|
单核的表现: 对比前面4核的: 大约差了一倍。 |
理论上讲,耗时应该是上一个版本的2倍,8.1fps/2=4fps。但我一直没有在ARM上进行测试。
…On Wed, Apr 1, 2020 at 9:45 PM ohohohh ***@***.***> wrote:
单核时间如何?
… <#m_-7571150502542449208_>
On Wed, Apr 1, 2020 at 5:05 PM ohohohh *@*.***> wrote: 于老师您好!
我在用Raspberry Pi 3
B+做调试,看到项目README的结论,这款设备在320x240的图片下,单核性能是8.1fps,多核性能是23.74fps。但是我这边在320x240的实际测试结果,多核只有4.2fps,远远低于前面给出的结论。方便的话,请帮我找找原因,谢谢!
g++编译代码时加-O3了吗? 应该是有加的,我后面为做验证,又在CMakeLists.txt里面新加了 set(CMAKE_C_FLAGS
"${CMAKE_C_FLAGS} -O3") set(CMAKE_CXX_FLAGS "${CMAKE_CXX_FLAGS} -O3")
编译后效果还是一样,性能没有变化。 — You are receiving this because you commented. Reply to
this email directly, view it on GitHub <#238 (comment)
<#238 (comment)>>,
or unsubscribe
https://github.com/notifications/unsubscribe-auth/ABWR4HLDTR7ZJS2DSQGL5ODRKL7W5ANCNFSM4LXVFREA
.
单核的表现:
***@***.***:~/dev/libfacedetection/build $ ./benchmark 320_0.jpg
There is 1 thread.
cnn facedetection average time = 468.18ms | 2.14 FPS
对比前面4核的:
***@***.***:~/dev/libfacedetection/build $ ./benchmark 320_0.jpg
There are 4 threads, 4 processors.
cnn facedetection average time = 238.36ms | 4.20 FPS
大约差了一倍。
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
<#238 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ABWR4HOTA2DQNSTJKRK53RDRKNAQ3ANCNFSM4LXVFREA>
.
|
好的,了解了。 |
@iamliuyin 我的测试结果跟你非常接近,Raspberry Pi 3 B+, 320X240图片,NEON = ON 多核 FPS 只有5 你后来有解决办法了吗?求指点 |
于老师您好!
我在用Raspberry Pi 3 B+做调试,看到项目README的结论,这款设备在320x240的图片下,单核性能是8.1fps,多核性能是23.74fps。但是我这边在320x240的实际测试结果,多核只有4.2fps,远远低于前面给出的结论。方便的话,请帮我找找原因,谢谢!
//下面是一些环境参数
AVX512 = OFF
AVX2 = OFF
NEON = ON
OpenMP = TRUE
DEMO = ON
add_compile_options(-mfpu=neon) (不加编译不通过)
//cmake的输出
-- The C compiler identification is GNU 8.3.0
-- The CXX compiler identification is GNU 8.3.0
-- Check for working C compiler: /usr/bin/cc
-- Check for working C compiler: /usr/bin/cc -- works
-- Detecting C compiler ABI info
-- Detecting C compiler ABI info - done
-- Detecting C compile features
-- Detecting C compile features - done
-- Check for working CXX compiler: /usr/bin/c++
-- Check for working CXX compiler: /usr/bin/c++ -- works
-- Detecting CXX compiler ABI info
-- Detecting CXX compiler ABI info - done
-- Detecting CXX compile features
-- Detecting CXX compile features - done
fatal: No names found, cannot describe anything.
BUILD_VERSION:v0.0.1
Using ENON
-- Found OpenMP_C: -fopenmp (found version "4.5")
-- Found OpenMP_CXX: -fopenmp (found version "4.5")
-- Found OpenMP: TRUE (found version "4.5")
-- Performing Test COMPILER_HAS_HIDDEN_VISIBILITY
-- Performing Test COMPILER_HAS_HIDDEN_VISIBILITY - Success
-- Performing Test COMPILER_HAS_HIDDEN_INLINE_VISIBILITY
-- Performing Test COMPILER_HAS_HIDDEN_INLINE_VISIBILITY - Success
-- Performing Test COMPILER_HAS_DEPRECATED_ATTR
-- Performing Test COMPILER_HAS_DEPRECATED_ATTR - Success
-- Found OpenCV: /usr (found version "3.2.0")
AVX512 = OFF
AVX2 = OFF
NEON = ON
OpenMP = TRUE
DEMO = ON
-- Configuring done
-- Generating done
-- Build files have been written to: /home/pi/dev/libfacedetection/build
//benchmark的运行结果
pi@raspberrypi:~/dev/libfacedetection/build $ ./benchmark 320_0.jpg
There are 4 threads, 4 processors.
cnn facedetection average time = 238.36ms | 4.20 FPS
The text was updated successfully, but these errors were encountered: