-
Notifications
You must be signed in to change notification settings - Fork 491
Result not correct when applying it to SSD (Single Shot MultiBox Detector) #10
Comments
hi @heagoo would you be willing to share your work on the merge with only MKL? would be interesting to see the results and performance with intelcaffe. do you have rough performance numbers per frame? |
@nickltj86 The performance is not so good (262ms per frame with 32 cores), I think it could be much better after enable MKL2017_AS_DEFAULT_ENGINE, but unfortunately, the result is not correct yet. |
Thanks. Good to know the performance. I think even after enabling MKL2017, the performance may not increase too much. From the benchmark, the new layers which are not optimized with MKL or MKL2017 such as the mbox layer will cause the most delays. The performance of the common cnn layers are very optimized and seems like as fast as those running on CUDA. the other non-optimized layers are a different story. Do you have a fork with the merge? I think the Intel folks will need that to evaluate it as well. |
Shared to: |
Could you please try with MKL2017 engine enabled, but for layers that cause problems explicitly select engine CAFFE? We are working on fixes to simplify this process and avoid manual engine selection. |
@jdukat Yes, I've already do this, but the result is different from just using MKL, much different, and totally not correct. I tried:
|
Input size matters for performance with MKL2017 for sure. It should not matter for correctness and it seems to be a bug. We should take a closer look at this, but at the moment I am not able to put this on top of my priorities. I'll see what I can do about documentation for MKL DNN functions. |
Thanks! |
Great, the same MKL version will be released for Caffe within next few days. |
Sure, will do it. @jdukat |
Is it confirmed that this bug exists in current master? If so, can we open this issue again to track the bug until it is fixed? |
This bug does not exist any more in the latest update with MKL 2017 update 1. |
Hey @heagoo |
I am using E5-2699 v4, the most powerful one, :)
…On Tue, Nov 22, 2016 at 5:31 AM, Abdallah Moussawi ***@***.*** > wrote:
Hey @heagoo <https://github.com/heagoo>
Could you please tell me what intel processor are you using if you don't
mind? Thanks :)
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#10 (comment)>, or mute
the thread
<https://github.com/notifications/unsubscribe-auth/AFsLw4b9l5JImymyMnvS54hWvYTrxmtUks5rAg2tgaJpZM4Kh3BG>
.
--
--
"The first step is as good as half over."
|
I merged to code of SSD (https://github.com/weiliu89/caffe/tree/ssd) and intel caffe, but when run ssd_detect with SSD300, the result is not correct if MKL2017_AS_DEFAULT_ENGINE is enabled.
After code merge, following 2 things have been confirmed:
I guess SSD300 has a different input shape (3x300x300, not 3x224x224), thus make the convolution result is not correct.
Could you please help to look into it? Thanks!
The text was updated successfully, but these errors were encountered: