1x1 Convolution of 2 stride code issue #95

Closed · sj-leo opened this issue Aug 7, 2017 · 12 comments


sj-leo commented Aug 7, 2017

Dear MKLDNN developers,

When I ran ResNet-50 with Intel Caffe and MKLDNN, the memory address used by the 1x1 convolution with stride 2 never changed.

In the code, your team maps a 1x1 convolution with stride 2 to a ‘reduce’ step and stores the address of the reduced data in ‘jcp.ws’.
When I print the address of ‘jcp.ws’, it is always the same (I think it should change).
So, I would like to know whether this address is supposed to change or not.

For a detailed explanation, I list my setup and the file containing the 1x1 convolution code below.

Setup

  • ML framework: Intel Caffe (version: v1.0.3)
  • MKLDNN version: v0.9
  • ML Algorithm: ResNet-50

Code

src/cpu/jit_avx512_common_1x1_convolution.cpp (lines 166-175)

Thank you.


emfomenk commented Aug 7, 2017

Hi @sj-leo,

ws stands for workspace. It is internal memory for the primitive.
1x1 convolutions w/ non-unit stride currently work as follows:

  • compress (reduce) source image into workspace (take only pixels with given stride)
  • run regular 1x1 conv kernel for unit-stride with workspace as input (instead of original src)

So even if the address of the actual src data changes (i.e. another src is taken), the workspace address remains the same. In other words, I would not expect ws to change.
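
For illustration, here is a minimal, hypothetical sketch of that two-step scheme (single channel, plain loops; the function names are mine, and the real primitive is blocked and JIT-generated):

```cpp
#include <cstdio>
#include <vector>

// Step 1: "reduce" -- copy only the pixels hit by the stride into a
// dense workspace, so the kernel can treat it as unit-stride input.
static void reduce_src(const float *src, float *ws,
                       int ih, int iw, int stride) {
    const int oh = (ih + stride - 1) / stride;
    const int ow = (iw + stride - 1) / stride;
    for (int y = 0; y < oh; ++y)
        for (int x = 0; x < ow; ++x)
            ws[y * ow + x] = src[y * stride * iw + x * stride];
}

// Step 2: a plain unit-stride 1x1 convolution over the workspace
// (one input and one output channel, just to show the control flow).
static void conv_1x1_unit_stride(const float *ws, float *dst,
                                 int spatial, float weight) {
    for (int i = 0; i < spatial; ++i)
        dst[i] = ws[i] * weight;
}

int main() {
    const int ih = 4, iw = 4, stride = 2;
    const int oh = ih / stride, ow = iw / stride;
    std::vector<float> src(ih * iw);
    for (int i = 0; i < ih * iw; ++i) src[i] = float(i);

    // The workspace is allocated once with the primitive, so its base
    // address stays the same across runs -- matching the observation.
    std::vector<float> ws(oh * ow), dst(oh * ow);
    reduce_src(src.data(), ws.data(), ih, iw, stride);
    conv_1x1_unit_stride(ws.data(), dst.data(), oh * ow, 1.0f);
    for (float v : dst) std::printf("%g ", v); // prints: 0 2 8 10
    std::printf("\n");
    return 0;
}
```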


sj-leo commented Aug 7, 2017

In your workflow description, you said that 1x1 convolutions with non-unit stride compress the source image into the workspace.
In order to use the workspace, I think the code should change the input height (ih) and input width (iw) offsets relative to the base ws address. But ih and iw never change (I also looked at jit_avx512_common_1x1_conv_kernel.cpp).

So, given that the ws address never changes, I think the convolution is not performed over the whole space of ws, and I am not sure this is the correct implementation.

I wonder why the convolution is performed iteratively on the same area without changing the ih and iw of ws.


rsdubtso commented Aug 7, 2017

inner_ker() is called from a loop over the original dimensions; look at the code starting at src/cpu/jit_avx512_common_1x1_convolution.cpp:180. The data from the original input tensor is copied into the workspace to remove the strides. And you are right that each thread reuses its own part of the workspace across different inner_ker() calls, but the data in the workspace comes from different parts of the input.
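
A hedged model of that driver loop, as an illustration only (names such as iwork, bcast_step, and ws mirror the discussion, but the real code is additionally blocked over channels and parallelized over threads):

```cpp
#include <cstdio>
#include <vector>

int main() {
    const int ih = 8, iw = 8, stride = 2;
    const int oh = ih / stride, ow = iw / stride;
    const int os = oh * ow;     // output spatial size
    const int bcast_step = ow;  // pixels handled per inner_ker() call

    std::vector<float> src(ih * iw), ws(bcast_step), dst(os);
    for (int i = 0; i < ih * iw; ++i) src[i] = float(i);

    for (int iwork = 0; iwork < os; iwork += bcast_step) {
        // A new src offset on every iteration...
        const float *src_blk = src.data() + (iwork / ow) * stride * iw;
        // ...but always the same workspace base address.
        for (int x = 0; x < bcast_step; ++x)
            ws[x] = src_blk[x * stride];     // "reduce" one output row
        for (int x = 0; x < bcast_step; ++x) // "inner_ker" body on ws
            dst[iwork + x] = ws[x] * 1.0f;
        std::printf("iwork=%2d: src offset=%3td, ws=%p\n",
                    iwork, src_blk - src.data(), (void *)ws.data());
    }
    return 0;
}
```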


sj-leo commented Aug 7, 2017

@rsdubtso As you said, I checked the code: data is copied to remove the strides, and the workspace is filled from different parts of the input at the thread level.

But within a single thread, there seems to be no place where iw and ih change.
If you could tell me more about that part, I would appreciate it.


emfomenk commented Aug 7, 2017

The convolution kernel goes over the whole image ih * iw, so the pointer shouldn't change with respect to the spatial dimension.


sj-leo commented Aug 8, 2017

As far as I know, the convolution kernel covers only a fraction of the image ih * iw at a time, determined by bcast_step.
If iwork advances by bcast_step, I think the rp.ws pointer needs to change as well.

Could you provide further information?


emfomenk commented Aug 8, 2017

Yeah, my bad -- you are right, the kernel does not always process ih * iw.

According to the line, we reduce src from the right place.
The strange thing, though, is the condition on line 169.
It doesn't seem to be always correct (at least not for all the loop orders).
Let me double-check that.

Do you have a particular failing example, or is this question mostly due to curiosity? :)


sj-leo commented Aug 8, 2017

In my case, I am looking at how the convolution operation of ResNet-50 is performed and whether there is room for performance optimization, mostly out of curiosity.
That is why I was in doubt about whether the operation is correct.

ankalinin (Contributor) commented
The loop order is set in the jit_avx512_common_1x1_conv_kernel::init_conf method in jit_avx512_common_1x1_conv_kernel.cpp. There are two possible loop orders for 1x1 convolutions with non-unit strides: loop_blr (bcast-load-reduce) or loop_rbl (reduce-bcast-load). The condition on line 169 is correct for these loop orders; it ensures that the reduction is performed only once for a given block of input channels and a given fraction of the image ih * iw.
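
As a toy illustration of what such a condition achieves (this is not the real check on line 169): with the bcast-load-reduce order, one reduced block is reused for every load (output-channel) block, so the copy into ws only needs to run on the first load iteration.

```cpp
#include <cstdio>

int main() {
    const int nb_bcast = 2, nb_load = 3; // hypothetical block counts
    for (int b = 0; b < nb_bcast; ++b)
        for (int l = 0; l < nb_load; ++l) {
            if (l == 0) // reduce exactly once per bcast block
                std::printf("bcast %d: reduce src -> ws\n", b);
            std::printf("bcast %d, load %d: 1x1 kernel on ws\n", b, l);
        }
    return 0;
}
```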

ankalinin (Contributor) commented
> But within a single thread, there seems to be no place where iw and ih change.

iw and ih change on each iteration of the loop over the bcast dimension; see, e.g., line 190.
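
In other words, the spatial position is recovered from the flattened bcast index on every iteration. A hypothetical version of that offset math (the variable names are mine, not those in the actual code around line 190):

```cpp
#include <cstdio>

int main() {
    // Assumed geometry: 4x4 output, stride 2, two pixels per step.
    const int ow = 4, stride_h = 2, stride_w = 2;
    const int os = 16, bcast_step = 2;
    for (int iwork = 0; iwork < os; iwork += bcast_step) {
        // Decompose the flattened index into an output row/column,
        // then map it back to the strided input position (ih, iw).
        const int ih_off = (iwork / ow) * stride_h;
        const int iw_off = (iwork % ow) * stride_w;
        std::printf("iwork=%2d -> ih=%d, iw=%d\n", iwork, ih_off, iw_off);
    }
    return 0;
}
```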


haidj commented Nov 8, 2017

Hi @emfomenk,

I wonder if this issue with the 1x1 convolution has been resolved.

Thank you for your help.

> Yeah, my bad -- you are right, the kernel does not always process ih * iw.
>
> According to the line, we reduce src from the right place.
> The strange thing, though, is the condition on line 169.
> It doesn't seem to be always correct (at least not for all the loop orders).
> Let me double-check that.
>
> Do you have a particular failing example, or is this question mostly due to curiosity? :)


emfomenk commented Nov 8, 2017

Hi @haidj,

My suspicion was incorrect.
According to this comment, there is no issue with the current code.
Thanks to @ankalinin for the confirmation.
