
foreground normalization #281

Closed
powder21 opened this issue Jun 26, 2019 · 3 comments

Comments

@powder21

powder21 commented Jun 26, 2019

[image: the attention-score equation being asked about]
How can the coefficient |x_i| be canceled in the right-hand term? Is there something that I misunderstand?

@JiahuiYu
Owner

@powder21 Hi, sorry for the confusion, and thank you for your patience. It has been a while since I wrote that code. I just took a careful look at it and found that this term is indeed not cancelled. On the other hand, since we apply a scale before the softmax anyway, the norm |x_i| may not matter much (it can be viewed as a high softmax temperature). Note that the softmax is over channels (that is, over w_j), so normalizing by |w_j| is the more important part.

In conclusion, I agree that there should be a norm |x_i|, and I appreciate your finding. But the results may not change much, because the softmax runs at a high temperature anyway. If you are interested, you can run the simple example of contextual attention shown in https://github.com/JiahuiYu/generative_inpainting#faq (How to implement contextual attention?) to verify the difference with and without the norm |x_i|.

# similarity scores: convolve the foreground xi with the L2-normalized background patches wi_normed (no |x_i| norm here)
yi = tf.nn.conv2d(xi, wi_normed, strides=[1,1,1,1], padding="SAME")
# softmax over the channel axis (one channel per background patch), sharpened by the scale
yi = tf.nn.softmax(yi*scale, 3)
# paste back the attention-weighted raw background patches (wi_center); the /4. compensates for patch overlap
yi = tf.nn.conv2d_transpose(yi, wi_center, tf.concat([[1], raw_fs[1:]], axis=0), strides=[1,rate,rate,1]) / 4.
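
For the with/without comparison suggested above, here is a minimal sketch (not the repository's code) of adding the |x_i| norm before the softmax. It reuses xi, wi_normed and scale from the snippet above and assumes the background patches are ksize x ksize, where ksize is a placeholder name:

import tensorflow as tf  # TF 1.x, as in the snippet above

c = xi.get_shape().as_list()[-1]                 # channel count of the foreground features
ones_kernel = tf.ones([ksize, ksize, c, 1])      # all-ones kernel sums x^2 over each patch window
xi_sq_sum = tf.nn.conv2d(tf.square(xi), ones_kernel, strides=[1,1,1,1], padding="SAME")
xi_norm = tf.maximum(tf.sqrt(xi_sq_sum), 1e-4)   # |x_i| at every spatial location, clamped away from zero
yi = tf.nn.conv2d(xi, wi_normed, strides=[1,1,1,1], padding="SAME") / xi_norm
yi = tf.nn.softmax(yi*scale, 3)                  # softmax over channels, as before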

@powder21
Author

powder21 commented Jun 27, 2019

@JiahuiYu Thanks! I understand now that |x_i| doesn't matter, rather than being canceled: the high softmax temperature serves to pick out the maximum similarity (nearly one-hot), and |x_i| is a common factor across channels when the softmax is computed over channels.
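
A quick numeric illustration of that point with made-up similarity values (just a sketch, not code from the repository): dividing the logits at one location by a common positive factor |x_i| does not change which patch gets the largest weight, and with a large scale the softmax stays peaked on the same patch.

import numpy as np

def softmax(v):
    e = np.exp(v - v.max())
    return e / e.sum()

sims = np.array([0.9, 0.7, 0.2])    # inner products <x_i, w_j/|w_j|> over channels j at one location
scale = 10.                         # the scale applied before the softmax
print(softmax(scale * sims))        # without the |x_i| norm: roughly [0.88, 0.12, 0.00]
print(softmax(scale * sims / 2.5))  # with a hypothetical |x_i| = 2.5: same argmax, flatter weights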

Besides, normalization cannot be applied to x_i unless we extract patches from it, but extracting patches would give up the efficiency of computing the similarity by convolution.

@JiahuiYu
Owner

Thanks for pointing that out! These issues will be very helpful for others who have the same question or concern in the future.
