Extra softmax layer #6

Open
Atcold opened this issue Nov 26, 2017 · 6 comments


Atcold commented Nov 26, 2017

Why is there an extra softmax layer at https://github.com/gram-ai/capsule-networks/blob/master/capsule_network.py#L106?
Each capsule's norm already models a probability.
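
For reference, a minimal sketch of the squashing non-linearity (Eq. 1 in the paper), assuming PyTorch and illustrative tensor shapes; it shows why the capsule norm can already be read as a probability without any softmax:

```python
import torch

def squash(s, dim=-1, eps=1e-8):
    # Squashing non-linearity (Eq. 1): keeps the vector's direction and
    # maps its norm into [0, 1), so the norm acts like a probability.
    sq_norm = (s ** 2).sum(dim=dim, keepdim=True)
    return (sq_norm / (1.0 + sq_norm)) * s / torch.sqrt(sq_norm + eps)

v = squash(torch.randn(2, 10, 16))   # e.g. batch of 2, 10 digit capsules of dimension 16
p = (v ** 2).sum(dim=-1) ** 0.5      # ||v_k|| is already in [0, 1) for each class
```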


InnovArul commented Jun 10, 2018

Though each capsule's norm is a probability in [0, 1], the capsules compete among themselves to send their information to the higher-level capsules (based on their correlation with the output of those higher-level capsules). Hence the softmax layer.
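
The softmax described here matches the one used inside routing-by-agreement (Procedure 1 in the paper), taken over the coupling logits so that each primary capsule's coupling coefficients sum to 1 across the digit capsules. A minimal sketch, assuming PyTorch and illustrative MNIST-like sizes (1152 primary capsules, 10 digit capsules):

```python
import torch
import torch.nn.functional as F

b = torch.zeros(1152, 10)   # routing logits b_ij, initialised to zero
c = F.softmax(b, dim=1)     # coupling coefficients c_ij: each row sums to 1 over the digit capsules
# During routing, b_ij is increased by the agreement u_hat_ij . v_j and c is recomputed,
# which is the "competition" between primary capsules described above.
```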


Atcold commented Jun 11, 2018

That's not how Capsules work...


InnovArul commented Jun 11, 2018

Maybe if you could write up your understanding of capsules or point out the relevant lines in the paper, it would be helpful for discussing and learning, I guess. Anyway, I will let the code owner clarify your doubts.

In my understanding, the higher the correlation between a primary capsule's output and a digit capsule's output, the stronger the bond between them. Hence it's a kind of attention mechanism between primary capsules and digit capsules, which necessitates a softmax (based on correlation).


Atcold commented Jun 11, 2018

From the paper, section 4, last paragraph, you have that

Our implementation [...] minimize the sum of the margin losses in Eq. 4.

(Install this extension to view LaTeX on GitHub.)

$L_k = T_k \max(0, m^+ - \|v_k\|)^2 + \lambda (1 - T_k) \max(0, \|v_k\| - m^-)^2$

So, as you can see, you're supposed to use $\|v_k\|$, which is `classes = (x ** 2).sum(dim=-1) ** 0.5`.
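
A minimal sketch of that margin loss (Eq. 4) in PyTorch, using the paper's values $m^+ = 0.9$, $m^- = 0.1$, $\lambda = 0.5$; the function name and tensor shapes are illustrative, not the repo's code:

```python
import torch

def margin_loss(v, targets, m_pos=0.9, m_neg=0.1, lam=0.5):
    # v: (batch, num_classes, capsule_dim) digit-capsule outputs
    # targets: (batch, num_classes) one-hot labels T_k
    v_norm = (v ** 2).sum(dim=-1) ** 0.5                        # ||v_k||, no softmax applied
    loss = (targets * torch.clamp(m_pos - v_norm, min=0.0) ** 2
            + lam * (1 - targets) * torch.clamp(v_norm - m_neg, min=0.0) ** 2)
    return loss.sum(dim=-1).mean()
```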


InnovArul commented Jun 12, 2018

Oh, I see. My bad. I didn't see which softmax you were referring to. :)

I think you are right. There is no need for a softmax there (since the vector's magnitude already emulates a probability). Thanks for elaborating.

By the way, I have noticed some more deviations of the implementation from the paper. Please check them when you find time. I'm not sure if my interpretation is correct.

#23


Atcold commented Jun 13, 2018

  • That's why I put the link to the wrong line there.
  • The point is not that "there is no need" but that "it's plain wrong".
  • Okay, let me see.
