Doubts on the paper "universal transformers". #1215
Comments
I'm pretty sure there's a typo in equation 4.
@senarvi thanks, I think the same as you.
I believe Eq. 4 has a typo. Eq. 5 may have one as well, but it could also be a misinterpretation of Figure 4. I think you can check the code to figure it out.
Yes! there are small typos as well as a problem in fig4 in the current arXiv version of the paper. We'll update it soon. In the meantime, you can check the slides here and, as always, a better way to understand what's going on exactly is digging into the code :) |
@MostafaDehghani Very lucky to have the slides, thanks!
Hi @MostafaDehghani, thank you for the slides! They are really helpful. On a side note, may I ask whether UT and the Transformer both use the default EN-DE data generator provided in the tensor2tensor library? I noticed the version is the same, but I want to be certain.
Yes, we used
thank you |
Thanks @MostafaDehghani and others. |
Description
The detailed Figure 4 in the appendix does not seem to follow the iterative equations (4) and (5) in the paper. If I follow the figure, it should be H^t = LayerNorm(A^t + Transition(A^t)), with A^t = LayerNorm(H^(t-1) + P^t + MultiHeadSelfAttention(H^(t-1) + P^t)). It is very confusing. Could anyone help me figure this out? Thank you!
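For what it's worth, the recurrence as read from the figure can be sketched in NumPy. This is only a toy illustration of the update order being discussed (residual + LayerNorm after attention, then residual + LayerNorm after the transition), not the actual tensor2tensor implementation; the identity Q/K/V projections, the single linear `transition`, and the function names are all simplifications I've invented for clarity.

```python
import numpy as np

def layer_norm(x, eps=1e-6):
    # Normalize each position's vector over the feature dimension.
    mean = x.mean(axis=-1, keepdims=True)
    std = x.std(axis=-1, keepdims=True)
    return (x - mean) / (std + eps)

def self_attention(x):
    # Single-head scaled dot-product self-attention with identity
    # Q/K/V projections (a stand-in for the learned projections).
    d = x.shape[-1]
    scores = x @ x.T / np.sqrt(d)
    scores -= scores.max(axis=-1, keepdims=True)  # numerical stability
    weights = np.exp(scores)
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ x

def transition(x, w):
    # Toy position-wise transition (one linear map here; the paper
    # uses a feed-forward block or a separable convolution).
    return x @ w

def ut_step(h_prev, p_t, w):
    # One step in the order the figure suggests:
    #   X   = H^{t-1} + P^t            (add the coordinate embedding)
    #   A^t = LayerNorm(X + SelfAttention(X))
    #   H^t = LayerNorm(A^t + Transition(A^t))
    x = h_prev + p_t
    a_t = layer_norm(x + self_attention(x))
    h_t = layer_norm(a_t + transition(a_t, w))
    return h_t
```

Running `ut_step` for T steps with a fresh P^t each step gives the full recurrence; comparing this order against equations (4)-(5) is where the mismatch shows up.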