IWSLT'14 Results using ESPnet2-MT #4132

pyf98 · 2022-03-04T18:48:08Z

Hi, I tried some configs for IWSLT'14 De-En using ESPnet2-MT. I followed the standard Transformer config in fairseq, but the batching method is different. I think my effective batch size is a few times larger than fairseq's, and my learning rate and warmup steps are also larger. The current script for BLEU calculation also differs from the standard script for this dataset.

The best result is reported in README.md. I also added two options in mt/espnet_model.py to allow weight tying between the input embedding and the output linear layer in the encoder and decoder.
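For readers unfamiliar with weight tying, here is a minimal, framework-free sketch of the idea: the output projection reuses the embedding table instead of holding its own matrix. The class and function names are hypothetical and are not the options added in mt/espnet_model.py; in a PyTorch model the same effect is usually obtained by having both modules share one parameter tensor.

```python
import random


def make_embedding(vocab_size, dim, seed=0):
    """Create a small random embedding table (vocab_size x dim)."""
    rng = random.Random(seed)
    return [[rng.uniform(-0.1, 0.1) for _ in range(dim)] for _ in range(vocab_size)]


class TiedOutputLayer:
    """Hypothetical output layer whose weight IS the embedding table."""

    def __init__(self, embedding):
        self.weight = embedding  # shared reference, not a copy

    def logits(self, hidden):
        # One score per vocabulary entry: dot(hidden, embedding_row).
        return [sum(h * w for h, w in zip(hidden, row)) for row in self.weight]


embedding = make_embedding(vocab_size=5, dim=4)
output_layer = TiedOutputLayer(embedding)

hidden = [1.0, 0.0, 0.0, 0.0]
before = output_layer.logits(hidden)

# Simulate a parameter update on the embedding; the output layer sees it too.
embedding[2][0] += 1.0
after = output_layer.logits(hidden)
```

Because both sides point at the same table, tying roughly halves the parameters spent on the vocabulary, which is one reason it is common in Transformer MT setups with a joint vocabulary.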

We probably need to test the implementation further on other datasets.

pyf98 · 2022-03-04T18:50:40Z

Hi @siddalmia @ftshijt @brianyan918 Could you check this PR? If the changes are reasonable, we can continue the experiments.

brianyan918 · 2022-03-04T18:53:46Z

Looks good! I believe your initial result on this dataset is reasonable as well.

codecov · 2022-03-04T19:07:28Z

Codecov Report

Merging #4132 (11e3e7c) into master (a04a98c) will not change coverage.
The diff coverage is n/a.

@@           Coverage Diff           @@
##           master    #4132   +/-   ##
=======================================
  Coverage   80.43%   80.43%           
=======================================
  Files         442      442           
  Lines       38557    38557           
=======================================
  Hits        31015    31015           
  Misses       7542     7542

Flag	Coverage Δ
test_integration_espnet1	`67.13% <ø> (ø)`
test_integration_espnet2	`51.14% <ø> (ø)`
test_python	`66.51% <ø> (ø)`
test_utils	`24.45% <ø> (ø)`

Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update a04a98c...11e3e7c.

siddalmia · 2022-03-04T19:20:33Z

Perfect, thank you @pyf98! Some minor comments:

README: Can you mention BLEU-4, and also whether the scoring was done on detokenized cased, uncased, or tokenized outputs?
The issue that you fixed at line 515 of mt.sh also exists in st.sh. Can you fix it there as well?

sw005320 · 2022-03-04T19:29:23Z

I will leave the detailed reviews to @siddalmia, @ftshijt, and @brianyan918.
My comment is that we should add this result to https://github.com/espnet/espnet/blob/master/README.md#mt-results

pyf98 · 2022-03-04T20:35:55Z

Thanks for your comments. I'm fixing them now.

BTW, I have a question about the generated hypotheses. In some examples (131 out of 6750 in the test set), there is a trailing <sos/eos> as shown below:

They &apos;re even hurting .<sos/eos>	(They-utt000382)
I &apos;m a writer .<sos/eos>	(I-utt000553)
Thank you .<sos/eos>	(Thank-utt000722)

If we directly use this result to calculate BLEU, will it adversely affect the performance?
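One way to sidestep the question is to strip the token before scoring. The following is a minimal sketch, assuming each hypothesis line has the form `text<TAB>(utt-id)` as in the examples above; the function name is hypothetical and this is not code from the recipe.

```python
def strip_trailing_token(hyp_line, token="<sos/eos>"):
    """Remove a trailing special token from the text part of a hypothesis line."""
    text, sep, utt_id = hyp_line.rpartition("\t")
    if not sep:
        # No tab found: treat the whole line as text with no utterance id.
        text, utt_id = hyp_line, ""
    text = text.rstrip()
    while text.endswith(token):
        text = text[: -len(token)].rstrip()
    return text + sep + utt_id


lines = [
    "They &apos;re even hurting .<sos/eos>\t(They-utt000382)",
    "Thank you .<sos/eos>\t(Thank-utt000722)",
    "Hello .\t(Hello-utt000001)",  # hypothetical line without the token
]
cleaned = [strip_trailing_token(line) for line in lines]
num_changed = sum(1 for old, new in zip(lines, cleaned) if old != new)
```

Counting `num_changed` on the real hypothesis file would be a quick check against the 131 affected lines mentioned above.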

brianyan918 · 2022-03-04T21:02:53Z

I think it would. Not sure that sacrebleu knows to ignore that.
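To make the concern concrete, here is a simplified sketch of clipped n-gram precision on whitespace tokens. It is not sacrebleu and does not reproduce its tokenization; the reference string is a hypothetical stand-in, since the actual reference is not shown in this thread.

```python
from collections import Counter


def ngrams(tokens, n):
    """Count all n-grams of order n in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def clipped_precision(hyp, ref, n):
    """Fraction of hypothesis n-grams that also occur in the reference (clipped)."""
    hyp_counts = ngrams(hyp.split(), n)
    ref_counts = ngrams(ref.split(), n)
    total = sum(hyp_counts.values())
    if total == 0:
        return 0.0
    matched = sum(min(count, ref_counts[gram]) for gram, count in hyp_counts.items())
    return matched / total


reference = "Thank you ."           # hypothetical reference
clean_hyp = "Thank you ."
dirty_hyp = "Thank you .<sos/eos>"  # final token no longer matches "."

clean_p1 = clipped_precision(clean_hyp, reference, 1)
dirty_p1 = clipped_precision(dirty_hyp, reference, 1)
dirty_p2 = clipped_precision(dirty_hyp, reference, 2)
```

Under whitespace tokenization the fused token `.<sos/eos>` misses the final unigram and every higher-order n-gram that ends on it, so short sentences are hit hardest. Whether a real scorer behaves the same way depends on its tokenizer.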

mergify · 2022-03-04T22:55:29Z

This pull request is now in conflict :(

pyf98 · 2022-03-04T23:16:09Z

Hi @siddalmia, about the second point, I checked the st.sh for the token_joint part (https://github.com/espnet/espnet/blob/master/egs2/TEMPLATE/st1/st.sh#L745).

It seems that the source text is not combined with the target text if token_joint is true, so the same problem doesn't exist. But I'm not sure if this is the expected behavior for ST or this is actually a bug. Maybe @ftshijt could compare mt.sh and st.sh for the generation of token_list.
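As a rough illustration of why the combination matters, the sketch below builds a token list from source text alone and from source plus target text. Whitespace words stand in for BPE units, the sentences are made up, and `build_token_list` is a hypothetical helper, not the logic in mt.sh or st.sh.

```python
from collections import Counter


def build_token_list(lines):
    """Return tokens sorted by descending frequency, then alphabetically."""
    counts = Counter(tok for line in lines for tok in line.split())
    return sorted(counts, key=lambda tok: (-counts[tok], tok))


src_text = ["das ist gut", "das Haus"]       # made-up source sentences
tgt_text = ["that is good", "the house"]     # made-up target sentences

src_only = build_token_list(src_text)
joint = build_token_list(src_text + tgt_text)

# Target tokens that a source-only list cannot represent.
tgt_tokens = {tok for line in tgt_text for tok in line.split()}
missing = sorted(tgt_tokens - set(src_only))
```

With a joint token list every target token is covered, while a list built from one side only would map the tokens in `missing` to an unknown symbol.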

pyf98 · 2022-03-05T00:42:58Z

I have another question about the encoder input embedding layer. Where should we put the embedding layer, which is a torch.nn.Embedding instance followed by pos_enc_class? According to the current implementation, there seem to be two options:

1. Put it in the frontend (Sid added a new type of frontend as embedding, and also modified transformer_encoder). But currently it only supports the transformer encoder and absolute positional encoding.
2. Put it in the embed of the encoder. This requires some modifications to mt/espnet_model.py.

Current results and the weight tying implementation are based on the first option.
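For reference, the layer in question can be sketched without any framework as an embedding lookup followed by an absolute positional encoding. This assumes the sinusoidal encoding and the sqrt(dim) scaling from the original Transformer paper; the actual pos_enc_class may differ, and both function names are hypothetical.

```python
import math


def sinusoidal_position(pos, dim):
    """Sinusoidal encoding for one position: sin on even indices, cos on odd."""
    vec = []
    for i in range(dim):
        angle = pos / (10000 ** (2 * (i // 2) / dim))
        vec.append(math.sin(angle) if i % 2 == 0 else math.cos(angle))
    return vec


def embed_with_positions(token_ids, table):
    """Look up each token, scale by sqrt(dim), and add its position encoding."""
    dim = len(table[0])
    scale = math.sqrt(dim)
    output = []
    for pos, tok in enumerate(token_ids):
        pe = sinusoidal_position(pos, dim)
        output.append([scale * e + p for e, p in zip(table[tok], pe)])
    return output


# Tiny made-up table: token 0 is all zeros, token 1 is all ones.
table = [[0.0] * 4, [1.0] * 4]
encoded = embed_with_positions([1, 0], table)
```

Whichever option is chosen, the table used here is the one that weight tying would need to reach, so its location determines how much mt/espnet_model.py has to know about the encoder internals.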

ftshijt · 2022-03-05T04:01:22Z

> Hi @siddalmia, about the second point, I checked the st.sh for the token_joint part (https://github.com/espnet/espnet/blob/master/egs2/TEMPLATE/st1/st.sh#L745).
>
> It seems that the source text is not combined with the target text if token_joint is true, so the same problem doesn't exist. But I'm not sure if this is the expected behavior for ST or this is actually a bug. Maybe @ftshijt could compare mt.sh and st.sh for the generation of token_list.

I think it is actually a bug that should be fixed. Could you help me combine the tgt_text ahead of time?

ftshijt · 2022-03-06T22:17:00Z

Since the PR already looks great (and could also be beneficial for upcoming deadlines), I will merge the PR now. @pyf98 could you also address the bug in the ST implementation in another PR?

pyf98 · 2022-03-06T22:18:00Z

Cool! Thanks @ftshijt

pyf98 · 2022-03-07T05:25:59Z

Hi @ftshijt I made a new PR for the ST issue: #4143

pyf98 added 9 commits February 27, 2022 23:30

fix an issue with token_joint in mt.sh

01975b5

init transformer config that works (at least) until training

3e106c3

update config for token_joint, bpe10k

d5e3d7a

support shared embeddings: encoder, decoder, input, output

5eec47e

update decode

98c1951

update decode config

288e3d7

update training config for transformer

a78d9a8

update results

6d64c90

apply black and reformat the string

4f141eb

mergify bot added ESPnet2 README labels Mar 4, 2022

update readme.md

6acc45a

update the overall README.md

4877cd4

mergify bot added the conflicts label Mar 4, 2022

resolve conflict in README.md

11e3e7c

mergify bot removed the conflicts label Mar 4, 2022

sw005320 added the MT Machine translation label Mar 5, 2022

sw005320 added this to the v.0.10.7 milestone Mar 5, 2022

sw005320 added the Recipe label Mar 5, 2022

ftshijt merged commit bfb23b8 into espnet:master Mar 6, 2022

pyf98 deleted the mt branch March 6, 2022 22:18
