Chainer CSJ results #23

kan-bayashi · 2017-12-20T12:05:15Z

I finished testing CSJ recipe.

exp/train_nodup_vggblstmp_e4_subsample1_2_2_1_1_unit320_proj320_d1_unit300_location_aconvc10_aconvf100_mtlalpha0.5_adadelta_bs30_mli800_mlo150/decode_eval1_beam20_eacc.best_p0_len0.0-0.8/result.txt:
|        Sum/Avg         |        1272                 43897        |        84.9                  6.2                   8.9                  1.4                 16.5                 70.6        |
exp/train_nodup_vggblstmp_e4_subsample1_2_2_1_1_unit320_proj320_d1_unit300_location_aconvc10_aconvf100_mtlalpha0.5_adadelta_bs30_mli800_mlo150/decode_eval2_beam20_eacc.best_p0_len0.0-0.8/result.txt:
|        Sum/Avg         |        1292                 43623        |        89.2                  5.0                   5.8                  1.0                 11.7                 65.9        |
exp/train_nodup_vggblstmp_e4_subsample1_2_2_1_1_unit320_proj320_d1_unit300_location_aconvc10_aconvf100_mtlalpha0.5_adadelta_bs30_mli800_mlo150/decode_eval3_beam20_eacc.best_p0_len0.0-0.8/result.txt:
|        Sum/Avg         |        1385                 28225        |        89.3                  5.6                   5.1                  1.6                 12.3                 53.8        |

The text was updated successfully, but these errors were encountered:

sw005320 · 2017-12-20T13:43:01Z

Thanks.
Can you change --maxlenratio 0.5 --minlenratio 0.1?

kan-bayashi · 2017-12-20T23:14:15Z

Here is the results.
Slightly worse than the results in papers.

$ grep -e Avg -e SPRK -m 2 exp/train_nodup_vggblstmp_e4_subsample1_2_2_1_1_unit320_proj320_d1_unit300_location_aconvc10_aconvf100_mtlalpha0.5_adadelta_bs30_mli800_mlo150/decode_eval*_beam20_eacc.best_p0_len0.1-0.5/result.txt
exp/train_nodup_vggblstmp_e4_subsample1_2_2_1_1_unit320_proj320_d1_unit300_location_aconvc10_aconvf100_mtlalpha0.5_adadelta_bs30_mli800_mlo150/decode_eval1_beam20_eacc.best_p0_len0.1-0.5/result.txt:
|        Sum/Avg         |        1272                 43897        |        90.1                  7.0                   2.9                  1.6                 11.4                 70.6        |
exp/train_nodup_vggblstmp_e4_subsample1_2_2_1_1_unit320_proj320_d1_unit300_location_aconvc10_aconvf100_mtlalpha0.5_adadelta_bs30_mli800_mlo150/decode_eval2_beam20_eacc.best_p0_len0.1-0.5/result.txt:
|        Sum/Avg         |        1292                 43623        |        93.2                  5.3                   1.5                  1.0                  7.8                 66.0        |
exp/train_nodup_vggblstmp_e4_subsample1_2_2_1_1_unit320_proj320_d1_unit300_location_aconvc10_aconvf100_mtlalpha0.5_adadelta_bs30_mli800_mlo150/decode_eval3_beam20_eacc.best_p0_len0.1-0.5/result.txt:
|        Sum/Avg         |        1385                 28225        |        92.2                  5.9                   1.9                  1.7                  9.5                 53.9        |

I will try to use penalty=0.1

kan-bayashi · 2017-12-21T01:52:50Z

Results with maxlenratio=0.5 & minlenratio=0.1 & penalty=0.1.
Slightly improved.

$ grep -e Avg -e SPRK -m 2 exp/train_nodup_vggblstmp_e4_subsample1_2_2_1_1_unit320_proj320_d1_unit300_location_aconvc10_aconvf100_mtlalpha0.5_adadelta_bs30_mli800_mlo150/decode_eval*_beam20_eacc.best_p0.1_len0.1-0.5/result.txt
exp/train_nodup_vggblstmp_e4_subsample1_2_2_1_1_unit320_proj320_d1_unit300_location_aconvc10_aconvf100_mtlalpha0.5_adadelta_bs30_mli800_mlo150/decode_eval1_beam20_eacc.best_p0.1_len0.1-0.5/result.txt:
|        Sum/Avg         |        1272                  43897        |        90.3                  7.0                   2.6                  1.6                 11.3                  70.5        |
exp/train_nodup_vggblstmp_e4_subsample1_2_2_1_1_unit320_proj320_d1_unit300_location_aconvc10_aconvf100_mtlalpha0.5_adadelta_bs30_mli800_mlo150/decode_eval2_beam20_eacc.best_p0.1_len0.1-0.5/result.txt:
|        Sum/Avg         |        1292                  43623        |        93.3                  5.3                   1.4                  1.1                  7.8                  66.1        |
exp/train_nodup_vggblstmp_e4_subsample1_2_2_1_1_unit320_proj320_d1_unit300_location_aconvc10_aconvf100_mtlalpha0.5_adadelta_bs30_mli800_mlo150/decode_eval3_beam20_eacc.best_p0.1_len0.1-0.5/result.txt:
|        Sum/Avg         |        1385                  28225        |        92.6                  6.0                   1.5                  1.8                  9.2                  53.9        |

sw005320 · 2017-12-21T01:56:32Z

Thanks.
Please write them in RESULTS, and commit it.
It's good to know that our initial results are not so crazy.

Sync to upstream/master

Speaker

kan-bayashi mentioned this issue Dec 21, 2017

Added CSJ result #24

Merged

sw005320 closed this as completed Dec 21, 2017

sw005320 pushed a commit that referenced this issue Jan 15, 2021

Merge pull request #23 from espnet/master

8a06381

Sync to upstream/master

zzxiang mentioned this issue Oct 30, 2021

Conformer FastSpeech2 fine-tuning suspended at 24epoch #3721

Closed

himanshucodz55 mentioned this issue Jul 24, 2022

RuntimeError: [1] is setting up NCCL communicator and retreiving ncclUniqueId from [0] via c10d key-value store by key '0', but store->get('0') got error: Timeout waiting for key: default_pg/0/0 after 1800000 ms #4531

Open

mergify bot pushed a commit that referenced this issue Jul 21, 2023

Merge pull request #23 from Jungjee/speaker

7411f5d

Speaker

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Chainer CSJ results #23

Chainer CSJ results #23

kan-bayashi commented Dec 20, 2017

sw005320 commented Dec 20, 2017

kan-bayashi commented Dec 20, 2017 •

edited

Loading

kan-bayashi commented Dec 21, 2017

sw005320 commented Dec 21, 2017

Chainer CSJ results #23

Chainer CSJ results #23

Comments

kan-bayashi commented Dec 20, 2017

sw005320 commented Dec 20, 2017

kan-bayashi commented Dec 20, 2017 • edited Loading

kan-bayashi commented Dec 21, 2017

sw005320 commented Dec 21, 2017

kan-bayashi commented Dec 20, 2017 •

edited

Loading