This repository has been archived by the owner on Oct 31, 2023. It is now read-only.
Unable to train Supervised MT on En-De translation #79
Comments
Could you show what files are in the processed folder?
Hi @stanisz93 In
Same error here. Any update on how this could be fixed?
Hi, can you check which file is missing? Typically, if this line is failing: Try to replace it with:
And see the output?
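As a rough way to see which file is missing, the sketch below lists the binarized files a supervised `en-de` run would probably look for and prints the ones that are absent. The naming pattern (`{split}.{src}-{tgt}.{lang}.pth` for parallel data, `{split}.{lang}.pth` for monolingual data) is an assumption about the data loader, and `expected_files` and `missing_files` are hypothetical helper names, not part of the repository, so please check them against the loading code before relying on the result.

```python
import os


def expected_files(data_path, mono_langs, para_pairs,
                   splits=("train", "valid", "test")):
    """Build the list of binarized files we assume the loader needs.

    The file naming pattern here is an assumption, not a confirmed API.
    """
    paths = []
    for split in splits:
        # Monolingual files, e.g. train.en.pth (assumed pattern)
        for lang in mono_langs:
            paths.append(os.path.join(data_path, f"{split}.{lang}.pth"))
        # Parallel files, e.g. train.en-de.en.pth and train.en-de.de.pth
        for src, tgt in para_pairs:
            for lang in (src, tgt):
                name = f"{split}.{src}-{tgt}.{lang}.pth"
                paths.append(os.path.join(data_path, name))
    return paths


def missing_files(paths):
    """Return the paths that do not exist as regular files."""
    return [p for p in paths if not os.path.isfile(p)]


if __name__ == "__main__":
    # Supervised MT only needs the parallel pair, so no monolingual langs here.
    wanted = expected_files("./data/processed/en-de/", [], [("en", "de")])
    for path in missing_files(wanted):
        print("missing:", path)
```

If nothing is printed, every file matching the assumed pattern is present, and the problem is more likely in the naming itself or in the `--data_path` argument.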
Solved the issue. Thanks!
Did you reproduce the En-De results reported in the paper?
Hi,
Firstly, thanks for the release. I wanted to try the En-De MT task using a pretrained language model. The following is the command I used to train the language model:
python train.py --exp_name test_ende_mlm --dump_path ./dumped/ --data_path ./data/processed/en-de/ --lgs 'en-de' --clm_steps '' --mlm_steps 'en,de' --emb_dim 512 --n_layers 6 --n_heads 8 --dropout 0.1 --attention_dropout 0.1 --gelu_activation true --bptt 250 --batch_size 32 --optimizer adam,lr=0.0001 --epoch_size 200000 --validation_metrics _valid_mlm_ppl --stopping_criterion _valid_mlm_ppl,10
Next, as suggested by you here, I tokenized the parallel data files, applied BPE, and binarized them.
Then, to train the supervised MT model from the pretrained model, I used the following command:
python train.py --exp_name super_mt_en_de --dump_path ./dumped/ --reload_model 'dumped/test_ende_mlm/3rn075dpy4/best-valid_mlm_ppl.pth,dumped/test_ende_mlm/3rn075dpy4/best-valid_mlm_ppl.pth' --data_path ./data/processed/en-de/ --lgs 'en-de' --mt_steps "en-de" --encoder_only false --emb_dim 512 --n_layers 6 --n_heads 8 --dropout 0.1 --attention_dropout 0.1 --tokens_per_batch 4096 --bptt 250 --batch_size 32 --optimizer adam_inverse_sqrt,beta1=0.9,beta2=0.98,lr=0.0007 --epoch_size -1 --eval_bleu true --stopping_criterion 'valid_en-de_mt_bleu,10' --validation_metrics 'valid_en-de_mt_bleu' --gelu_activation true
But I get the following error
Can you tell me whether there are any additional steps I need to follow? I want to pretrain a model and then use it to train a supervised MT model. If I missed something, or if there is an end-to-end tutorial for supervised MT training, please let me know.
Thanks in advance.