No such dataset implementation None #3

aquorio15 · 2022-08-02T10:32:35Z

Hi, I have been trying to run the code on the MTTT dataset as described in the paper. While loading the data during fairseq training, I get the following error: 'No such dataset implementation None'.

Any kind of help would be greatly appreciated.

Command line for training, in case I am doing something wrong:
CUDA_VISIBLE_DEVICES=0 fairseq-train data-bin/MMMT.tokenized.en-tr --task visual_text --source-lang en --target-lang tr --target-dict dict.tr.txt --arch visual_text_transformer --image-window 15 --image-stride 10 --image-font-path fairseq/data/visual/fonts/NotoSans-Regular.ttf --image-embed-normalize --image-embed-type 1layer --share-decoder-input-output-embed --optimizer adam --adam-betas '(0.9, 0.98)' --clip-norm 0.0 --lr 5e-4 --lr-scheduler inverse_sqrt --warmup-updates 4000 --dropout 0.3 --weight-decay 0.0001 --criterion label_smoothed_cross_entropy --label-smoothing 0.1 --max-tokens 4096 --max-epoch 50

esalesky · 2022-08-02T17:50:50Z

My guess is that you need to set the parameter --dataset-impl in your training command.

In the first line of the screenshot, dataset_impl=None. If you're using binarized data, set this to mmap. If you're using raw data (not binarized; the images are computed when the internal dataset representation is constructed, which is what I typically did for MTTT because it is relatively small), set it to raw.

If you still see an issue, please comment again and I'll do my best to help, and if that works, feel free to close!
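
As a sketch of the suggestion above (not part of the original reply): the fairseq command-line flag is spelled with a hyphen, --dataset-impl, while the logged config shows it as dataset_impl. The snippet below only assembles and prints the extra argument; the variable names are illustrative, and the remaining options would be the ones from the original command.

```shell
# Choose the dataset implementation explicitly instead of leaving it as None.
# Per the reply above: "raw" for non-binarized data, "mmap" for binarized data.
DATASET_IMPL="raw"

# Illustrative variable: the extra argument to append to the fairseq-train command.
EXTRA_ARGS="--dataset-impl ${DATASET_IMPL}"

# Print the start of the resulting command rather than launching training here;
# the rest of the options from the original command follow unchanged.
echo "fairseq-train data-bin/MMMT.tokenized.en-tr --task visual_text ${EXTRA_ARGS}"
```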

aquorio15 · 2022-08-03T03:59:32Z

Hi, thank you for the reply.
It is working now. Just a small question: did you try this model with a very small dataset (say, around 10,000 sentence pairs)?

Also, how did you evaluate your trained model?

esalesky · 2022-08-03T13:09:36Z

Great, glad to hear!

No, I did not try datasets smaller than MTTT. You may want to try adapting a model trained on more data.

We evaluated using BLEU as computed by sacrebleu, after first removing sentencepiece segmentation.
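
A minimal sketch of that evaluation, assuming one sentencepiece-segmented hypothesis per line in a hypothetical hyp.sp.txt and plain-text references in a hypothetical ref.txt. The desegment helper is illustrative and is not the authors' script.

```shell
# Undo sentencepiece segmentation: drop the spaces between pieces, turn the
# "▁" word-boundary marker back into a space, then trim the leading space.
desegment() {
  sed -e 's/ //g' -e 's/▁/ /g' -e 's/^ //'
}

# Example on a single segmented line.
echo "▁Hello ▁wor ld" | desegment   # prints: Hello world

# Scoring with the sacrebleu command-line tool (file names are hypothetical):
#   desegment < hyp.sp.txt > hyp.txt
#   sacrebleu ref.txt -i hyp.txt -m bleu
```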

esalesky closed this as completed Aug 3, 2022
