
Finetuning on HF using Seq2SeqTrainer. #2

Closed
varadhbhatnagar opened this issue Jun 10, 2022 · 1 comment

Comments

@varadhbhatnagar

I need to fine-tune csebuetnlp/mT5_m2m_crossSum using the Seq2SeqTrainer on Hugging Face. What special symbols need to be inserted when tokenizing input-output pairs?

@abhik1505040
Collaborator

Hi, we use language-specific bos tokens only for the decoder; the encoder input needs no special symbols. The language-to-bos-token mapping is stored in the model config. Please have a look at the code snippet given on the model page to see how you can access it. If you want to readily use our scripts to further fine-tune this model on your own data, just replace google/mt5-base with csebuetnlp/mT5_m2m_crossSum on this line
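To make the answer concrete, here is a minimal sketch of the preprocessing it describes: the tokenized target summary gets the target language's bos token prepended before being used as decoder labels. The token ids and the `LANG_BOS_ID` dict below are placeholders for illustration only; on the real checkpoint the language-to-bos-token mapping should be read from the model config (see the snippet on the model page for the exact key), not hard-coded.

```python
# Placeholder language -> decoder bos-token-id mapping. On the actual
# csebuetnlp/mT5_m2m_crossSum checkpoint, read this mapping from the model
# config instead of hard-coding it (the ids below are NOT the real ones).
LANG_BOS_ID = {"english": 250001, "bengali": 250002}


def build_labels(summary_token_ids, target_lang):
    """Prepend the target language's bos token id to the tokenized summary.

    The decoder sees the language-specific bos token as its first symbol;
    the encoder input is tokenized normally with no special symbols added.
    (With mT5's tokenizer, the eos token is already appended by the
    tokenizer itself, so it is not added here.)
    """
    return [LANG_BOS_ID[target_lang]] + list(summary_token_ids)


print(build_labels([17, 42, 99], "english"))  # placeholder bos id first
```

These label sequences can then be fed to Seq2SeqTrainer as usual; whether the bos token is prepended to the labels or supplied via `decoder_start_token_id` is a design choice to verify against the authors' scripts.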
