Is it possible to apply the same logic to RoBERTa for the MaskedLM model? I need it for pretraining on a custom dataset with long texts. Thanks!

Hi, the method we use here is applied to an already pre-trained model during the fine-tuning stage; I am not sure whether it can be applied during pre-training. If you want to pre-train from scratch, it is probably better to use a model whose architecture is designed for longer texts, such as BigBird or Longformer, as in the sketch below.
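As a rough illustration (not the method from this repo), here is a minimal sketch of pre-training a Longformer from scratch with a masked-LM objective using Hugging Face `transformers` and `datasets`. The file name `my_long_texts.txt`, the sequence length, and the training hyperparameters are placeholders you would adapt to your own corpus and hardware.

```python
from datasets import load_dataset
from transformers import (
    DataCollatorForLanguageModeling,
    LongformerConfig,
    LongformerForMaskedLM,
    LongformerTokenizerFast,
    Trainer,
    TrainingArguments,
)

# Reuse the pretrained Longformer tokenizer for convenience;
# training your own tokenizer on the custom corpus is also an option.
tokenizer = LongformerTokenizerFast.from_pretrained("allenai/longformer-base-4096")

# Randomly initialized Longformer sized for 4096-token inputs
# (4096 + 2 positions for the RoBERTa-style position offset).
config = LongformerConfig(
    vocab_size=len(tokenizer),
    max_position_embeddings=4098,
    attention_window=512,
)
model = LongformerForMaskedLM(config)

# "my_long_texts.txt" is a placeholder for the custom long-text corpus.
dataset = load_dataset("text", data_files={"train": "my_long_texts.txt"})["train"]

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, max_length=4096)

tokenized = dataset.map(tokenize, batched=True, remove_columns=["text"])

# Standard masked-LM objective with 15% token masking.
collator = DataCollatorForLanguageModeling(tokenizer, mlm=True, mlm_probability=0.15)

trainer = Trainer(
    model=model,
    args=TrainingArguments(
        output_dir="longformer-mlm-pretrain",
        per_device_train_batch_size=1,
        num_train_epochs=1,
    ),
    train_dataset=tokenized,
    data_collator=collator,
)
trainer.train()
```

The same recipe would work with `BigBirdForMaskedLM` and `BigBirdConfig` if you prefer BigBird's sparse attention; only the config and model classes change.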