
Is it possible/is there a plan to enable continued pretraining? #1547

Closed
oligiles0 opened this issue Oct 17, 2019 · 4 comments

@oligiles0

🚀 Feature

A standardised interface for continuing the pretraining of the various Transformer models, with standardised expectations for how the training data should be formatted.

Motivation

To achieve state of the art within a given domain, it is not sufficient to take models pretrained on non-specific literature (Wikipedia, books, etc.). The ideal situation would be to leverage all the compute already put into that pretraining and then train further on domain literature before fine-tuning on a specific task. The great strength of this library is its standard interface to new SOTA models, and it would be very helpful if this were extended to cover further pretraining, to help rapidly push domain SOTAs.

@enzoampil
Contributor

enzoampil commented Oct 18, 2019

Hi @oligiles0, you can actually use run_lm_finetuning.py for this. You can find more details in the "RoBERTa/BERT and masked language modeling" section of the README.
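
For readers landing here later, here is a minimal sketch of continued masked-LM pretraining with the library's Python API. It uses the Trainer utilities from later releases rather than the run_lm_finetuning.py script itself, and the checkpoint name, the domain_corpus.txt path, and the hyperparameters are illustrative assumptions, not values taken from the script:

```python
from transformers import (
    AutoModelForMaskedLM,
    AutoTokenizer,
    DataCollatorForLanguageModeling,
    LineByLineTextDataset,
    Trainer,
    TrainingArguments,
)

# Start from a published checkpoint so its general-domain pretraining is reused.
tokenizer = AutoTokenizer.from_pretrained("roberta-base")
model = AutoModelForMaskedLM.from_pretrained("roberta-base")

# domain_corpus.txt is a placeholder: raw in-domain text, one passage per line.
train_dataset = LineByLineTextDataset(
    tokenizer=tokenizer, file_path="domain_corpus.txt", block_size=128
)

# Dynamic masking with the same MLM objective used for the original pretraining.
data_collator = DataCollatorForLanguageModeling(
    tokenizer=tokenizer, mlm=True, mlm_probability=0.15
)

training_args = TrainingArguments(
    output_dir="roberta-domain",
    num_train_epochs=1,
    per_device_train_batch_size=8,
)

trainer = Trainer(
    model=model,
    args=training_args,
    data_collator=data_collator,
    train_dataset=train_dataset,
)
trainer.train()

# The saved checkpoint can then be loaded with from_pretrained()
# for fine-tuning on the downstream task.
trainer.save_model("roberta-domain")
```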

@oligiles0
Author

oligiles0 commented Oct 21, 2019

> Hi @oligiles0, you can actually use run_lm_finetuning.py for this. You can find more details in the "RoBERTa/BERT and masked language modeling" section of the README.

Thanks very much, @enzoampil. Is there a reason this uses a single text file rather than a folder of text files? I wouldn't want to combine multiple documents, because some chunks would then cross document boundaries and interfere with training, but I also wouldn't want to rerun the script for each individual document.
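
For anyone with the same concern, one possible workaround is to build the training examples per document yourself and feed them to whatever training loop you use, so no block ever spans two files. The sketch below assumes each document lives in its own .txt file; the class name, folder layout, and checkpoint are hypothetical and not part of the script:

```python
import glob

import torch
from torch.utils.data import Dataset
from transformers import AutoTokenizer


class PerDocumentBlockDataset(Dataset):
    """Fixed-length training blocks from a folder of .txt files, never crossing a file boundary."""

    def __init__(self, folder, tokenizer, block_size=128):
        self.examples = []
        for path in sorted(glob.glob(f"{folder}/*.txt")):
            with open(path, encoding="utf-8") as f:
                token_ids = tokenizer.encode(f.read(), add_special_tokens=True)
            # Chunk each document on its own; a leftover tail shorter than
            # block_size is dropped instead of being merged with the next file.
            for start in range(0, len(token_ids) - block_size + 1, block_size):
                self.examples.append(torch.tensor(token_ids[start : start + block_size]))

    def __len__(self):
        return len(self.examples)

    def __getitem__(self, idx):
        return self.examples[idx]


# Usage (folder name and checkpoint are placeholders); the resulting dataset
# can be passed to a masked-LM training loop together with an MLM data collator.
tokenizer = AutoTokenizer.from_pretrained("roberta-base")
dataset = PerDocumentBlockDataset("domain_docs", tokenizer, block_size=128)
```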

@iedmrc
Contributor

iedmrc commented Dec 4, 2019

> Thanks very much, @enzoampil. Is there a reason this uses a single text file rather than a folder of text files? I wouldn't want to combine multiple documents, because some chunks would then cross document boundaries and interfere with training, but I also wouldn't want to rerun the script for each individual document.

Please check #1896 (comment)

@stale

stale bot commented Feb 2, 2020

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.
