This repository has been archived by the owner on Mar 3, 2024. It is now read-only.
finetune BERT with custom dataset #20
There is a test case that trains the model: Lines 26 to 93 in 02c7eb2
However, I recommend training with the official implementation and then loading the checkpoint, since the optimizer and the creation of sentence pairs are different.
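For anyone wondering what "creation of sentence pairs" involves when starting from a raw text file, here is a minimal sketch in plain Python. It is not the code from this repository or from the official implementation. It assumes naive splitting on periods and whitespace instead of a WordPiece tokenizer, and the function names (`make_sentence_pairs`, `mask_tokens`, `build_examples`) are made up for illustration. It follows the commonly described BERT pretraining recipe: roughly half the pairs use the true next sentence, and about 15% of tokens are selected for prediction (80% replaced by `[MASK]`, 10% by a random token, 10% left unchanged).

```python
import random


def make_sentence_pairs(sentences, rng):
    """Pair each sentence with its true successor or a random other sentence."""
    pairs = []
    n = len(sentences)
    for i in range(n - 1):
        # Candidates for a "not next" partner: anything except i and i + 1.
        candidates = [k for k in range(n) if k not in (i, i + 1)]
        if candidates and rng.random() < 0.5:
            j = rng.choice(candidates)
            pairs.append((sentences[i], sentences[j], False))
        else:
            pairs.append((sentences[i], sentences[i + 1], True))
    return pairs


def mask_tokens(tokens, vocab, rng, mask_prob=0.15):
    """Return (masked_tokens, labels); labels is None where nothing is predicted."""
    masked, labels = [], []
    for tok in tokens:
        if tok in ("[CLS]", "[SEP]") or rng.random() >= mask_prob:
            masked.append(tok)
            labels.append(None)
            continue
        labels.append(tok)  # the model must recover the original token here
        r = rng.random()
        if r < 0.8:
            masked.append("[MASK]")
        elif r < 0.9:
            masked.append(rng.choice(vocab))  # random replacement
        else:
            masked.append(tok)  # kept unchanged, but still predicted
    return masked, labels


def build_examples(text, seed=0):
    """Turn raw text into MLM + next-sentence examples (assumed toy tokenization)."""
    rng = random.Random(seed)
    # Assumption: sentences end with '.', tokens are whitespace-separated.
    sentences = [s.strip().lower().split() for s in text.split(".") if s.strip()]
    vocab = sorted({tok for sent in sentences for tok in sent})
    examples = []
    for first, second, is_next in make_sentence_pairs(sentences, rng):
        tokens = ["[CLS]"] + first + ["[SEP]"] + second + ["[SEP]"]
        segment_ids = [0] * (len(first) + 2) + [1] * (len(second) + 1)
        masked, labels = mask_tokens(tokens, vocab, rng)
        examples.append({
            "tokens": masked,
            "segment_ids": segment_ids,
            "mlm_labels": labels,
            "is_next": is_next,
        })
    return examples
```

To use it on a book, read the file into a string (for example `open(path, encoding="utf-8").read()`, where `path` is whatever your file is called) and pass that to `build_examples`. The resulting tokens would still need to be mapped to vocabulary ids and padded to a fixed length before they can be fed to a model.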
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.
Is your feature request related to a problem? Please describe.
I would like to fine-tune BERT (MLM, PairSentence) on a custom dataset, e.g. text extracted from a book.
Describe the solution you'd like
Describe alternatives you've considered
Which function can we feed a custom dataset into, for example a text file from a book?
Do we need to write a function to format the text file so that BERT can consume it?
Additional context