
Training with custom dataset #3

Closed
vivekam101 opened this issue Oct 24, 2019 · 4 comments

Comments

@vivekam101

Hi Kamal,
Can you please share how to do fine-tuning with a custom dataset?

@kamalkraj
Owner

The pre-trained model comes from https://github.com/huggingface/transformers.
You can refer to their documentation for fine-tuning on a custom dataset.
After fine-tuning, you can use this repo for inference; just point the code to the new model.
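
For anyone following along, here is a minimal sketch of what pointing the code at a fine-tuned model could look like (the directory path is hypothetical, and it assumes the fine-tuned model was saved with save_pretrained):

from transformers import BertTokenizer, BertForQuestionAnswering

# Hypothetical directory written by the transformers fine-tuning script
model_dir = "/path/to/finetuned-squad-model"

tokenizer = BertTokenizer.from_pretrained(model_dir)
model = BertForQuestionAnswering.from_pretrained(model_dir)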

@gurvesh

gurvesh commented Nov 7, 2019

Thanks for providing this, Kamal!

For anyone else interested, I was able to get better performance with the following model, which is provided directly by the transformers library (you need to edit bert.py):

from transformers import BertTokenizer, BertForQuestionAnswering

# `config` below is the BertConfig already set up in the repo's bert.py
tokenizer = BertTokenizer.from_pretrained('bert-large-uncased-whole-word-masking-finetuned-squad')
model = BertForQuestionAnswering.from_pretrained('bert-large-uncased-whole-word-masking-finetuned-squad', config=config)
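
As a follow-up, here is a minimal inference sketch with this model (the question and context strings are made up, and it assumes a recent transformers release where the model output exposes start_logits and end_logits as attributes):

import torch

question = "Who developed BERT?"
context = "BERT was developed by researchers at Google and released in 2018."

# Encode the question/context pair and run the model
inputs = tokenizer(question, context, return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs)

# Take the most likely start/end token positions and decode the answer span
start = torch.argmax(outputs.start_logits)
end = torch.argmax(outputs.end_logits) + 1
answer = tokenizer.decode(inputs["input_ids"][0][start:end])
print(answer)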

@kamalkraj
Owner

Hi @gurvesh,
I think this model was released by transformers only recently. Thanks for posting it here.
I will add this information to the README.

What are the EM and F1 scores of bert-large-uncased-whole-word-masking-finetuned-squad?

@gurvesh

gurvesh commented Nov 8, 2019

I think the scores are about the same. But in actual usage, when I put random articles through both models, I found the answers given by the finetuned-squad model were much better. YMMV.
