
Trained model on custom dataset, but it predicts only CoNLL-2003 entities #52

Closed
ranjeetds opened this issue Nov 4, 2019 · 16 comments


@ranjeetds

I have a custom dataset with around 8 entity types. I combined it with the CoNLL-2003 data (since I am also interested in the CoNLL-2003 entities) and trained the model. However, the trained model is unable to predict any entities outside CoNLL-2003. Could you please help me figure out whether I am missing anything when training on a custom dataset?

I used the command below to train the model.

nohup python3 run_ner.py --data_dir=data --bert_model=bert-large-cased --task_name=ner --output_dir=out_bert_large --max_seq_length=128 --num_train_epochs 10 --do_train --do_eval --no_cuda --warmup_proportion=0.4 > log.txt &

@kamalkraj
Owner

Hi @ranjeetds,
Did you change `get_labels()`?

@ranjeetds
Author

ranjeetds commented Nov 4, 2019

Yes, I changed it to the list below, which covers the labels I wanted.

```python
# This function returns the list of labels present in the dataset
def get_labels(self):
    return ["O", "B-PER", "I-PER", "B-ORG", "I-ORG", "B-LOC", "I-LOC",
            "B-ART", "I-ART", "B-EVE", "I-EVE", "B-GPE", "I-GPE",
            "B-NAT", "I-NAT", "B-TIM", "I-TIM", "[CLS]", "[SEP]"]
```

@kamalkraj
Owner

Check the model_config.json file in the saved model directory and verify that all the labels are present in the label_map.
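
A quick way to verify this is a few lines of Python (a sketch; the path follows the --output_dir used above, and the "label_map" key name follows this repo's model_config.json convention, both assumptions here):

```python
import json

# Confirm every custom label made it into the saved label_map.
with open("out_bert_large/model_config.json") as f:
    config = json.load(f)

expected = {"B-ART", "I-ART", "B-EVE", "I-EVE", "B-GPE", "I-GPE",
            "B-NAT", "I-NAT", "B-TIM", "I-TIM"}
present = set(config["label_map"].values())
print("missing labels:", expected - present)
```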

@ranjeetds
Author

ranjeetds commented Nov 4, 2019

Checked. All the labels are in the model_config.json file's label_map.

@kamalkraj
Owner

What is the distribution of the new entities across the whole dataset?
If possible, share the dataset and I will try it on my machine.

@ranjeetds
Author

Hi,

Below is the distribution across the training dataset:

```
  14988
  47958
    402 B-ART
    308 B-EVE
  15870 B-GPE
  44784 B-LOC
    201 B-NAT
  26464 B-ORG
  23590 B-PER
  20333 B-TIM
    297 I-ART
    253 I-EVE
    198 I-GPE
   8571 I-LOC
     51 I-NAT
  20488 I-ORG
  21779 I-PER
   6528 I-TIM
1063025 O
```
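
For reference, a tag distribution like this can be reproduced with a short Python snippet (a sketch; it assumes the NER tag is the last whitespace-separated column on every non-blank line, as run_ner.py expects):

```python
from collections import Counter

# Count how often each NER tag occurs in the training file.
counts = Counter(line.split()[-1]
                 for line in open("train.txt")
                 if line.strip())
for tag, n in sorted(counts.items()):
    print(f"{n:7d} {tag}")
```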
I am also attaching the files. The valid and test sets are the same as yours.

test.txt
train.txt
valid.txt

@ranjeetds
Author

@kamalkraj did you try this on your machine? If yes, could you please share the results on sentences containing the other entities, such as time, for example "Tomorrow I will write to Shyam regarding hurricane MAHA"?

@kamalkraj
Owner

The training data format was not correct: the new sentences you added were not separated by \n.
Here is the corrected train data:
train.txt
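
A minimal check along these lines can catch missing sentence separators (a sketch; it assumes the blank-line-separated, one-token-per-line layout that run_ner.py reads, and the token threshold is an arbitrary choice):

```python
# Flag suspiciously long "sentences" in a CoNLL-style file; they usually
# mean a blank-line separator between sentences is missing.
def check_separators(path, max_tokens=150):
    length = 0
    for lineno, line in enumerate(open(path), 1):
        if not line.strip():
            length = 0
            continue
        length += 1
        if length == max_tokens:
            print(f"line {lineno}: sentence reached {max_tokens} tokens; "
                  "is a blank line missing?")

check_separators("train.txt")
```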

Training metrics on a bert-base model:
[Screenshot: training metrics, 2019-11-06]

@ranjeetds
Author

ranjeetds commented Nov 7, 2019

Hi @kamalkraj. Thank you for checking the dataset and pointing out the bug. However, when I ran the model on the updated train file you shared, performance dropped to 0 for all classes on the evaluation set. The model now predicts every token as 'O'.

I am running 10 epochs, so do you think the model might be overfitting to the 'O' class? If so, how do I reduce that bias?

@kamalkraj
Copy link
Owner

Hi @ranjeetds,

  1. Don't run the model for 10 epochs; it will only cause overfitting.
  2. Downsample the CoNLL-2003 entities, because the distribution of the new entities you added is very small compared to CoNLL-2003 (one possible approach is sketched below).
  3. Train for 3-5 epochs with a learning rate in {2e-5, 3e-5, 5e-5}.
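
One possible way to do the downsampling in point 2 (a sketch, not code from this repo; the tag set, keep fraction, and file names are assumptions to tune for your data):

```python
import random

NEW_TAGS = {"ART", "EVE", "GPE", "NAT", "TIM"}  # the custom entity types
KEEP_FRACTION = 0.3  # hypothetical ratio; tune it against the dev set

def sentences(path):
    """Yield blank-line-separated sentences as lists of raw lines."""
    sent = []
    for line in open(path):
        if line.strip():
            sent.append(line)
        elif sent:
            yield sent
            sent = []
    if sent:
        yield sent

random.seed(0)
with open("train_downsampled.txt", "w") as out:
    for sent in sentences("train.txt"):
        # Keep every sentence containing a custom entity; subsample the rest.
        types = {l.split()[-1].split("-")[-1] for l in sent}
        if types & NEW_TAGS or random.random() < KEEP_FRACTION:
            out.writelines(sent)
            out.write("\n")
```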

@ranjeetds
Author

Hi @kamalkraj, thanks for the suggestions. I tried them on OntoNotes 5.0 and was able to reach the benchmarks.

Just one last question.

Does your script support transfer learning from a custom-trained model? Say I train a model for 10 entities and store it in an 'out' directory.

Now I get a new dataset with 5 different entities. If I train on it using 'out' as the pretrained model instead of the pretrained BERT model, will that approach work and give better results? Or should I train from scratch each time with the pretrained BERT model?

@kamalkraj
Owner

kamalkraj commented Nov 12, 2019

Hi @ranjeetds,

  • Transfer learning on a custom-trained model is not supported, and I wouldn't recommend doing it.
  • When you add new entities or remove existing ones, the dimensions of the output layer change, so you can only reuse the BERT encoder from the custom-trained model (see the sketch below).
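
To illustrate reusing only the encoder when the label set changes (a sketch, not this repo's code; the checkpoint path, the `classifier` key prefix, and the label count are assumptions):

```python
import torch
from transformers import BertForTokenClassification

# Load the custom-trained checkpoint and drop the old classification
# head, whose dimensions were sized for the previous label set.
state = torch.load("out/pytorch_model.bin", map_location="cpu")
encoder_state = {k: v for k, v in state.items()
                 if not k.startswith("classifier")}

# Fresh model sized for the new label list; strict=False leaves the new
# classifier layer randomly initialized.
model = BertForTokenClassification.from_pretrained(
    "bert-base-cased", num_labels=13)  # 13 is a placeholder label count
missing, _ = model.load_state_dict(encoder_state, strict=False)
print("randomly initialized:", missing)  # should list only classifier weights
```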

Could you please share the OntoNotes 5.0 results with me?

@ranjeetds
Author

ranjeetds commented Nov 12, 2019

Hi,

Sure, I will use the BERT encoder from the custom-trained model for fine-tuning.

Below are the results on the OntoNotes 5.0 dataset. SOTA is around 89% F1; with BERT, people have reported F1 around 83% to 85%.

[Screenshot: OntoNotes 5.0 evaluation output]

@kamalkraj
Owner

kamalkraj commented Nov 12, 2019

OntoNotes 5.0 full, or the CoNLL-2012 train/test/dev split?

@ranjeetds
Author

OntoNotes 5.0 full.
I used https://github.com/yuchenlin/OntoNotes-5.0-NER-BIO to convert it to BIO-tagged format.

@Jeetkarsh

I am getting the following error while following the above steps.

```
Traceback (most recent call last):
  File "run_ner.py", line 594, in <module>
    main()
  File "run_ner.py", line 582, in main
    temp_2.append(label_map[logits[i][j]])
KeyError: 0
```
