Trained model on custom dataset. Though predicting only conll2003 entities. #52
Hi @ranjeetds, Line 156 in 0198457
Yes, I changed it to the following, which is what I wanted.
Check the model_config.json file in the saved model directory and verify that all the labels are present in the label_map.
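A minimal sketch of that check, assuming `label_map` in model_config.json is a JSON object that maps ids to label strings. The helper name `missing_labels` and the example labels are hypothetical, not part of the repository.

```python
import json
from pathlib import Path


def missing_labels(config_path, expected_labels):
    """Return the expected labels that do not appear in the saved label_map."""
    config = json.loads(Path(config_path).read_text(encoding="utf-8"))
    # Assumption: label_map maps ids to label strings, e.g. {"1": "O", "2": "B-PER"}.
    saved = set(config["label_map"].values())
    return sorted(set(expected_labels) - saved)
```

For example, `missing_labels("out_bert_large/model_config.json", ["O", "B-PER", "B-TIME"])` would return an empty list when every listed label was saved with the model. A non-empty result points at labels the model was never trained to emit.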
Checked. All the labels are in the model_config.json file's label_map.
How are the new entities distributed across the whole dataset?
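One way to answer this is to count the tags in the training file. The sketch below assumes a CoNLL-style layout, with one token per line, the tag in the last whitespace-separated column, and blank lines between sentences. The function name `tag_distribution` is hypothetical.

```python
from collections import Counter


def tag_distribution(lines):
    """Count how often each tag appears in CoNLL-style lines."""
    counts = Counter()
    for line in lines:
        parts = line.split()
        # Skip blank sentence separators and -DOCSTART- document markers.
        if not parts or parts[0] == "-DOCSTART-":
            continue
        # Assumption: the tag is the last column on the line.
        counts[parts[-1]] += 1
    return counts
```

Assuming the training file is `data/train.txt`, `tag_distribution(open("data/train.txt", encoding="utf-8")).most_common()` lists the tags from most to least frequent, which makes it easy to see whether the new entities are heavily outnumbered by `O` and the conll2003 tags.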
Hi, below is the distribution across the training dataset.
@kamalkraj did you try it on your machine? If so, could you please share the results for sentences containing other entities such as time, for example "Tomorrow i will write to Shyam regarding hurricane MAHA"?
The training data format was not correct. The new sentences you added were not split by |
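The comment above does not name the separator. CoNLL-2003 style files conventionally separate sentences with a blank line, so, assuming that is the convention at issue, a quick sanity check is to look for "sentences" that run far longer than expected. The helper `long_sentences` is hypothetical, and the default of 128 simply mirrors the `--max_seq_length` used for training (it counts tokens, not wordpieces).

```python
def long_sentences(lines, max_tokens=128):
    """Return (start_line, token_count) for sentences longer than max_tokens."""
    flagged = []
    start, count = None, 0
    for number, line in enumerate(lines, start=1):
        if line.strip():
            # Non-blank line: one token of the current sentence.
            if count == 0:
                start = number
            count += 1
        else:
            # Blank line: the current sentence ends here.
            if count > max_tokens:
                flagged.append((start, count))
            count = 0
    # Handle a final sentence that is not followed by a blank line.
    if count > max_tokens:
        flagged.append((start, count))
    return flagged
```

A long run of flagged lines in the newly added part of the file would suggest that several sentences were merged into one because the separators are missing.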
Hi @kamalkraj. Thank you for checking the dataset and pointing out the bug. However, I ran the model on the updated train file you shared, and the model's performance went to 0 for all classes on the evaluation set. The model now predicts every token as the 'O' entity. I am running 10 epochs, so do you think the model might be overfitting to the O class? If so, how do I reduce the model bias?
Hi @ranjeetds,
Hi @kamalkraj, thanks for the suggestions. I tried it on OntoNotes 5.0 and could reach the benchmark results. Just one last question: does your script support transfer learning from a custom-trained model? That is, suppose I train a model for 10 entities and store it in the 'out' directory. Now I get a new dataset with 5 different entities, and I train on it using 'out' as the pretrained model instead of the pretrained BERT model. Will this approach work and give better results? Or should I just train the model from scratch each time with the pretrained BERT model?
Hi @ranjeetds,
Could you please share the OntoNotes 5.0 results with me?
OntoNotes 5.0 full, or the CoNLL-2012 train/test/dev split?
OntoNotes 5.0 Full.
I am getting the following error while following the above steps. Traceback (most recent call last):
I have custom entity data covering around 8 entities. I combined that dataset with conll2003 (as I am also interested in the conll2003 entities) and trained the model. However, the trained model is unable to predict any entities outside conll2003. Could you please tell me if I am missing anything when training on a custom dataset?
I used the command below to train the model.
nohup python3 run_ner.py --data_dir=data --bert_model=bert-large-cased --task_name=ner --output_dir=out_bert_large --max_seq_length=128 --num_train_epochs 10 --do_train --do_eval --no_cuda --warmup_proportion=0.4 > log.txt &