-
Notifications
You must be signed in to change notification settings - Fork 99
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Format for training custom NER classifiers #6
Comments
Hi @mrshu, |
Thanks for your answer @minhvannguyen! I did try to put a simple demo of https://colab.research.google.com/drive/1sgU0U42c1ipn6QbskFcRDs_tdvQgK7Gf It seems that everything indeed works but despite the loss getting close to 0 towards the second epoch for the respective training examples, the dev F1 score still reports 0. I suspect it may be connected to the format of the input files (now If I end up getting it to work, I'd be happy to write it up for the Thanks again! |
All right, I did find the I think something like this could be mentioned in the docs so that it is relatively obvious to anyone trying to train a NER model on a custom dataset. trankit/trankit/utils/ner_utils.py Lines 42 to 51 in 576d918
|
Thank you a lot for pointing it out, @mrshu. |
The docs has been updated. |
Thanks a ton for your speedy response @minhvannguyen, let's close this issue then! |
First of all, thanks for opensourcing
trankit
-- it looks very interesting!I would be interested in training a custom NER model as described in the docs. Could you please comment a bit on what sort of a format should the
.bio
files be stored in?Thanks!
cc @minhvannguyen
The text was updated successfully, but these errors were encountered: