Convert to Spacy format #4

Open
obtic-sorbonne opened this issue Sep 26, 2019 · 2 comments

Comments

@obtic-sorbonne

Hi!
Does your tool convert files generated by WebAnno (UIMA binary CAS, UIMA CAS JSON, UIMA CAS XMI, CoNLL, TSV3) to the spaCy NER training data format?
Thank you in advance!

@ManivannanIkomet

Hi,
Once the annotation is completed in WebAnno, download the annotated JSON.
You can then use the Python script to convert it to spaCy format.
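
For illustration, here is a minimal sketch of what such a conversion might look like. It assumes the input is a JSON list of records, each with a `content` string and an `entities` list of `[start, end, label]` triples; the `content` key and the function name `to_spacy_train_data` are assumptions made for this example and are not confirmed by the thread. The output follows the tuple-based training format used by spaCy v2.

```python
import json


def to_spacy_train_data(records):
    """Turn a list of record dicts into (text, {"entities": [...]}) tuples.

    Assumed record shape (hypothetical):
    {"content": "<text>", "entities": [[start, end, "LABEL"], ...]}
    """
    train_data = []
    for record in records:
        # Each entity becomes a (start, end, label) tuple of character offsets.
        ents = [tuple(entity) for entity in record["entities"]]
        train_data.append((record["content"], {"entities": ents}))
    return train_data


# Hypothetical sample input, standing in for the downloaded annotation file.
sample = json.loads(
    '[{"content": "Paris is in France",'
    ' "entities": [[0, 5, "LOC"], [12, 18, "LOC"]]}]'
)
print(to_spacy_train_data(sample))
```

Note that this only works when the top level of the JSON file is a list of such records.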

@obtic-sorbonne
Author

obtic-sorbonne commented Sep 26, 2019

Thank you for your answer! I am a beginner in this field, so all these details are new to me.
I exported the annotated sentences (in Arabic) from WebAnno in UIMA CAS JSON format (see attachment). When I apply your convert_spacy_train_data.py script to my data, I get this error:

Traceback (most recent call last):
  File "D:\OBVIL\Bureau\spacy-ner-annotator-master\convert_spacy_train_data.py", line 12, in <module>
    ents = [tuple(entity) for entity in data['entities']]
TypeError: string indices must be integers

Could you please advise?

CURATION_USER.zip
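
A possible reading of the traceback: `data` is a string at the point where `data['entities']` is evaluated. That would happen if the script loops over the loaded JSON and the top level of the file is an object (a dict) instead of a list, because iterating over a dict yields its keys. The sketch below reproduces that situation and adds a shape check before conversion. It is only a guess at the cause, not a confirmed diagnosis; the helper name `load_records` and the key `top_level_key` are hypothetical.

```python
import json


def load_records(raw_json):
    """Parse JSON text and check it is a list of dicts with an "entities" key."""
    parsed = json.loads(raw_json)
    if not isinstance(parsed, list):
        raise ValueError(
            f"expected a JSON list of records, got {type(parsed).__name__}"
        )
    for i, record in enumerate(parsed):
        if not isinstance(record, dict) or "entities" not in record:
            raise ValueError(f"record {i} is not a dict with an 'entities' key")
    return parsed


# Hypothetical stand-in for a file whose top level is an object, not a list.
object_at_top = json.loads('{"top_level_key": {}}')

# Looping over a dict yields its keys, so `data` is a str here and
# indexing it with 'entities' raises the TypeError seen in the traceback.
for data in object_at_top:
    try:
        data["entities"]
    except TypeError as exc:
        print(type(data).__name__, "->", exc)
```

If the check fails on the exported file, the export would first need to be mapped into the list-of-records shape the script expects before it can be converted.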
