Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

List of possible formats #3

Open
3 of 10 tasks
ivyleavedtoadflax opened this issue Jul 17, 2019 · 11 comments
Open
3 of 10 tasks

List of possible formats #3

ivyleavedtoadflax opened this issue Jul 17, 2019 · 11 comments
Assignees
Labels
enhancement New feature or request

Comments

@ivyleavedtoadflax
Copy link
Collaborator

ivyleavedtoadflax commented Jul 17, 2019

What other formats do we want the package to be able to use? For starters:

  • list of labels
  • Prodigy
  • spaCy
  • StanfordNER
  • CoNLLa
  • Single-Lineb
  • xml: TODO (add example)
  • inlineXML: TODO (add example)
  • tsv: TODO (add example)
  • slashTags: TODO (add example)
  • a
,	O
Davos	PERSON
2018	O
:	O
Soros	PERSON
accuses	O
Trump	PERSON
of	O
wanting	O
a	O
`	O
mafia	O
state	O
'	O
and	O
blasts	O
social	O
media	O
.	O
  • b
Switzerland/LOCATION ,/O Davos/PERSON 2018/O :/O Soros/PERSON accuses/O Trump/PERSON of/O wanting/O a/O /O mafia/O state/O '/O and/O blasts/O social/O media/O ./O`
@ivyleavedtoadflax
Copy link
Collaborator Author

@ivyleavedtoadflax
Copy link
Collaborator Author

See https://universaldependencies.org/format.html for CoNLL format.

@ivyleavedtoadflax
Copy link
Collaborator Author

There is an implementation of CoNLL to spacy here: explosion/spaCy#533 (comment) which should be easy to adapt to the prodigy format now used by nervaluate.

@ivyleavedtoadflax
Copy link
Collaborator Author

@davidsbatista I've merged this into #3 so will close this here for now.

@ivyleavedtoadflax
Copy link
Collaborator Author

Ok - will close the correct issue this time!

@gunturbudi
Copy link

I would love to see the Flair output compatible with this library.

https://github.com/flairNLP/flair/blob/master/resources/docs/TUTORIAL_2_TAGGING.md

@ivyleavedtoadflax
Copy link
Collaborator Author

Thanks @gunturbudi for the suggestion. I'd love to know more if you are currently using this package and what your use cases might be. We are just kicking off some development on it.

@gunturbudi
Copy link

Hi, we are conducting an experiment on NER for informal medical phrases. Since Flair library did not provide a complete evaluation, we really glad we found this package :)

@ivyleavedtoadflax
Copy link
Collaborator Author

That's great to hear @gunturbudi, and I'm very glad you find it useful.

Just to make you aware, there are a few bugs which are listed in the other issues which will affect the evaluation in some cases. We've just this week started to work on the package again, so I'm hoping that we will resolve those issues soon. But just to make you aware 👀

@ivyleavedtoadflax ivyleavedtoadflax added the enhancement New feature or request label Jul 1, 2021
@gunturbudi
Copy link

Thanks for the information @ivyleavedtoadflax
We also cross-check it with the evaluation from flair. So far the difference is quite small. But we will wait and see for the update :)

@ivyleavedtoadflax
Copy link
Collaborator Author

Thanks for the information @ivyleavedtoadflax
We also cross-check it with the evaluation from flair. So far the difference is quite small. But we will wait and see for the update :)

If you don't mind sharing the differences you've found with flair, that could be useful.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

3 participants