Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support CoNLL-U Plus #26

Open
BramVanroy opened this issue Apr 2, 2023 · 0 comments
Open

Support CoNLL-U Plus #26

BramVanroy opened this issue Apr 2, 2023 · 0 comments
Assignees
Labels
enhancement New feature or request

Comments

@BramVanroy
Copy link
Owner

As requested as part of #24

It would be neat to support CoNLL-U Plus:

  • export only the requested fields (and mark the output CoNLL-U with global.columns)
  • allow reading in a CoNLL-U Plus file
  • it also supports custom columns but I am hesitant to support those. Perhaps we can use them, if a custom field is present in the private spaCy registered space ._. then we may use that destination. Will have to think about it some more.

Here is an example of a CoNLL-U Plus file. Note how the first line indicates which fields are present (separated by spaces).

# global.columns = ID FORM LEMMA UPOS XPOS FEATS HEAD DEPREL DEPS MISC
# newdoc id = mf920901-001
# newpar id = mf920901-001-p1
# sent_id = mf920901-001-p1s1A
# text = Slovenská ústava: pro i proti
# text_en = Slovak constitution: pros and cons
1   Slovenská   slovenský   ADJ     AAFS1----1A---- Case=Nom|Degree=Pos|Gender=Fem|Number=Sing|Polarity=Pos 2 amod _ _
2   ústava      ústava      NOUN    NNFS1-----A---- Case=Nom|Gender=Fem|Number=Sing|Polarity=Pos 0 root _ SpaceAfter=No
3   :           :           PUNCT   Z:------------- _          2       punct   _       _
4   pro         pro         ADP     RR--4---------- Case=Acc   2       appos   _       LId=pro-1
5   i           i           CCONJ   J^------------- _          6       cc      _       LId=i-1
6   proti       proti       ADP     RR--3---------- Case=Dat   4       conj    _       LId=proti-1

If you want to see this implemented, please give this post a thumbs up so that I know what to prioritize.

@BramVanroy BramVanroy added the enhancement New feature or request label Apr 2, 2023
@BramVanroy BramVanroy self-assigned this Apr 2, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant