Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Converting co-reference chains from conlll format to jsonlines format of clusters. #32

Closed
ashim95 opened this issue Sep 12, 2018 · 2 comments

Comments

@ashim95
Copy link

ashim95 commented Sep 12, 2018

First of all, thank you for sharing your code.

I am unable to understand how the co-reference chains from conlll files have been converted to the cluster format (the numbers in the clusters seem arbitrary !!).

Thanks,

@kentonl
Copy link
Owner

kentonl commented Sep 13, 2018

The script that converts the conll files to the jsonlines format is here: https://github.com/kentonl/e2e-coref/blob/master/minimize.py. Could you be a bit more specific about what is confusing?

If you're asking about the cluster numbers in the conll format, then yes they are arbitrary. If you're asking about the numbers in the jsonline format, they are referring to start and end indices of each span within a cluster.

@ashim95
Copy link
Author

ashim95 commented Sep 13, 2018

Thank You very much for your swift reply.

I was, in fact talking about the cluster numbers in jsonlines format. Now that you mentioned they are start and end of the spans, it all makes sense.

Again, thank you very much.

@ashim95 ashim95 closed this as completed Sep 13, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants