Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

instructions for UD annotation tools #373

Closed
arademaker opened this issue Dec 12, 2016 · 16 comments
Closed

instructions for UD annotation tools #373

arademaker opened this issue Dec 12, 2016 · 16 comments
Assignees
Milestone

Comments

@arademaker
Copy link
Contributor

http://universaldependencies.org/tools.html#brat-rapid-annotation-tool

says "brat is a browser-based tool for text annotation. The brat visualization component is used in the UD documentation system and the tool can be easily configured for UD annotation (TODO: link instructions)."

Does anyone knows the step-by-step? I would be glad to help on debug it and publish the revised instructions.

@spyysalo
Copy link
Member

Thanks for offering to help! The basic steps are

  1. Tokenize source texts
  2. Set up tokenized texts in brat
  3. Copy in UD config (https://github.com/nlplab/brat/tree/master/configurations/Universal-Dependencies)
  4. Annotate
  5. Convert brat standoff format to CoNLL-U

Caveats: the config is for UD v1 and needs to be updated, I'm not aware of a published brat-to-CoNLL-U conversion that I could be confident in recommending, and brat lacks direct support for multiword tokens.

@dan-zeman
Copy link
Member

I have written a script that I used to convert the annotation of the Cairo Cicling Corpus. You may want to have a look whether it helps you. It resides at

https://github.com/UniversalDependencies/cairo/blob/master/weaver/brat2conllu.pl

@dan-zeman dan-zeman added this to the later milestone Dec 15, 2016
@jnivre
Copy link
Contributor

jnivre commented Dec 15, 2016

There is also WebAnno, which Richard Eckhart de Castillo set up specifically for UD last year, so that it allows two-level word segmentation and a distinction between basic and enhanced dependencies. We should really test this and see how it works.

@spyysalo
Copy link
Member

+1 for including WebAnno! ( @reckart )

@reckart
Copy link
Contributor

reckart commented Dec 15, 2016

@spyysalo I have submitted a pull request ;)

@spyysalo
Copy link
Member

Thanks, merged!

@amir-zeldes
Copy link
Contributor

Should we add Arborator to this page?

https://arborator.ilpga.fr/

It has both a great project management system and a quick copy-paste mode without setting up a project, very convenient:

https://arborator.ilpga.fr/q.cgi

It's what we use for UD_Coptic.

@dan-zeman
Copy link
Member

@amir-zeldes : Yes, please go ahead and add it.

@nschneid
Copy link
Contributor

I opened an issue here: Arborator/arborator-server#6

@amir-zeldes
Copy link
Contributor

I've added it, but I can't seem to make the image work. I added it to docs/img/ - is that wrong?

@nschneid
Copy link
Contributor

Looks good to me.

spyysalo added a commit that referenced this issue Dec 16, 2016
@spyysalo spyysalo changed the title intructions for brat+UD intructions for UD annotation tools Dec 16, 2016
@spyysalo
Copy link
Member

Updated title to reflect the broader discussion here. The description of other tools would also benefit from step-by-step instructions for how to get started using them for UD annotation.

@nschneid nschneid changed the title intructions for UD annotation tools instructions for UD annotation tools Dec 16, 2016
@amir-zeldes
Copy link
Contributor

Oh, weird, it just took much longer to appear than the text edit. I see it fine now too.

@spyysalo
Copy link
Member

@arademaker : are you still interested in testing and publishing more detailed instructions?

@arademaker
Copy link
Contributor Author

@spyysalo , yes, I can help on that. We will try to document it probably opening a new subpage for brat linked to http://universaldependencies.org/tools.html. Does that make sense?

@spyysalo
Copy link
Member

@arademaker Works for me, thanks!

nschneid added a commit that referenced this issue Oct 16, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

7 participants