Skip to content

An Apache-licensed, web-based sense annotation tool

Notifications You must be signed in to change notification settings

UKPLab/lrec2016-ubyline

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

25 Commits
 
 
 
 
 
 

Repository files navigation

Ubyline

Ubyline is an Apache-licensed, web-based sense annotation tool whose user interface is optimized for lexical sample data. Ubyline supports a wide range of sense inventories in several languages, including WordNet and GermaNet.

Please use the following citation:

@InProceedings{miller2016sense-annotating,
  author =    {Tristan Miller and Mohamed Khemakhem and Eckart de Castilho, Richard and Iryna Gurevych},
  title =     {Sense-annotating a Lexical Substitution Data Set with {Ubyline}},
  booktitle = {Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016)},
  editor =    {Nicoletta Calzolari and Khalid Choukri and Thierry Declerck and Marko Grobelnik and Bente Maegaard and Joseph Mariani and Asunci{\'{o}}n Moreno and Jan Odijk and Stelios Piperidis},
  year =      {2016},
  pages =     {828--835},
  publisher = {European Language Resources Association},
  month =     may,
  isbn =      {978-2-9517408-9-1},
}

Abstract: We describe the construction of GLASS, a newly sense-annotated version of the German lexical substitution data set used at the GermEval 2015: LexSub} shared task. Using the two annotation layers, we conduct the first known empirical study of the relationship between manually applied word senses and lexical substitutions. We find that synonymy and hypernymy/hyponymy are the only semantic relations directly linking targets to their substitutes, and that substitutes in the target's hypernymy/hyponymy taxonomy closely align with the synonyms of a single GermaNet synset. Despite this, these substitutes account for a minority of those provided by the annotators. The results of our analysis accord with those of a previous study on English-language data (albeit with automatically induced word senses), leading us to suspect that the sense--substitution relations we discovered may be of a universal nature. We also tentatively conclude that relatively cheap lexical substitution annotations can be used as a knowledge source for automatic WSD. Also introduced in this paper is Ubyline, the web application used to produce the sense annotations. Ubyline presents an intuitive user interface optimized for annotating lexical sample data, and is readily adaptable to sense inventories other than GermaNet.

Contact person: Richard Eckart de Castilho, eckart@ukp.informatik.tu-darmstadt.de

https://www.ukp.tu-darmstadt.de/

https://www.tu-darmstadt.de/

Don't hesitate to send us an e-mail or report an issue, if something is broken (and it shouldn't be) or if you have further questions.

This repository contains experimental software and is published for the sole purpose of giving additional background details on the respective publication.

Project structure

Follow the link for the documentation of Ubyline configuration: https://zoidberg.ukp.informatik.tu-darmstadt.de/jenkins/job/Ubyline-Lrec2016%20Doc/de.tudarmstadt.ukp.ubyline$dkpro-uby-ubyline/doclinks/1/

Requirements

  • Java 7
  • Apache Maven 3.3
  • Apache Tomcat 7
  • MySQL 5.5
  • CWB 3.0 from the IMS Open Corpus Workbench

About

An Apache-licensed, web-based sense annotation tool

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •