Skip to content
No description, website, or topics provided.
Branch: master
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
UD_English-EWT
annotated
annotated_structure
autoid-autosyn-autopss
autoid-autosyn-goldpss added concatenated labels data and standard xml versions of SNACS and… May 18, 2019
find_refined_standard
goldid-autosyn-goldpss
goldud
orig
snacs_standard
spacy
stanfordnlp
udpipe
README.md
annotate.sh

README.md

ucca-streusle

This repository contains data from the English Web Treebank (EWT) section of the UD corpus, which has been annotated with SNACS (Schneider et al., 2018) and UCCA (Abend and Rappoport, 2013). Here present specifically a version of this data where the two different semantic annotations have been integrated to facilitate joint learning (Prange et al., 2019).

Overview of directories

  • orig: original UCCA annotations
  • annotated: original UCCA annotations with SNACS categories as token features, for pipeline setup in Prange et al., 2019
  • autoid-autosyn-autopss, autoid-autosyn-goldpss, goldid-autosyn-goldpss: UCCA annotations with integrated SNACS categories. For details on automatic SNACS target identification (autoid) and disambiguation (autopss), see Schneider et al., 2018; automatic SNACS labels are obtained from their SVM classifier). Subdirectories refer to the following setup-specific inputs in Prange et al., 2019:
    • integrated: dependent MTL/rel
    • terminal: dependent MTL/ter
    • integrated-concat: joint/rel
    • terminal-concat: joint/ter
  • snacs_standard: SNACS targets, formatted as UCCA standard xml, for independent MTL setup in Prange et al., 2019
  • find_refined_standard: UCCA passages, with boolean edge labels identifying SNACS-refinement, for independent MTL setup in Prange et al., 2019

References

Omri Abend and Ari Rappoport. 2013. Universal Conceptual Cognitive Annotation (UCCA). In Proc. of ACL, pages 228–238, Sofia, Bulgaria.

Jakob Prange, Nathan Schneider, and Omri Abend. 2019. Made for Each Other: Broad-coverage Semantic Structures Meet Preposition Supersenses. To appear at CoNLL, Hong Kong, China.

Nathan Schneider, Jena D. Hwang, Vivek Srikumar, Jakob Prange, Austin Blodgett, Sarah R. Moeller, Aviram Stern, Adi Bitan, and Omri Abend. 2018. Comprehensive supersense disambiguation of English prepositions and possessives. In Proc. of ACL, pages 185–196, Melbourne, Australia.

You can’t perform that action at this time.