GujTB is an in-progress treebank of Gujarati (an Indo-Aryan language) in Gujarati script.
Currently the treebank is comprised of 187 sentences, out of which 100 are doubly annotated by the authors. We plan to update the treebank with proper morphological annotations and features in the upcoming release.
This work has been accepted at MWE-UD Workshop at LREC-COLING'24. We will soon add the citation.
- (citation)
- 2022-11-15 v2.11
- Initial release in Universal Dependencies.
=== Machine-readable metadata (DO NOT REMOVE!) ================================ Data available since: UD v2.11 License: CC BY-SA 4.0 Includes text: yes Genre: grammar-examples Lemmas: manual native UPOS: manual native XPOS: not available Features: manual native Relations: manual native Contributors: Mehta, Maitrey; Jobanputra, Mayank Contributing: here Contact: maitrey@cs.utah.edu ===============================================================================