A treebank of Chinese patent application texts collected from the Chinese patent office's website CNIPA.
The sentences are randomly selected from the patent claims of the IPC section "G" from November 2017 to September 2018.
The syntactic analysis is originally done in SUD on the character level under the name SUD_Chinese-PatentChar. See SUD Guidelines : https://surfacesyntacticud.github.io/guidelines/u/
-
2024-05-15 v2.14
- Original annotation in mSUD (https://github.com/surfacesyntacticud/mSUD_Chinese-PatentChar), see LREC-COLING 2024 paper: Joint Annotation of Morphology and Syntax in Dependency Treebanks
-
2022-11-15 v2.11
- Initial release in Universal Dependencies.
=== Machine-readable metadata (DO NOT REMOVE!) ================================ Data available since: UD v2.11 License: CC BY-NC-SA 3.0 Includes text: yes Genre: legal Lemmas: not available UPOS: manual native XPOS: not available Features: manual native Relations: converted from manual Contributors: Li, Yixuan; Gerdes, Kim; Guillaume, Bruno Contributing: elsewhere Contact: li.yixuan727@gmail.com ===============================================================================