Chinese-PatentChar
is a treebank of Chinese patent application texts collected from the Chinese patent office's website CNIPA.
The sentences are randomly selected from the patent claims of the IPC section "G" from November 2017 to September 2018.
The syntactic analysis is originally done in mSUD (on the character level).
A regular SUD version is available in the SUD_Chinese-PatentChar
folder.
-
2023-11-16 v2.13
- 100 additionnal sentences
- Annotation in the mSUD format, with conversion into SUD
-
2022-11-15 v2.11
- Initial release in Universal Dependencies.
=== Machine-readable metadata (DO NOT REMOVE!) ================================ Data available since: UD v2.11 License: CC BY-NC-SA 3.0 Includes text: yes Genre: legal Lemmas: not available UPOS: manual native XPOS: not available Features: manual native Relations: converted from manual Contributors: Li, Yixuan; Gerdes, Kim; Guillaume, Bruno Contributing: elsewhere Contact: li.yixuan727@gmail.com ===============================================================================