Skip to content

surfacesyntacticud/mSUD_Chinese-PatentChar

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

60 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Summary

Chinese-PatentChar is a treebank of Chinese patent application texts collected from the Chinese patent office's website CNIPA.

The sentences are randomly selected from the patent claims of the IPC section "G" from November 2017 to September 2018.

Introduction

The syntactic analysis is originally done in mSUD (on the character level). A regular SUD version is available in the SUD_Chinese-PatentChar folder.

Changelog

  • 2023-11-16 v2.13

    • 100 additionnal sentences
    • Annotation in the mSUD format, with conversion into SUD
  • 2022-11-15 v2.11

    • Initial release in Universal Dependencies.
=== Machine-readable metadata (DO NOT REMOVE!) ================================
Data available since: UD v2.11
License: CC BY-NC-SA 3.0
Includes text: yes
Genre: legal
Lemmas: not available
UPOS: manual native
XPOS: not available
Features: manual native
Relations: converted from manual
Contributors: Li, Yixuan; Gerdes, Kim; Guillaume, Bruno
Contributing: elsewhere
Contact: li.yixuan727@gmail.com
===============================================================================

About

syntactic analysis of Chinese patents on the character level

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •