Skip to content

Latest commit

 

History

History

ucd

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

UNIC — Unicode Character Database

Crates.io Documentation

This UNIC component provides access to character properties as defined in the Unicode® Standard Annex #44 - Unicode Character Database.

UCD is a UNIC super-crate, composed of smaller crates that provide data in specific areas, therefore, allowing access only to the data needed instead of forcing dependent crates to import all UCD data.

Crates

Here's a list of components (available or planned) for this super-crate:

  • version: The Unicode Version of UCD data.

  • common: Common properties, such as Alphabetic, White-Space, Control and Numeric.

  • age: Age property.

  • bidi: Bidirectional properties. (Hebrew, Arabic, ...)

  • block: Block properties.

  • case: Letter Case properties.

  • category: General_Category property.

  • hangul: Hangul Syllable Composition & Decomposition.

  • ident: Identifier properties.

  • name: Name property.

  • normal: Normalization properties.

  • segment: Segmentation properties.

  • ea-width: East Asian Width properties.

  • joining: Cursive joining properties. (Arabic, Syriac, ...)

  • numeric: Other character numeric properties.

  • script: Script properties.

See http://unicode.org/reports/tr44/#Property_List_Table for the complete list of properties defined in UCD. Eventually, all these properties will be available by under unic-ucd.