
PanCJKV IVD Collection

PanCJKV IVD Collection is an unregistered IVD (Ideographic Variation Database) collection that assigns eleven IVSes (Ideographic Variation Sequences), one for each region among VN (Vietnam), KP (DPRK), KR (ROK), JP (Japan), MY (Malaysia), MO (Macao SAR), HK (Hong Kong SAR), TW (ROC), SG (Republic of Singapore), CN (PRC), and XK (Kāngxī) as a pseudo-region, to all CJK Unified Ideographs (80,388 in Unicode Version 8.0). VS246 (U+E01E5) through VS256 (U+E01EF), which represent the final eleven VSes (Variation Selectors), are used for this purpose, and each one uniformly corresponds to one of the eleven supported regions, assigned in the order shown above. Thus, this IVD collection includes 884,268 IVSes.
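The region-to-selector assignment and the resulting IVS count can be checked with a short sketch. This is illustrative only and not part of the project's scripts; the names `REGIONS`, `vs_code_point`, `REGION_TO_VS`, and `ivs` are hypothetical. It relies only on the facts stated above and on VS17 being U+E0100.

```python
# Regions in the order listed above; position in this list selects the VS.
REGIONS = ["VN", "KP", "KR", "JP", "MY", "MO", "HK", "TW", "SG", "CN", "XK"]

VS17 = 0xE0100  # first selector in the Variation Selectors Supplement block


def vs_code_point(vs_number):
    """Return the code point of VS<vs_number> for the range VS17..VS256."""
    if not 17 <= vs_number <= 256:
        raise ValueError("only VS17 through VS256 are handled here")
    return VS17 + (vs_number - 17)


# VS246 goes to the first region, VS256 to the last.
REGION_TO_VS = {
    region: vs_code_point(246 + index) for index, region in enumerate(REGIONS)
}


def ivs(base, region):
    """Build the IVS string (base character + selector) for a region."""
    return chr(base) + chr(REGION_TO_VS[region])


UNIFIED_IDEOGRAPHS = 80_388  # CJK Unified Ideographs in Unicode 8.0
TOTAL_IVSES = UNIFIED_IDEOGRAPHS * len(REGIONS)

for region in REGIONS:
    print(f"{region}: U+{REGION_TO_VS[region]:05X}")
print("total IVSes:", TOTAL_IVSES)
```

For example, `ivs(0x4E00, "CN")` yields U+4E00 followed by VS255 (U+E01EE), since CN is tenth in the list.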

Included in this project are the standard IVD data files, IVD_Collections.txt, IVD_Sequences.txt, and IVD_Stats.txt, to which the "PanCJKV" IVD collection has been added. These data files are based on the current (2014-05-16) version of the IVD. The script in the Scripts directory was used to create the data that was added to the IVD_Sequences.txt file.
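To inspect what was added to IVD_Sequences.txt, the file can be read with a few lines of code. The sketch below is not the project's script; it assumes the field layout defined for IVD data files in UTS #37 (base code point and variation selector in hexadecimal, then the collection name, then the sequence identifier, separated by semicolons, with `#` starting a comment line). The function names are hypothetical.

```python
from collections import Counter


def parse_ivd_sequences(text):
    """Parse IVD_Sequences.txt content into (base, vs, collection, id) tuples."""
    entries = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        # Assumed layout: "<base> <vs>; <collection>; <identifier>"
        sequence, collection, identifier = [
            field.strip() for field in line.split(";")[:3]
        ]
        base_hex, vs_hex = sequence.split()
        entries.append((int(base_hex, 16), int(vs_hex, 16), collection, identifier))
    return entries


def count_by_collection(entries):
    """Count how many sequences each IVD collection contributes."""
    return Counter(collection for _, _, collection, _ in entries)
```

Reading the file as UTF-8 text, passing it to `parse_ivd_sequences`, and looking up `"PanCJKV"` in the result of `count_by_collection` should give the number of sequences in this collection, provided the file follows the assumed layout.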

Also included in this project are two example font implementations. Both are based on the language-specific Simplified Chinese Source Han Sans OpenType/CFF font, with the SC identifier changed to R04 or R11. The script in the Scripts directory was used to create the SourceHanSansR04_CN_sequences.txt and SourceHanSansR11_CN_sequences.txt UVS (Unicode Variation Sequence) definition files in the Sources directory (the script in the same directory can be used to regenerate those files). Each UVS definition file is used as the argument of the AFDKO makeotf tool's -ci command-line option. (The UTF-32 to CID mapping files in the Resources directory are those for Source Han Sans, and are used as arguments to that script's per-region command-line options.)

The SourceHanSansR04_CN_sequences.txt UVS definition file includes only a subset of the "PanCJKV" IVD collection: it covers 29,777 CJK Unified Ideograph code points and four regions (CN, TW, JP, and KR), so only 119,108 of the 884,268 IVSes are included. Of these 119,108 IVSes, 91,307 become default UVSes, and the remaining 27,801 become non-default ones.

The SourceHanSansR11_CN_sequences.txt UVS definition file covers the same 29,777 code points for all eleven regions by aliasing IVSes for unsupported regions to supported ones as appropriate: MY/SG to CN, HK/MO/VN to TW, and KP/XK to KR. As a result, 327,547 of the 884,268 IVSes are included in this UVS definition file. Of these 327,547 IVSes, 255,558 become default UVSes, and the remaining 71,989 become non-default ones.
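The aliasing and the default/non-default split can be illustrated with a minimal sketch. It is not the project's script. It assumes that a UVS is "default" when the region's glyph is the same glyph the font already maps the base code point to, which for these Simplified Chinese fonts is taken to be the CN glyph. The names `resolve_region` and `classify`, and the `cid_maps` structure (region to a dictionary from code point to CID), are hypothetical.

```python
SUPPORTED = ("CN", "TW", "JP", "KR")

# R11 aliasing of unsupported regions to supported ones, as described above.
ALIASES = {
    "MY": "CN", "SG": "CN",
    "HK": "TW", "MO": "TW", "VN": "TW",
    "KP": "KR", "XK": "KR",
}

DEFAULT_REGION = "CN"  # assumption: the CN glyph is the font's default glyph


def resolve_region(region):
    """Map any of the eleven regions to one of the four supported regions."""
    return region if region in SUPPORTED else ALIASES[region]


def classify(base, region, cid_maps):
    """Return 'default' or 'non-default' for the IVS of (base, region).

    cid_maps: {supported region: {code point: CID}} (hypothetical structure).
    """
    cid = cid_maps[resolve_region(region)][base]
    default_cid = cid_maps[DEFAULT_REGION][base]
    return "default" if cid == default_cid else "non-default"


# The totals quoted above follow from the number of covered code points.
CODE_POINTS = 29_777
R04_TOTAL = CODE_POINTS * len(SUPPORTED)                   # 119,108
R11_TOTAL = CODE_POINTS * (len(SUPPORTED) + len(ALIASES))  # 327,547
```

Under this assumption every CN sequence is a default UVS, and a sequence for another region is default only when that region shares the CN glyph for the code point in question.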

Pending UTC approval based on L2/16-063, this IVD collection will be submitted for registration in the near future, which will entail issuing a PRI (Public Review Issue) with a 90-day review period. Once registered, this project will serve as the site that describes the "PanCJKV" IVD collection. In addition, support for this IVD collection will be added to Source Han Sans.

Example font installation instructions

Building the example font from source

To build the binary example font file from source, you must first install the Adobe Font Development Kit for OpenType (AFDKO). The AFDKO tools are widely used for font development today, and are part of most font editor applications.

Building the example font

All necessary files for building the example OpenType/CFF font are in the Sources directory of this project, and the file in the same directory provides the command lines that are used.

Getting Involved

Send any feedback to the PanCJKV IVD Collection project maintainer, Dr. Ken Lunde, for consideration.