Switch branches/tags
Find file Copy path
Fetching contributors…
Cannot retrieve contributors at this time
207 lines (138 sloc) 6.36 KB
This file contains character (or code point) sets (or classes)
* Structure
The file contains a JSON object with following name/value pair:
sets [protocol-object]
The value is a JSON object whose names are character set names and
values are JSON objects with following additional name/value pairs:
chars [string]
The list of the code points in the set, represented in following
The first character is "[" and the last character is "]". Any
character between them are character in the set except for "\"
with some following characters, "-", "^", "[", and "]". If
there is a "-" character between two characters, any character
whose code point is between code points of them are also in
the set. Characters "\u{" followed by a code point followed
by a "}" indicate that the character is in the set.
Characters "\u" followed by four hexadecimal alphabet indicate
that the character whose code point is equal to the
hexadecimal number is in the set.
label [string]
A short English string describing the set.
suikawiki_name [string]
A page name in SuikaWiki.
<{name}>, where {name} is a
percent-encoded value of this field, is the URL for the page.
* Sources
ECMAScript 5.1.
HTML Standard <>.
RFC 1034.
RFC 1738.
RFC 1945, Hypertext Transfer Protocol -- HTTP/1.0
RFC 2046, Multipurpose Internet Mail Extensions (MIME) Part Two: Media
Types <>.
RFC 2068, Hypertext Transfer Protocol -- HTTP/1.1
RFC 2231, MIME Parameter Value and Encoded Word Extensions: Character
Sets, Languages, and Continuations
RFC 2234, Augmented BNF for Syntax Specifications: ABNF
RFC 2295, Transparent Content Negotiation in HTTP
RFC 2396.
RFC 2616, Hypertext Transfer Protocol -- HTTP/1.1
RFC 2822, Internet Message Format
RFC 3454.
RFC 3629, UTF-8, a transformation format of ISO 10646
RFC 3722, RFC 3920, RFC 2986, RFC 3987, RFC 5234.
RFC 4518, Lightweight Directory Access Protocol (LDAP):
Internationalized String Preparation
RFC 5322, Internet Message Format
RFC 5335, Internationalized Email Headers
RFC 5987, Character Set and Language Encoding for Hypertext Transfer
Protocol (HTTP) Header Field Parameters
RFC 6122.
RFC 6532, Internationalized Email Headers
RFC 6570, URI Template <>.
RFC 6749, The OAuth 2.0 Authorization Framework
RFC 6750, The OAuth 2.0 Authorization Framework: Bearer Token Usage
RFC 6838, Media Type Specifications and Registration Procedures
RFC 7230, Hypertext Transfer Protocol (HTTP/1.1): Message Syntax and
Routing <>.
RFC 7235, Hypertext Transfer Protocol (HTTP/1.1): Authentication
RFC 7468, Textual Encodings of PKIX, PKCS, and CMS Structures
Unicode Character Database
Unicode Standard Annex #44: Unicode Character Database
Unicode in XML and other Markup Languages
URL Standard <>.
Extensible Markup Language (XML) 1.0 (Fourth Edition)
Extensible Markup Language (XML) 1.0 (Fifth Edition)
Extensible Markup Language (XML) 1.1 (Second Edition)
Character Model for the World Wide Web 1.0: Normalization
RFC 5892 - The Unicode Code Points and Internationalized Domain Names
for Applications (IDNA) <>.
IDNA Parameters
PRECIS Derived Property Value
OpenType specification version 1.4
OpenType specification version 1.5
TTML Text and Image Profiles for Internet Media Subtitles and Captions
ISO/IEC TR 10176:1998, Information technology -- Guidelines for the
preparation of programming language standards, 1998-09-01 (Second
TCVN 6909:2001, 16-bit Coded Vietnamese Character Set.
Network.IDN.blacklist chars - MozillaZine Knowledge Base
JIS X 0221-1:2001.
JIS X 4051-1995, 日本語文書の行組版方法, Line composition rules for
Japanese documents.
JIS X 4052:2000, 日本語文書の組版指定交換形式, Exchange format for
Japanese documents with composition markup.
* License
You are granted a license to use, reproduce, and create derivative
works of the JSON file and this document.
Per CC0 <>, to the
extent possible under law, the author of the JSON file and this
document has waived all copyright and related or neighboring rights to
the JSON file and this document.
The JSON file contains data extracted from Unicode Character Database.
Copyright © 1991-2014 Unicode, Inc. All rights reserved. See
<> or
The JSON file contains data extracted from HTML Standard. "Written by
Ian Hickson (Google, - Parts © Copyright 2004-2014 Apple
Inc., Mozilla Foundation, and Opera Software ASA; You are granted a
license to use, reproduce and create derivative works of this