Permalink
Switch branches/tags
Find file Copy path
Fetching contributors…
Cannot retrieve contributors at this time
207 lines (138 sloc) 6.36 KB
sets.json
~~~~~~~~~
This file contains character (or code point) sets (or classes)
definitions.
* Structure
The file contains a JSON object with following name/value pair:
sets [protocol-object]
The value is a JSON object whose names are character set names and
values are JSON objects with following additional name/value pairs:
chars [string]
The list of the code points in the set, represented in following
format:
The first character is "[" and the last character is "]". Any
character between them are character in the set except for "\"
with some following characters, "-", "^", "[", and "]". If
there is a "-" character between two characters, any character
whose code point is between code points of them are also in
the set. Characters "\u{" followed by a code point followed
by a "}" indicate that the character is in the set.
Characters "\u" followed by four hexadecimal alphabet indicate
that the character whose code point is equal to the
hexadecimal number is in the set.
label [string]
A short English string describing the set.
suikawiki_name [string]
A page name in SuikaWiki.
<https://wiki.suikawiki.org/n/{name}>, where {name} is a
percent-encoded value of this field, is the URL for the page.
* Sources
ECMAScript 5.1.
HTML Standard <https://html.spec.whatwg.org/>.
RFC 822, STANDARD FOR THE FORMAT OF ARPA INTERNET TEXT MESSAGES
<https://tools.ietf.org/html/rfc822>.
RFC 1034.
RFC 1738.
RFC 1945, Hypertext Transfer Protocol -- HTTP/1.0
<https://tools.ietf.org/html/rfc1945>.
RFC 2046, Multipurpose Internet Mail Extensions (MIME) Part Two: Media
Types <https://tools.ietf.org/html/rfc2046>.
RFC 2068, Hypertext Transfer Protocol -- HTTP/1.1
<https://tools.ietf.org/html/rfc2068>.
RFC 2231, MIME Parameter Value and Encoded Word Extensions: Character
Sets, Languages, and Continuations
<https://tools.ietf.org/html/rfc2231>.
RFC 2234, Augmented BNF for Syntax Specifications: ABNF
<https://tools.ietf.org/html/rfc2234>.
RFC 2295, Transparent Content Negotiation in HTTP
<https://tools.ietf.org/html/rfc2295>.
RFC 2396.
RFC 2616, Hypertext Transfer Protocol -- HTTP/1.1
<https://tools.ietf.org/html/rfc2616>.
RFC 2822, Internet Message Format
<https://tools.ietf.org/html/rfc2822>.
RFC 3454.
RFC 3629, UTF-8, a transformation format of ISO 10646
<https://tools.ietf.org/html/rfc3629>.
RFC 3722, RFC 3920, RFC 2986, RFC 3987, RFC 5234.
RFC 4518, Lightweight Directory Access Protocol (LDAP):
Internationalized String Preparation
<https://tools.ietf.org/html/rfc4518>.
RFC 5322, Internet Message Format
<https://tools.ietf.org/html/rfc5322>.
RFC 5335, Internationalized Email Headers
<https://tools.ietf.org/html/rfc5335>.
RFC 5987, Character Set and Language Encoding for Hypertext Transfer
Protocol (HTTP) Header Field Parameters
<https://tools.ietf.org/html/rfc5987>.
RFC 6122.
RFC 6532, Internationalized Email Headers
<https://tools.ietf.org/html/rfc6532>.
RFC 6570, URI Template <http://tools.ietf.org/html/rfc6570>.
RFC 6749, The OAuth 2.0 Authorization Framework
<https://tools.ietf.org/html/rfc6749>.
RFC 6750, The OAuth 2.0 Authorization Framework: Bearer Token Usage
<https://tools.ietf.org/html/rfc6750>.
RFC 6838, Media Type Specifications and Registration Procedures
<https://tools.ietf.org/html/rfc6838>.
RFC 7230, Hypertext Transfer Protocol (HTTP/1.1): Message Syntax and
Routing <https://tools.ietf.org/html/rfc7230>.
RFC 7235, Hypertext Transfer Protocol (HTTP/1.1): Authentication
<https://tools.ietf.org/html/rfc7235>.
RFC 7468, Textual Encodings of PKIX, PKCS, and CMS Structures
<https://tools.ietf.org/html/rfc7468>.
Unicode Character Database
<http://www.unicode.org/ucd/>.
Unicode Standard Annex #44: Unicode Character Database
<http://www.unicode.org/reports/tr44/>.
Unicode in XML and other Markup Languages
<http://www.unicode.org/reports/tr20/>.
URL Standard <https://url.spec.whatwg.org/>.
Extensible Markup Language (XML) 1.0 (Fourth Edition)
<http://www.w3.org/TR/2006/REC-xml-20060816/>.
Extensible Markup Language (XML) 1.0 (Fifth Edition)
<http://www.w3.org/TR/2008/REC-xml-20081126/>.
Extensible Markup Language (XML) 1.1 (Second Edition)
<http://www.w3.org/TR/2006/REC-xml11-20060816/>.
Character Model for the World Wide Web 1.0: Normalization
<http://www.w3.org/TR/charmod-norm/>.
RFC 5892 - The Unicode Code Points and Internationalized Domain Names
for Applications (IDNA) <http://tools.ietf.org/html/rfc5892>.
IDNA Parameters
<https://www.iana.org/assignments/idna-tables/idna-tables.xml>.
PRECIS Derived Property Value
<https://www.iana.org/assignments/precis-tables/precis-tables.xhtml>.
OpenType specification version 1.4
<http://www.microsoft.com/typography/otspec140/default.htm>.
OpenType specification version 1.5
<http://www.microsoft.com/typography/otspec150/default.htm>.
TTML Text and Image Profiles for Internet Media Subtitles and Captions
1.0
<https://dvcs.w3.org/hg/ttml/raw-file/tip/ttml-ww-profiles/ttml-ww-profiles.html>.
ISO/IEC TR 10176:1998, Information technology -- Guidelines for the
preparation of programming language standards, 1998-09-01 (Second
edition).
TCVN 6909:2001, 16-bit Coded Vietnamese Character Set.
Network.IDN.blacklist chars - MozillaZine Knowledge Base
<http://kb.mozillazine.org/Network.IDN.blacklist_chars>.
JIS X 0221-1:2001.
JIS X 4051-1995, 日本語文書の行組版方法, Line composition rules for
Japanese documents.
JIS X 4052:2000, 日本語文書の組版指定交換形式, Exchange format for
Japanese documents with composition markup.
* License
You are granted a license to use, reproduce, and create derivative
works of the JSON file and this document.
Per CC0 <https://creativecommons.org/publicdomain/zero/1.0/>, to the
extent possible under law, the author of the JSON file and this
document has waived all copyright and related or neighboring rights to
the JSON file and this document.
The JSON file contains data extracted from Unicode Character Database.
Copyright © 1991-2014 Unicode, Inc. All rights reserved. See
<http://www.unicode.org/copyright.html#Exhibit1> or
|doc/LICENSE.unicode|.
The JSON file contains data extracted from HTML Standard. "Written by
Ian Hickson (Google, ian@hixie.ch) - Parts © Copyright 2004-2014 Apple
Inc., Mozilla Foundation, and Opera Software ASA; You are granted a
license to use, reproduce and create derivative works of this
document."