From d6d4a84e3a617a8d698ddaea0a8689d49cea4b2d Mon Sep 17 00:00:00 2001 From: "@aphillips" Date: Mon, 1 Feb 2016 15:13:07 -0800 Subject: [PATCH] Fixed trancoders definition so ReSpec links it automagically. Subtle tweak to a note. Added correct "Example" markup to the example about Persian ZWNJ. --- index.html | 9 +++++---- 1 file changed, 5 insertions(+), 4 deletions(-) diff --git a/index.html b/index.html index d8cdd6f..9286efd 100644 --- a/index.html +++ b/index.html @@ -257,7 +257,7 @@

Terminology and Notation

A legacy character encoding is a character encoding not based on the Unicode character set.

-

A transcoder is a process that converts code units (generally bytes) from a legacy character encoding +

A transcoder is a process that converts code units (generally bytes) from a legacy character encoding to a Unicode encoding form.

Syntactic content is any text in a document format or protocol that belongs to the structure of the format or protocol. This @@ -1057,7 +1057,8 @@

Unicode Controls and Invisible Markers

joining and shape selection in Arabic and Indic scripts. For example, ZWJ and ZWNJ are used in some Indic scripts to allow authors to specify the shape that certain conjuncts take. See the discussion in Chapter 12 of [[!Unicode]].

-
+
+

Variation selectors (U+FE00 through U+FE0F) are characters used to select an alternate appearance or glyph @@ -1302,7 +1303,7 @@

Requirements for Resources

is at odds with how transcoders are actually implemented, this version no longer includes this requirement. Bear in mind that most transcoders produce NFC output and that even those transcoders that do not produce NFC for all characters mainly produce NFC for the preponderence - of characters. In particular, there are no transcoders that produce decomposed forms where + of characters. In particular, there are no commonly-used transcoders that produce decomposed forms where precomposed forms exist or which produce a different combining character sequence from the normalized sequence.