Skip to content

Commit

Permalink
Fixed trancoders definition so ReSpec links it automagically.
Browse files Browse the repository at this point in the history
Subtle tweak to a note.
Added correct "Example" markup to the example about Persian ZWNJ.
  • Loading branch information
aphillips committed Feb 1, 2016
1 parent f16f449 commit d6d4a84
Showing 1 changed file with 5 additions and 4 deletions.
9 changes: 5 additions & 4 deletions index.html
Expand Up @@ -257,7 +257,7 @@ <h3>Terminology and Notation</h3>
<p>A <dfn data-lt="legacy character encoding|legacy character encodings">legacy
character encoding</dfn> is a character encoding not based on the
Unicode character set.</p>
<p>A <dfn>transcoder</dfn> is a process that converts code units (generally bytes) from a <a>legacy character encoding</a>
<p>A <dfn data-lt="transcoder|transcoders">transcoder</dfn> is a process that converts code units (generally bytes) from a <a>legacy character encoding</a>
to a <a href="http://www.w3.org/TR/2005/REC-charmod-20050215/#Unicode_Encoding_Form">Unicode encoding form</a>.</p>
<p><dfn id="def_syntactic_content">Syntactic content</dfn> is any text in a document format or
protocol that belongs to the structure of the format or protocol. This
Expand Down Expand Up @@ -1057,15 +1057,16 @@ <h3>Unicode Controls and Invisible Markers</h3>
joining and shape selection in Arabic and Indic scripts. For example, ZWJ and ZWNJ are used in some Indic scripts to allow
authors to specify the shape that certain conjuncts take. See the
discussion in Chapter 12 of [[!Unicode]].</p>
<div class="example">
<aside class="example">
<div class="example-title marker"></div>
<p>The <span class="uname" translate="no">Zero Width Non-Joiner</span> is used in Persian to
prevent certain "normal" Arabic script joining. In these cases, the
character affects the meaning. For example, the word تنها ("alone") and the word تن‌ها&nbsp; ("bodies"
or "corpuses") are encoded as "<span class="uname">U+062A
U+0646 U+0647 U+0627</span>" and "<span class="uname">U+062A U+0646
<span style="text-decoration:underline">U+200C</span> U+0647 U+0627</span>"
respectively, the only difference being the ZWNJ in the latter word.</p>
</div>
</aside>
<p>Variation selectors (<span class="uname">U+FE00</span> through
<span class="uname" translate="no">U+FE0F</span>) are
characters used to select an alternate appearance or glyph
Expand Down Expand Up @@ -1302,7 +1303,7 @@ <h4>Requirements for Resources</h4>
is at odds with how transcoders are actually implemented, this version no longer includes
this requirement. Bear in mind that most transcoders produce NFC output and that even those
transcoders that do not produce NFC for all characters mainly produce NFC for the preponderence
of characters. In particular, there are no transcoders that produce decomposed forms where
of characters. In particular, there are no commonly-used transcoders that produce decomposed forms where
precomposed forms exist or which produce a different combining character sequence from the
normalized sequence.</p>

Expand Down

0 comments on commit d6d4a84

Please sign in to comment.