From 81b5bd6324b4b2aa56bdd05253051d0c0ac30a19 Mon Sep 17 00:00:00 2001 From: "@aphillips" Date: Tue, 24 Oct 2017 14:35:03 -0700 Subject: [PATCH] Issue 130; also fixed http->https for ReSpec. --- index.html | 17 ++++++++--------- 1 file changed, 8 insertions(+), 9 deletions(-) diff --git a/index.html b/index.html index 216f39a..a51ac0d 100644 --- a/index.html +++ b/index.html @@ -19,7 +19,7 @@ noRecTrack: true, shortName: "charmod-norm", copyrightStart: "2004", - edDraftURI: "http://w3c.github.io/charmod-norm/", + edDraftURI: "https://w3c.github.io/charmod-norm/", // lcEnd: "2009-08-05", @@ -39,7 +39,7 @@ // name of the WG wg: "Internationalization Working Group", - wgURI: "http://www.w3.org/International/core/", + wgURI: "https://www.w3.org/International/core/", wgPublicList: "www-international", bugTracker: { new: "https://github.com/w3c/charmod-norm/issues", open: "https://github.com/w3c/charmod-norm/issues" } , @@ -1163,25 +1163,24 @@

Invisible Unicode Characters

text. Because many of these characters are invisible or do not have keyboard equivalents, users are not always aware of their presence or absence. As a result, these characters can interfere with string matching when they are part of the encoded character sequence but the expected matching text does not also include them. Some examples of these characters include:

-

The Unicode control characters U+200D Zero Width Joiner (also known + +

The Unicode control characters U+200D Zero Width Joiner (also known as ZWJ) and U+200C Zero Width Non-Joiner (also known as ZWNJ). While these characters can be used to control ligature formation—either preventing the formation of undesirable ligatures or encouraging the formation of desirable ones—their primary use is to control the joining and shape selection in complex scripts such as the Arabic or various of the Indic scripts. - For example, ZWJ and ZWNJ are used in some Indic scripts to allow - authors to control the shape that certain conjuncts take. See the + Some Indic scripts use the ZWJ and ZWNJ characters to allow authors to control the shape that certain conjuncts take. See the discussion in Chapter 12 of [[!Unicode]].

- +

The ZWJ character is also used in forming certain emoji sequences, which is discussed in more detail below.