diff --git a/index.html b/index.html index 09c1283..59aa236 100644 --- a/index.html +++ b/index.html @@ -730,15 +730,8 @@

Unicode Normalization

Unicode Normalization Form C.

Canonical vs. Compatibility Equivalence

-

Unicode defines two types of equivalence between characters: canonical - equivalence and compatibility equivalence.

-

Canonical equivalence is a fundamental equivalency - between Unicode characters or sequences of Unicode characters that - represent the same abstract character. When correctly displayed, - these should always have the same visual appearance and behavior. - Generally speaking, two canonically equivalent Unicode texts should - be considered to be identical as text. Unicode defines a process called - canonical decomposition that removes these primary distinctions between two texts.

+

Unicode defines two types of equivalence between characters: canonical equivalence and compatibility equivalence.

+

Canonical equivalence is a fundamental equivalency between Unicode characters or sequences of Unicode characters that represent the same abstract character. Equivaent sequences, ideally should have the same visual appearance (although there are many factors that can cause them to appear somewhat differently) and they should be treated and processed equivalently. Generally speaking, two canonically equivalent Unicode texts should be considered to be identical as text. Unicode defines a process called canonical decomposition that removes these primary distinctions between two differently-encoded but canonically equivalent texts.

Examples of canonical equivalence defined by Unicode include: