diff --git a/index.html b/index.html index 8bbc8ed..ba24b79 100644 --- a/index.html +++ b/index.html @@ -94,10 +94,10 @@ "W3-ARAB-MATH": { "authors": [ - "Azzeddine Lazrek", - "Mustapha Eddahibi", - "Khalid Sami", - "Bruce R. Miller" + "Azzeddine Lazrek", + "Mustapha Eddahibi", + "Khalid Sami", + "Bruce R. Miller" ], "href": "https://www.w3.org/TR/arabic-math/", "publisher": "World Wide Web Consortium", @@ -1310,8 +1310,10 @@

Punctuation @@ -1332,10 +1334,15 @@

Text segmentation script languages.

-
Should all ligatures be selectable as a single unit, or as individual parts - corresponding to the underlying characters?
+
+ Should all ligatures be selectable as a single unit, or as individual parts corresponding + to the underlying characters? +
+ -
Expand on the exceptions.
+
+ Expand on the exceptions. +
@@ -1362,8 +1369,10 @@

Positioning diacritics relative to base charactersSome applications allow adjustment of the distance between the diacritics and the base character. Is this a requirement for most text systems? +
  • What about adjustment to the horizontal position of the diacritic?
  • +
  • Should it be possible to influence whether a font places the kasra below the base character or immediately below the shadda, when combined with the latter?
  • @@ -1400,6 +1409,7 @@

    Letter-spacing @@ -1515,11 +1525,18 @@

    Styling individual letters

    Handling oblique and italicised text in Arabic

    +

    Describe the problem here.

    -
    Which way should oblique/italic text slant in Arabic?
    -
    Misuse of generic font styles.
    +
    + Which way should oblique/italic text slant in Arabic? +
    + + +
    + Misuse of generic font styles. +
    @@ -1534,7 +1551,7 @@

    Considerations for mixed-script text
    - What are the font-size aspects that must be considered in mixed text scenarios? + What are the font-size aspects that must be considered in mixed text scenarios?
    @@ -1543,227 +1560,439 @@

    Considerations for mixed-script text

    Numbers

    +
    -

    Preferred Terminology

    -

    Before entering this section in we need to introduce few preferred terminological - conventions for disambiguation and simplicity.

    - -
    -
    -

    Families of Numerals

    -

    There are three families of numerals used with languages using the Arabic Script. - The first is known as European Numerals; the second family is Arabic-Indic Numerals. - The third is Eastern Arabic-Indic Numerals. The following table, based on a similar table in - [[W3-ARAB-MATH]] exhibits these three families:

    - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
    FamilyUnicode RangeNumeral DigitsRegions in Use
    EuropeanU+0030..U+00390123456789Western Arabic-speaking countries; e.g. Algeria or Morocco.
    Arabic-IndicU+0660..U+0669٠١٢٣٤٥٦٧٨٩Arabic-speaking countries of the Middle-East, eastern Arabic-speaking countries; e.g. Egypt.
    Eastern Arabic-IndicU+06F0..U+06F9۰۱۲۳۴۵۶۷۸۹Iran and Afghanistan.
    - -

    In this table, those three numerals which have different shapes - between Arabic-Indic and Eastern Arabic-Indic are highlighted.

    - -

    Arabic numbers are written with the lowest significant digits to - the right and the highest digits to the left. That arrangement is - identical to the Western one, even though Arabic script is written - from right to left. Numbers with many digits use - decimal thousands separators.

    -

    European digits are used with , [U+002C COMMA] and . [U+002E FULL STOP] - as decimal and thousands separator, respectively, or vice versa.

    - -

    Arabic-Indic numerals use two specific separators:

    - -

    An important fact to note here is the bidirectional category of - these numbers.

    - -

    The difference in Bidi category between Arabic-Indic digits and - Eastern Arabic-Indic digits is due to the difference in Bidi - behavior desired in Arabic vs. Persian.

    -

    What is the origin of this decision. More important, what is the observed effect of these differences in normal Arabic script text?

    -

    As a consequence, a sentence like « Five is written ۵ in Iran and ٥ - in Egypt », say, will give (in RTL context)

    -

    « ‫Five is written ۵ in Iran and ٥ in Egypt‬ »

    -

    requiring the use of markup to correct it. Which is not the case - with the sentence « Five is written ۵ in Iran and 5 in Morocco ».

    -
    -
    -

    Arabic number in other - uses

    -

    Numbers do not always appear alone, and may appear alongside other - characters like financial symbols, fraction signs, decimal and/or - thousands signs (excluding math expressions here). Also there are Arabic-specific signs such as ؉ [U+0609 ARABIC-INDIC PER MILLE SIGN] and ٪ [U+066A ARABIC PERCENT SIGN], as well as the Arabic decimal " ٫ " (U+66B) - and thousands  " ٬ " (U+066C) separators, mentioned earlier. These are - mostly used with Arabic-Indic digits.

    -

    Numerals can also come separated by or mixed with space or other - signs. Example are phone numbers - +12 34 56 78 89, cars licence plate like - 123 د‎ 4,  quantities 37.5°,   ٥٠ كلم (50km) - etc.

    -

    Particular attention is needed here. Firstly, numbers have a weak - directionality with regards to the Bidi algorithm. For example, - alongside a number, certain otherwise neutral characters, such as - negative/positive sign, currency or degree symbols, are likely to be - treated as part of the number rather than a neutral.

    -

    Secondly, the placement of the accompanying signs and symbols may - depend on the region: generally Middle East (or anglophone) vs. - Western (or francophone) regions. This is not to mention punctuation - signs.

    - -

    Other issues

    -

    How to know that a sign (space, comma ...) is a separator or a - sign within a number? +12 34 56 78 90 - is a phone number or a sequence of digits? Which may be inverted - in RTL. A tip is to use a syntax like 12.34.56.78.90 - or 12-34-56-78-90 for phones.

    -

    Maybe mention the Decimal Separator Key Symbol " ⎖ " (U+2396), - used with keyboards (resembles an apostrophe)

    - -
    -

    Old text bellow

    + +
    +

    Families of Numerals +

    + + +

    There are three families of numerals used with languages using the Arabic Script. The + first is known as European Numerals; the second family is Arabic-Indic Numerals. The third + is Eastern Arabic-Indic Numerals. The following table, based on a similar table in + [[W3-ARAB-MATH]] exhibits these three families:

    + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
    FamilyUnicode RangeNumeral DigitsRegions in Use
    EuropeanU+0030..U+00390123456789Western Arabic-speaking countries; e.g. Algeria or Morocco.
    Arabic-IndicU+0660..U+0669٠١٢٣٤٥٦٧٨٩Arabic-speaking countries of the Middle-East, eastern Arabic-speaking countries; + e.g. Egypt.
    Eastern Arabic-IndicU+06F0..U+06F9۰۱۲۳۴۵۶۷۸۹Iran and Afghanistan.
    + + +

    In this table, those three numerals which have different shapes between Arabic-Indic and + Eastern Arabic-Indic are highlighted.

    + + +

    Arabic numbers are written with the lowest significant digits to the right and the + highest digits to the left. That arrangement is identical to the Western one, even though + Arabic script is written from right to left. Numbers with many digits use decimal thousands + separators.

    + + +

    European digits are used with , [U+002C COMMA] and . [U+002E FULL STOP] + as decimal and thousands separator, respectively, or vice versa.

    + + + + + +

    Arabic-Indic numerals use two specific separators:

    + + + + + +

    An important fact to note here is the bidirectional category of these numbers.

    + + + + + +

    The difference in Bidi category between Arabic-Indic digits and Eastern Arabic-Indic + digits is due to the difference in Bidi behavior desired in Arabic vs. Persian.

    + + +

    What is the origin of this decision. More important, what is the observed + effect of these differences in normal Arabic script text?

    + + +

    As a consequence, a sentence like « Five is written ۵ in Iran and ٥ in Egypt », say, + will give (in RTL context)

    + + +

    « ‫Five is written ۵ in Iran and ٥ in Egypt‬ »

    + + +

    requiring the use of markup to correct it. Which is not the case with the sentence « + Five is written ۵ in Iran and 5 in Morocco ».

    +
    + + +
    +

    Arabic number in other uses +

    + + +

    Numbers do not always appear alone, and may appear alongside other characters like + financial symbols, fraction signs, decimal and/or thousands signs (excluding math + expressions here). Also there are Arabic-specific signs such as ؉ [U+0609 ARABIC-INDIC PER + MILLE SIGN] and ٪ + [U+066A ARABIC PERCENT SIGN], as well as the Arabic + decimal " ٫ " (U+66B) and thousands  " ٬ " (U+066C) separators, mentioned earlier. + These are mostly used with Arabic-Indic digits.

    + + +

    Numerals can also come separated by or mixed with space or other signs. Example are + phone numbers +12 34 56 78 89, cars licence plate like + 123 د‎ 4,  quantities 37.5°,   ٥٠ كلم (50km) etc.

    + + +

    Particular attention is needed here. Firstly, numbers have a weak directionality with + regards to the Bidi algorithm. For example, alongside a number, certain otherwise neutral + characters, such as negative/positive sign, currency or degree symbols, are likely to be + treated as part of the number rather than a neutral.

    + + +

    Secondly, the placement of the accompanying signs and symbols may depend on the region: + generally Middle East (or anglophone) vs. Western (or francophone) regions. This is not to + mention punctuation signs.

    + + + + + +

    Other issues

    + + +

    How to know that a sign (space, comma ...) is a separator or a sign within + a number? +12 34 56 78 90 is a phone number or a sequence + of digits? Which may be inverted in RTL. A tip is to use a syntax like + 12.34.56.78.90 or 12-34-56-78-90 for phones.

    + + +

    Maybe mention the Decimal Separator Key Symbol " ⎖ " (U+2396), used with + keyboards (resembles an apostrophe)

    +
    + + +

    Old text bellow +

    + +

    Arabic script uses non-European digits for numbers in certain locales and situations.

    Arabic digits are also used for counters (see ).

    -
    Describe the arabic-indic digits, and when they are used, including the distinction - between arabic-indic and eastern-arabic-indic digits.
    +
    + Describe the arabic-indic digits, and when they are used, including the distinction between + arabic-indic and eastern-arabic-indic digits. +
    -
    Provide resources and guidelines on how to choose the right set of numerals based on - the language.
    +
    + Provide resources and guidelines on how to choose the right set of numerals based on the + language. +
    @@ -1791,11 +2020,11 @@

    Line breaking
    -
    @@ -1856,11 +2088,11 @@

    Hyphenation
    -
    @@ -1874,24 +2106,22 @@

    Justification
    - -
    - + +
    -
    -
    -
    -

    There are a number of different ways to produce justified text in Arabic. In some cases - several of these methods may be combined. In other cases, certain methods are - disallowed.

    +

    There are a number of different ways to produce justified text in Arabic. In some cases + several of these methods may be combined. In other cases, certain methods are disallowed.

    -

    Typical methods include:

    +

    Typical methods include:

    -

    Of the four basic justification methods (flush left, flush right, justified, and @@ -2300,7 +2525,6 @@

    Paragraph and line alignment
    -