The current rules for decimal formats are too restrictive (i.e., too much focused on Anglo-Saxon formatting rules). The most prominent case is the Arabic exponent-separator „character“, which consists of two characters: عر (https://www.localeplanet.com/icu/ar/). The exponent separator of other locales is not restricted to a single character either. For example, se-NO uses ·10^.
When we include the ICU library in the analysis, we also find minus-sign, percent and per-mille properties that are longer than 1 character. Examples:
- The
minus-sign character for he consists of 200e and 002d (200e is the Left-to-Right Mark).
- The Arabic
percent character consists of 066a and 061c (061c is the “Arabic Letter Mark”).
- The
per-mille property of en-US-posix is 0/00.