Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add Western Arabic Numerals (ASCII) to A.3 Numeral characters #151

Open
behnam opened this issue Oct 31, 2017 · 5 comments
Open

Add Western Arabic Numerals (ASCII) to A.3 Numeral characters #151

behnam opened this issue Oct 31, 2017 · 5 comments

Comments

@behnam
Copy link
Member

behnam commented Oct 31, 2017

Western Arabic Numerals, encoded as ASCII digits, are the main numerals used in all West Arab regions. Table in Section A.3 Numeral characters is missing them and needs to be fixed.

@behnam behnam added the cldr label Oct 31, 2017
@behnam
Copy link
Member Author

behnam commented Oct 31, 2017

Root cause must be CLDR only recognizing Eastern Arabic Numerals (Arabic-Indic digits) as the number system for the ar locale.

@r12a
Copy link
Contributor

r12a commented Oct 31, 2017

I know we have been trying to follow CLDR (or other standards) so far for the list of characters, but i personally see no problem with diverging if we discover that those don't represent reality. (I actually see it as our job to find places where CLDR needs to be corrected.) So i have no problem with adding Western numerals to the table.

@behnam
Copy link
Member Author

behnam commented Oct 31, 2017

/cc @brawer

Right. What we've been doing so far has been to file tickets with CLDR for the fixes, and either wait on those to be resolved, or hard-code what we need.

In this case, it's a tricky matter. I have talked to some people about it, and there isn't (and may never be) a locale in CLDR for Western Arabic and Eastern Arabic, to assign these properties to.

Maybe, we should look at Numbering System property for the main locale (ar) and all its sublocales. In this case, this would give use both arab and latn numerals: https://unicode.org/cldr/charts/latest/summary/ar.html

@brawer
Copy link

brawer commented Oct 31, 2017

Not sure if this helps, but you can put registered number system identifiers into BCP47 extension U. For example, ar-u-nu-arabext is a valid BCP47 language code. See Unicode TR35 for background.

@shervinafshar
Copy link
Contributor

https://unicode.org/cldr/charts/latest/summary/ar.html#5
Main: arab
Sublocale: latn
CLDR data is correct. The way we're consuming it is not.

@behnam behnam added this to the First_Public_Draft milestone Nov 1, 2017
@shervinafshar shervinafshar removed the cldr label Nov 1, 2017
@r12a r12a added the drafting label Mar 8, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

4 participants