CLDR-17202 kbd: add Bengali (bn) keyboard #3368
Where are the key ids from? They seem to follow some sort of transliteration scheme, but then there are ch, sh, w, etc.
heh that page is tagged with this issue, also from the Talk page:
I would be quite uncomfortable having this scheme; could we use one of the established ones? (ISO 15919 if it were up to me, or I could live with ALA-LC too.)
OK. I might use ALA-LC, as I have worked with that one.
I'll use the ALA-LC table at https://www.loc.gov/catdir/cpso/romanization/bengali.pdf
One of the test failures seems to be a CLDR bug, tracked at https://unicode-org.atlassian.net/browse/CLDR-17204
I checked, and the only substantial difference between ALA-LC and ISO 15919:2001 is sha in ALA-LC versus ṣa in ISO 15919. EDIT: the other difference is ṁ vs. ṃ for anusvara.
will need to borrow this one from the Assamese table
keyboards/3.0/bn.xml
Outdated
<!-- UNSHIFTED KEYS -->
<!-- E: (top) row -->
<key id="n̐" output="\u{0981}" /> <!-- candrabindu -->
could just be id="candrabindu" and avoid comment
This is per romanization table
keyboards/3.0/bn.xml
Outdated
<key id="au" output="\u{09CC}" />
<key id="pha" output="ফ" />

<key id="au-length" output="\u{09D7}" /> <!-- TODO: better name? -->
Just wanted to double-check that we want to encourage people to enter E + AU LENGTH MARK rather than VOWEL SIGN AU (the spec says the length mark exists for compatibility and does not have a meaning on its own).
I wasn't able to verify this one yet.
You're right, not sure what that one was doing here. I'll remove it.
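The relationship discussed in this thread can be checked with Python's standard `unicodedata` module. This is a minimal sketch, not part of the PR: it shows that U+09CC BENGALI VOWEL SIGN AU canonically decomposes to U+09C7 VOWEL SIGN E followed by U+09D7 AU LENGTH MARK, so the two-key sequence and the single vowel sign are canonically equivalent. The helper name `codepoints` is illustrative only.

```python
import unicodedata

SIGN_E = "\u09c7"          # BENGALI VOWEL SIGN E
AU_LENGTH_MARK = "\u09d7"  # BENGALI AU LENGTH MARK
SIGN_AU = "\u09cc"         # BENGALI VOWEL SIGN AU


def codepoints(text):
    """Return the code points of text as 'U+XXXX' strings."""
    return [f"U+{ord(ch):04X}" for ch in text]


# NFD splits the precomposed vowel sign into its two parts.
decomposed = unicodedata.normalize("NFD", SIGN_AU)

# NFC recombines the two-part sequence into the single vowel sign.
recomposed = unicodedata.normalize("NFC", SIGN_E + AU_LENGTH_MARK)

print(codepoints(decomposed))  # ['U+09C7', 'U+09D7']
print(codepoints(recomposed))  # ['U+09CC']
```

Because both spellings normalize to the same text, the question for the layout is only which one users should be encouraged to type, not which one ends up stored after normalization.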
keyboards/3.0/bn.xml
Outdated
<transform from="\u{q}\u{09C8}" to="ঐ" />
<transform from="\u{q}\u{09CB}" to="ও" />
<transform from="\u{q}\u{09CC}" to="ঔ" />
<transform from="\u{q}\u{09D7}" to="আ" />
Just to make it clear, the situation is:
A -> SIGN AA
SHIFT+A -> AU MARK
Q, A -> LETTER A
Q, SHIFT+A -> LETTER AA
(notably Q and long sign produce short letter unlike Q in combination with everything else)
This does match the upstream
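The behaviour summarized in this thread can be simulated in a few lines of Python. This is only a sketch under stated assumptions: the literal `"q"` stands in for whatever the Q key emits, the `TRANSFORMS` table copies the four quoted rules plus the `Q, A` case from the comment, and `apply_transforms` and `type_keys` are hypothetical helpers rather than anything from the CLDR or Keyman implementations.

```python
# Suffix transforms: Q followed by a dependent sign becomes an independent letter.
TRANSFORMS = {
    "q\u09be": "\u0985",  # Q, A (SIGN AA)        -> LETTER A (per the review comment)
    "q\u09d7": "\u0986",  # Q, SHIFT+A (AU MARK)  -> LETTER AA
    "q\u09c8": "\u0990",  # Q + VOWEL SIGN AI     -> LETTER AI
    "q\u09cb": "\u0993",  # Q + VOWEL SIGN O      -> LETTER O
    "q\u09cc": "\u0994",  # Q + VOWEL SIGN AU     -> LETTER AU
}


def apply_transforms(buffer):
    """Rewrite the end of the buffer if it matches one of the transforms."""
    for source, target in TRANSFORMS.items():
        if buffer.endswith(source):
            return buffer[: -len(source)] + target
    return buffer


def type_keys(outputs):
    """Feed key outputs one at a time, applying transforms after each key."""
    buffer = ""
    for output in outputs:
        buffer = apply_transforms(buffer + output)
    return buffer


print(type_keys(["q", "\u09be"]))  # অ
print(type_keys(["q", "\u09d7"]))  # আ
```

The table makes the asymmetry noted in the comment visible: the long sign paired with Q yields the short letter, while the AU mark yields the long one.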
OK, I checked the assignments. It's a bit difficult to review, though: why order the keys in the bag in layout order rather than [Brahmic] alphabetical order?
This probably should have a rupee sign.
Still todo
It was for ease of entry. I can sort them now though.
So, as with a number of other comments: this is a port of an existing keyboard file (see the link in the XML). I'm fine with improving it, just noting that step 1 is a port. Thanks for the comments and thorough review. I'll keep working on it.
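For the reordering offered above, one possible approach is to sort the key entries by their output. The sketch below assumes that Unicode code point order within the Bengali block is an acceptable approximation of Brahmic alphabetical order; the `keys` list is a hypothetical sample built from three ids quoted in this review, not the full key bag.

```python
# (id, output) pairs in layout order, sampled from the snippets in this review.
keys = [
    ("pha", "\u09ab"),  # ফ
    ("au", "\u09cc"),   # VOWEL SIGN AU
    ("n̐", "\u0981"),    # candrabindu
]

# Sorting by output orders the entries by code point within the Bengali block.
sorted_keys = sorted(keys, key=lambda entry: entry[1])

for key_id, output in sorted_keys:
    print(f"U+{ord(output[0]):04X} {key_id}")
```

Keys whose output is empty or multi-character would need a deliberate tie-breaking rule, which this sketch does not attempt.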
Force-pushed from 348d80c to ff24ba3.
Force-pushed from f246d4c to ad6ad45.
Hooray! The files in the branch are the same across the force-push. 😃 ~ Your Friendly Jira-GitHub PR Checker Bot
Rebased and should build clean. @miloush or anyone, can I get an approval on it?
- add a test
- rename lengthener to au-lengthener
- add displays for 3 keys
- fix XML order to match spec
- spec also allows <startContext> to be optional in test files
- reorder keys for review
- fix ya/sha confusion
Force-pushed from 183f086 to 44bb8f0.
Hooray! The files in the branch are the same across the force-push. 😃 ~ Your Friendly Jira-GitHub PR Checker Bot
- document the reorders
Force-pushed from 8808917 to c6dd369.
Notice: the branch changed across the force-push! ~ Your Friendly Jira-GitHub PR Checker Bot
Force-pushed from d86e6d0 to c6dd369.
In keymanapp/keyman#10111, I'm tracking a build issue that prevents kbd-check from working.
Did we say the processing turns everything into NFD? Are the reordering matches done before or after that? Or are the match expressions also normalized? I ask because U+09CB O and U+09CC AU, which are listed under right-side vowels, decompose into left-side + right-side vowels; for U+09CC the right-side part is U+09D7 AU LENGTH MARK, which is not part of the rules.
We don't care about characters not entered through the keyboard (for example, via Alt+Numpad), correct? (I'm especially thinking about Vedic marks.)
I'm working on a normalization document for review: https://unicode-org.atlassian.net/browse/CLDR-17192
We will say that the match acts as if it's in NFD.
Matches will be in NFD, and the expression will be normalized also.
I think I removed U+09D7 earlier because of normalization, but I should re-add it for this reason.
The reorder rules (being script specific) are eventually supposed to be imported data separate from the keyboard proper. But besides that, someone might enter something with Alt-numpad and then click and type. Just like I could type |
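The concern about U+09D7 can be made concrete with a small check. The following is a hedged sketch, not the CLDR implementation: it assumes matching behaves as if both the rule text and the input were in NFD, and it uses two hypothetical helpers, `contains_as_nfd` and `marks_missing_from_rules`, to find marks that appear once key output is decomposed but that a rule set does not mention.

```python
import unicodedata


def nfd(text):
    """Normalize text to NFD."""
    return unicodedata.normalize("NFD", text)


def contains_as_nfd(literal, text):
    """Match a literal against text with both sides normalized to NFD."""
    return nfd(literal) in nfd(text)


def marks_missing_from_rules(key_outputs, rule_chars):
    """Return marks that occur in NFD key output but not in rule_chars."""
    covered = set(nfd("".join(rule_chars)))
    missing = set()
    for output in key_outputs:
        for ch in nfd(output):
            # General categories Mn, Mc and Me all start with "M".
            if unicodedata.category(ch).startswith("M") and ch not in covered:
                missing.add(ch)
    return missing


# Hypothetical rule set listing only VOWEL SIGN E and VOWEL SIGN AA.
rules = ["\u09c7", "\u09be"]
missing = marks_missing_from_rules(["\u09cb", "\u09cc"], rules)
print(sorted(f"U+{ord(ch):04X}" for ch in missing))  # ['U+09D7']

# A rule written with precomposed AU still matches decomposed input.
print(contains_as_nfd("\u09cc", "\u0995\u09c7\u09d7"))  # True
```

With U+09D7 added to the rule characters, the first check returns an empty set, which is the situation the re-added reorder entry is meant to produce.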
- re-add U+09D7 with a note from upstream reorder
@miloush PTAL, U+09D7 is now included in the reorders
@miloush OK to merge?
@miloush thanks!
CLDR-17202
ALLOW_MANY_COMMITS=true