Issues with s-cedilla in several Turkic and related languages #73

Open
MrBrezina opened this issue Feb 17, 2022 · 2 comments

@MrBrezina
Member

S-comma (U+0218) and s-comma (U+0219) are listed as required letters instead of S-cedilla (U+015E) and s-cedilla (U+015F). The required letters have to be S-cedilla and s-cedilla.

Expanding on the issue mentioned in #71 and #72.
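
For anyone comparing the two pairs, here is a minimal sketch that prints the Unicode name and canonical decomposition of each code point, using only Python's standard `unicodedata` module. The `PAIRS` table and the `describe` helper are names made up for this illustration and are not part of the project's data or tooling.

```python
import unicodedata

# The two competing pairs from this issue (labels are illustrative only).
PAIRS = {
    "cedilla": ("\u015E", "\u015F"),
    "comma below": ("\u0218", "\u0219"),
}


def describe(ch):
    """Return (code point, Unicode name, canonical decomposition) for one character."""
    return (
        f"U+{ord(ch):04X}",
        unicodedata.name(ch),
        unicodedata.decomposition(ch),
    )


for label, chars in PAIRS.items():
    for ch in chars:
        print(label, *describe(ch))
```

The cedilla letters decompose to a base letter plus U+0327 (combining cedilla), while the comma letters decompose to a base letter plus U+0326 (combining comma below), so the two pairs are distinct characters and not just glyph variants.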

@kontur
Contributor

kontur commented Feb 17, 2022

👍 Both of the other sources we reference (Omniglot and Wikipedia) list the cedilla versions, too.

@kontur kontur self-assigned this Feb 17, 2022
@alerque
Contributor

alerque commented Feb 17, 2022

I originally thought this was correct, but I am not so sure anymore and have backed out my related changes in #72 for now.

The remaining Turkic languages that list both S-cedilla and S-commaaccent variants probably do so because they are Romanizations and there is more than one scheme. Most of these are Latin variants of a default Cyrillic-based alphabet. Some Cyrillic → Latin schemes do call for commaaccent glyphs, even if the most prevalent use case for these will use the modern Turkish alphabet with its cedilla-based glyph.

If the goal is to be canonical, research will be needed into each case. If the goal is to cover all possible Romanization schemes, then having both might be correct.

Note this is also true for other letters in affected languages such as T-cedilla and C-cedilla.
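
As a starting point for the per-language research, the check could be sketched roughly as follows. This assumes each language's required characters are available as a plain string; the function name `classify_variants` and the mapping `CEDILLA_TO_COMMA` are hypothetical and do not reflect the project's actual data format or API.

```python
# Hypothetical mapping from cedilla letters to their comma-below counterparts.
# C with cedilla (U+00C7 / U+00E7) is omitted because Unicode has no
# precomposed C with comma below to confuse it with.
CEDILLA_TO_COMMA = {
    "\u015E": "\u0218",  # S
    "\u015F": "\u0219",  # s
    "\u0162": "\u021A",  # T
    "\u0163": "\u021B",  # t
}


def classify_variants(required):
    """Report which diacritic family a set of required characters uses.

    Returns (status, cedilla_chars, comma_chars), where status is one of
    "cedilla", "comma", "both" or "none".
    """
    chars = set(required)
    cedilla = sorted(c for c in CEDILLA_TO_COMMA if c in chars)
    comma = sorted(c for c in CEDILLA_TO_COMMA.values() if c in chars)
    if cedilla and comma:
        status = "both"
    elif cedilla:
        status = "cedilla"
    elif comma:
        status = "comma"
    else:
        status = "none"
    return status, cedilla, comma
```

A language that comes back as "both" would be a candidate for the case-by-case research described above, and one that comes back as "comma" where the sources list cedilla forms would match the original report.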

@kontur kontur added the data Issues in the language data label Jun 8, 2023