Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Confusion between KOI8-U and KOI8-RU encodings #74

Closed
bhaible opened this issue Oct 3, 2016 · 4 comments
Closed

Confusion between KOI8-U and KOI8-RU encodings #74

bhaible opened this issue Oct 3, 2016 · 4 comments

Comments

@bhaible
Copy link

bhaible commented Oct 3, 2016

The current draft maps the labels "koi8-u" and "koi8-ru" to a single encoding, and the mapping table that it uses (index-koi8-u.txt) does not match either of the widely used mapping tables for KOI8-U and KOI8-RU.

In detail:

Glibc and other software consider KOI8-U and KOI8-RU to be different.
KOI8-U was defined through an RFC, see https://en.wikipedia.org/wiki/KOI8-U.
KOI8-RU was defined through a draft RFC that was never finalized, see https://www.terena.org/activities/multiling/koi8-ru/

The mapping in index-koi8-u.txt differs from the common KOI8-U mapping at the positions 0xAE, 0xBE.

The mapping in index-koi8-u.txt differs from the common KOI8-RU mapping at the positions 0x93, 0x96..0x99, 0x9B..0x9D, 0x9F.

For details about the mapping tables, see
http://haible.de/bruno/charsets/conversion-tables/index.html
http://haible.de/bruno/charsets/conversion-tables/KOI8-U.html
http://haible.de/bruno/charsets/conversion-tables/KOI8-RU.html

Do you have data about who actually uses KOI8-U and KOI8-RU and how?
Is it necessary to deal with these two encodings at all?

@annevk
Copy link
Member

annevk commented Oct 4, 2016

See the discussion in https://www.w3.org/Bugs/Public/show_bug.cgi?id=17053.

@bhaible
Copy link
Author

bhaible commented Oct 4, 2016

See the discussion in https://www.w3.org/Bugs/Public/show_bug.cgi?id=17053.

The main argument I can see this in this discussion is "IE takes the labels koi8-u and koi8-ru to mean KOI8-RU and not KOI8-U."

It amounts to the question: Which of Internet Explorer and Mozilla is more used with KOI8-U and KOI8-RU?

@annevk
Copy link
Member

annevk commented Oct 4, 2016

Historically, most likely IE given the market dominance it had.

@annevk
Copy link
Member

annevk commented Nov 16, 2016

Given that we already discussed this and nobody cared enough to gather data back then, closing this. I don't think we'll end up wiser this time around.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Development

No branches or pull requests

2 participants