Add support of the cp1125 encoding #63867
Comments
The proposed patch adds support for the CP1125 encoding. The CP1125 encoding (also known as IBM1125, CP866U (in Microsoft), 866nav (in TeX), RUSCII) is the standard DOS encoding for the Ukrainian language. It is the only DOS encoding (or at least the most popular one) suitable for Ukrainian, and it is the Ukrainian government standard for DOS. The CP1125 encoding differs from the CP866 encoding only in 6 codes for Ukrainian letters. http://www-03.ibm.com/systems/resources/systems_i_software_globalization_pdf_cp01125z.pdf
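For readers who want to see where the two code pages diverge, here is a minimal sketch. It assumes a Python version that ships both the `cp866` and `cp1125` codecs (3.4 or later); the helper name `differing_bytes` is made up for illustration and is not part of the patch.

```python
# Compare two single-byte codecs position by position.
# Assumption: both codecs are available in the running interpreter.

def differing_bytes(codec_a, codec_b):
    """Return the byte values that decode to different characters."""
    result = []
    for value in range(256):
        raw = bytes([value])
        # errors="replace" keeps the loop going if a position is undefined.
        char_a = raw.decode(codec_a, errors="replace")
        char_b = raw.decode(codec_b, errors="replace")
        if char_a != char_b:
            result.append((value, char_a, char_b))
    return result

diff = differing_bytes("cp866", "cp1125")
for value, in_cp866, in_cp1125 in diff:
    print(f"0x{value:02x}: cp866={in_cp866!r} cp1125={in_cp1125!r}")

# A Ukrainian word containing 'ґ' round-trips through cp1125.
word = "ґанок"
assert word.encode("cp1125").decode("cp1125") == word
```

Encoding the same word with `cp866` raises `UnicodeEncodeError`, which is the practical reason a separate codec is needed for Ukrainian text.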
See also issue bpo-19459.
Nowadays, a good motivation for supporting a new codec is to be able to start Python 3. For example, I added cp65001 because some users try Python 3 with this Windows code page. It looks like at least one user is unable to start Python 3 because he/she uses GEORGIAN-PS as the locale encoding (issue bpo-19459). For cp1125: is it used as the ANSI code page on Windows? Otherwise, how do you use this encoding? Supporting all encodings in the world is meaningless because there are too many encodings.
No, it is not an official Microsoft code page (it was introduced by IBM in its PC-DOS). AFAIK this encoding is still widely used in banking software. My old text files were written in this encoding. ;)
On 21.11.2013 09:34, Serhiy Storchaka wrote:
+1 for adding this. It gets enough Google hits to be worth adding. Please also add the aliases you mentioned. Thanks, Marc-Andre Lemburg
I have already added these aliases. I should remove the 'cp866nav' alias, which I added at the last moment, from Lib/encodings/aliases.py because it is actually a slightly different encoding (it has two Belarusian letters "Ўў" instead of the cp866 characters "·√" at positions 0xfa-0xfb).
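A quick way to check which names resolve to the new codec is `codecs.lookup`. This is only a sketch: the alias names tried below are taken from this discussion, and which of them a given Python version actually registers is an assumption, so unknown names are reported as `None` instead of raising. The helper `resolve` is hypothetical.

```python
import codecs

def resolve(name):
    """Return the canonical codec name for `name`, or None if unknown."""
    try:
        return codecs.lookup(name).name
    except LookupError:
        return None

# Candidate names from the thread; not all are guaranteed to be registered.
for alias in ("cp1125", "ibm1125", "cp866u", "ruscii", "cp866nav"):
    print(alias, "->", resolve(alias))

# The two positions mentioned above, as cp866 decodes them.
tail = bytes([0xFA, 0xFB])
print(tail.decode("cp866"))
```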
So what is the decision?
+1
New changeset d0fd68ef1aa9 by Serhiy Storchaka in branch 'default':
Thanks all.
New changeset 355d8950f574 by Serhiy Storchaka in branch 'default':
New changeset 11bc9f669015 by R David Murray in branch 'default':