-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
--enable-charset by default #1611
Comments
Sorry, I wasn't clear. They do assume UTF-8 when communicating with the terminal, but they assume ISO-8859-whatever encoding for the content of the file. So it appears in "double utf8" encoding, every accented letter replaced by two symbols.
E.g. The file contains (in UTF-8 encoding): áéõûőű
Is it supposed to work correctly? I'd be more than happy to hear that it's already implemented, it's just something unusual in my environment. Probably I forgot an option to ./configure or to change something in setting? Any idea? I'm eager to figure it out.
Thanks! |
Hah, --enable-charset lets you choose the charset of the file, including "No translation" and UTF-8. Then it works fine.
Without charset support, the default is "no translation" for filenames, but "latin1" for file content. This doesn't sound logical to me. I think the behavior should be "no translation" for file contents too.
Or, alternatively, charset support should be turned on by default.
Nowadays more and more distributions use UTF-8 by default and it's the recommended encoding for everything: filenames, file content etc. Imagine thousands of users downloading and installing mc-4.7 just as I did and figuring out that file contents are not displayed correctly. Imagine tons of stupid bug reports just as this one :) You don't want that, users don't want that either. The default behavior (simplest way of compiling and running mc) should provide proper support for fully UTF-8 systems.
I've got a some other similar philosophical corners, I'll file separate report for them.
Overall, however, mc's forthcoming official UTF-8 support looks super great, HUGE THANKS to everyone involved!!! |
Yes, egmont, you are right. |
branch: 1611_autoconf_enable_charset (parent: master) |
|
|
fixed: [4ed46db6a76905e22c7157ea43a19152636514c2] |
|
Important
This issue was migrated from Trac:
egmont
(@egmontkob)viewer/editor not doing utf-8
mc-4.7.0-pre2 with UTF-8 everywhere. Locale is set to UTF-8, mc's Display bits is UTF-8 too. The main screen is fine.
mc's builtin viewer and editor, though, still use some 8-bit character set, so accents don't appear correctly.
This is a serious regression from 4.6.x+utf8 patches where the viewer and editor had reasonably good UTF-8 support.
The text was updated successfully, but these errors were encountered: