Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

(request) Disable "Autodetect character encoding" by default #3759

rebusx opened this Issue Sep 21, 2017 · 0 comments


None yet
1 participant
Copy link

rebusx commented Sep 21, 2017

Description of the Issue

The "Autodetect encoding" feature clearly works wrong in some cases and corrupts file encoding (940, 2843, 3188, 3222, 3588). The bug causing this may be in "uchardet" library (reported here), but one cannot be sure if it's going to be fixed soon, if ever.
And unfortunately, the bug seems to affect not only special Unicode symbols, but also some combinations of Russian letters\cyrillic characters (see 3588); I use Notepad++ to write in Russian, and I had many files saved in Macintosh encoding instead of UTF-8 (w/o BOM) when the "Autodetect encoding" feature was enabled, which corrupted some characters. There is a chance that this bugged feature may some day completely corrupt the encoding of some large file(s) of some user.
Having said that, and taking into account that this bug is quite old, I suggest to make this feature optional (i.e. not have it enabled by default). Maybe it's worth to make a note in options, that the feature is bugged.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
You can’t perform that action at this time.