Enabling conversion options breaks encoding (Linux, 0.1.9, x64) #37

miniworm · 2015-11-26T17:45:13Z

Mostly I download subtitles in win1250, but sometimes files are in UTF-8, so I want to be sure that the final subtitles are always in win1250. To do that, I enabled the conversion options and selected conversion to win1250. But every subtitle comes out of that conversion as gibberish. enca (a Linux tool for checking a file's charset) says the final file is in UTF-8, but it isn't. It's in some strange encoding.

krzemin · 2015-11-26T18:23:41Z

Please check version 0.2.0. It includes improvements to file encoding detection.

miniworm · 2015-11-26T19:27:45Z

I have a problem checking it; see #41.

miniworm · 2015-11-26T22:32:47Z

There are still a lot of problems with the conversion options on 0.2.0 and on master from git. The only option that works all the time is "Nie dodawaj informacji o QNAPI" ("Do not add information about QNAPI"). Charset conversion works only if I manually select both the input and the output charset. Auto-detection doesn't work. The same goes for file type conversion.
When I say that it doesn't work, I mean there is no final file on disk. Just like in #41, I'm asked to choose a subtitle file, but the file disappears during conversion.

Nucleoprotein · 2015-12-14T17:16:05Z

Reproduced the encoding problem on Windows too.
When downloading an SRT subtitle, the original file is UTF-8 with BOM and the converted file is UTF-8 without BOM. The encoding of some characters is broken, and the file also has garbage (bytes C4 8F C2 BB C5 BC) at the start, so it does not load in MPC-HC. Files here: http://www41.zippyshare.com/v/u68GiQjt/file.html

EDIT: I used charset auto-detection.
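
The garbage bytes reported above are consistent with one specific failure: the UTF-8 byte order mark (EF BB BF) being read as windows-1250 and then written back out as UTF-8. The Python sketch below reproduces that byte sequence. It only illustrates the suspected mechanism and is not taken from the QNapi code.

```python
import codecs

# UTF-8 byte order mark: EF BB BF
bom = codecs.BOM_UTF8

# Assumed failure mode: the BOM is misread as windows-1250 text
# and the resulting characters are re-encoded as UTF-8.
misread = bom.decode("cp1250")
garbage = misread.encode("utf-8")

print(" ".join(f"{b:02X}" for b in garbage))  # C4 8F C2 BB C5 BC
```

If this is what happens, the rest of the file would be double-encoded in the same way, which would also explain the broken characters further down.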

krzemin · 2015-12-14T18:06:47Z

@Tapcio Thank you for the files. Auto-detection of file encoding was broken for non-Polish subtitles. The bugfix will be available in version 0.2.1.
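
For readers who want to work around the problem outside QNapi, here is a minimal sketch of language-independent UTF-8 detection followed by conversion. The function names `detect_subtitle_encoding` and `convert_subtitles` are hypothetical, and the fallback to windows-1250 for anything that is not valid UTF-8 is an assumption. This is not the fix that went into 0.2.1.

```python
import codecs


def detect_subtitle_encoding(data: bytes, fallback: str = "cp1250") -> str:
    """Guess the encoding of raw subtitle bytes (hypothetical helper)."""
    if data.startswith(codecs.BOM_UTF8):
        # "utf-8-sig" strips the BOM while decoding.
        return "utf-8-sig"
    try:
        # A strict decode succeeds only for well-formed UTF-8,
        # regardless of the language of the text.
        data.decode("utf-8")
        return "utf-8"
    except UnicodeDecodeError:
        # Assumption: anything else is treated as the fallback charset.
        return fallback


def convert_subtitles(data: bytes, target: str = "cp1250") -> bytes:
    """Decode with the detected encoding and re-encode to the target."""
    text = data.decode(detect_subtitle_encoding(data))
    # Characters missing from the target charset become "?".
    return text.encode(target, errors="replace")
```

For example, `convert_subtitles(open("movie.srt", "rb").read())` would return windows-1250 bytes whether the input was UTF-8 with BOM, UTF-8 without BOM, or already windows-1250 (the file name is only illustrative).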

miniworm changed the title ~~Enabling convertion options breaks encoding (Linux, 0.1.9, x64)~~ Enabling conversion options breaks encoding (Linux, 0.1.9, x64) Nov 26, 2015

miniworm mentioned this issue Nov 27, 2015

Subtitle downloading problem (Linux, 0.2.0, x64) #41

Closed

krzemin added the bug label Dec 14, 2015

krzemin added this to the 0.2.1 milestone Dec 14, 2015

krzemin self-assigned this Dec 14, 2015

krzemin added a commit that referenced this issue Dec 14, 2015

fixed utf-8 encoding detection for non-polish files #37

a2697c5

krzemin closed this as completed Dec 14, 2015
