In non-English books, and the title in the toc garbled #42

chappyhome · 2013-09-06T02:54:43Z

Screenshot below:

chappyhome · 2013-09-06T04:51:35Z

I tested again after only online unpack epub a situation, as already unzipped does not happen.

fchasen · 2013-09-09T05:19:56Z

Yikes - what language is this occurring with?

DokaMax · 2013-10-11T12:42:45Z

Also have this problem on russian books.
So far found reason here:

Used encoding in:

unarchiver.js

EPUBJS.Unarchiver.prototype.getText = function(url){

entry.getText(function(text){
deferred.resolve(text);
}, null, null, 'ISO-8859-1'

after change to UTF-8 - works fine for me.

BTW - this connected to page title also.

X-Ryl669 · 2014-01-07T22:24:50Z

I second that. The page is claiming UTF-8 encoding, yet it's giving ISO-8859-1 text. The fix is easy

fchasen · 2014-01-09T00:34:23Z

Great - I've added parsing the encoding from the opf file and defaulted to utf-8 in v0.1.8

If you happen to have a book that wasn't working before and is public domain, please submit a pull request to https://github.com/futurepress/books so I test with it.

Thanks

fchasen closed this as completed Jan 9, 2014

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

In non-English books, and the title in the toc garbled #42

In non-English books, and the title in the toc garbled #42

chappyhome commented Sep 6, 2013

chappyhome commented Sep 6, 2013

fchasen commented Sep 9, 2013

DokaMax commented Oct 11, 2013

X-Ryl669 commented Jan 7, 2014

fchasen commented Jan 9, 2014

In non-English books, and the title in the toc garbled #42

In non-English books, and the title in the toc garbled #42

Comments

chappyhome commented Sep 6, 2013

chappyhome commented Sep 6, 2013

fchasen commented Sep 9, 2013

DokaMax commented Oct 11, 2013

X-Ryl669 commented Jan 7, 2014

fchasen commented Jan 9, 2014