Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Already on GitHub? Sign in to your account

String#sanitize won't work with encoding other than #<Encoding:ASCII-8BIT> #76

Closed
skalee opened this Issue Dec 14, 2011 · 4 comments

Comments

Projects
None yet
5 participants

skalee commented Dec 14, 2011

To reproduce:

wiki =  Feedzirra::Feed.fetch_and_parse "https://pl.wikipedia.org/w/index.php?title=Specjalna:Ostatnie_zmiany&feed=atom"

wiki.title
=> "Wikipedia  - Ostatnie zmiany [pl]"

wiki.title.sanitize
output error : unknown encoding ASCII-8BIT
=> ""

wiki.title.encode(Encoding::UTF_8).sanitize
=> "Wikipedia  - Ostatnie zmiany [pl]"

Perhaps #sanitize should ensure that is working with UTF-8 (or should be fixed to work with other encodings).

lumpidu commented Dec 27, 2011

+1

also for sanitize_entries!

Since changing to ruby-1.9.3-p0 I get a load of those errors

ippa commented Dec 27, 2011

Same here..ended up removing all sanitize_entries!-lines

jup :(

Contributor

ezkl commented May 21, 2012

Thanks for the report. I've confirmed that this continues to be an issue with the master branch and added it to the list of bugs to tackle for v0.2.0 release.

@ezkl ezkl closed this in 044c313 Jun 4, 2012

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment