Encoding error when parsing UTF-8 CSV in (1.7.3 - 1.9 mode) #563

Closed
plexus opened this Issue Mar 6, 2013 · 3 comments

Projects

None yet

2 participants

@plexus
plexus commented Mar 6, 2013

See this test case for much more details.

https://gist.github.com/arnebrasseur/6a2b1c11d90cc321d334

Parsing the included CSV raises an encoding error, but parsing any slice of a handful of lines works fine. The test passes on MRI and in 1.8 mode.

@lucasallan
Member

@plexus the link you provided is broken.

@plexus
plexus commented Jun 20, 2013

Sorry for that, username change. This link should work https://gist.github.com/plexus/6a2b1c11d90cc321d334

@lucasallan
Member

Cool, thanks. I will take a look.

@headius headius added a commit that closed this issue Jun 22, 2013
@headius headius Add incomplete-character smarts to StringIO#gets + other tweaks.
* Add incomplete character logic to StringIO#gets.
* Move incomplete character logic to common place for
  StringIO#gets and GZipReader#gets.
* Untag passing StringIO#gets test.
* Fixes #563.
f0b9edd
@headius headius closed this in f0b9edd Jun 22, 2013
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment