New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Windows newlines part of last column #87
Comments
I'm having trouble following the code flow but it you point me to the general file/files to manipulate I'd be happy to see if I can patch this myself. |
@tonyfischetti unfortunately I don't think is a simple fix - the way the tokenizer is currently written would require some new states (STATE_NL, STATE_CR, I think) to do it correctly. (I think this is why people often write a lexer, then a tokenizer). |
Yeah, I was looking into it just now and that's what it seemed like :( |
Wow |
+1 this is an issue for us since quite a bit of the data we get come in Excel spreadsheets... |
Just came across this problem as well. |
.. and the final column name has '"' postpended |
Hi,
When I use read_csv on a file with dos/windows line endings a carriage return "\r" becomes a part of the last column.
To replicate: I made a csv that looks like this
first,second,
1,2
3,4
then I used
unix2dox
to convert to dos line endings.The text was updated successfully, but these errors were encountered: