Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fread does not always work when csv file contains newlines in quoted fields #810

Closed
vlsi opened this issue Sep 15, 2014 · 2 comments
Closed
Assignees
Labels
Milestone

Comments

@vlsi
Copy link

vlsi commented Sep 15, 2014

Here's input file (sorry, I had to scramble the data, however the error message is pretty much the same): https://gist.github.com/vlsi/34f56202bc8aaf665581

In general it looks like autostart does not work.
Altering sep and/or autostart parameters does not help.

Can you please provide a way to tell fread "I promise this file is comma delimited and doublequote-quoted"?

> fread("data_table_quoted_multiline_fields.csv", autostart=1)
Error in fread("data_table_quoted_multiline_fields.csv", autostart = 1) : 
  Expected sep (',') but new line, EOF (or other non printing character)
ends field 1 on line 3 when detecting types:    ETGIOHCFFB MCTAK/EVPN LFRHC
@mattdowle mattdowle added this to the v1.9.6 milestone Oct 25, 2014
@mattdowle
Copy link
Member

Thanks for the test file on this one. Was really useful and is now included in the test suite.

@vlsi
Copy link
Author

vlsi commented Nov 15, 2014

I've just run into another case of the similar issue and the update of data.table solves it. Thanks.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

3 participants