Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fread fails to parse quotation mark as first character of field #1256

Closed
RuinedSubmartingale opened this issue Aug 10, 2015 · 2 comments
Closed
Assignees
Milestone

Comments

@RuinedSubmartingale
Copy link

Consider the following files:
file1.csv

x,y
x1,"oops" y1

file2.csv

x,y
x1, "oops" y1

file3.csv

x,y
"oops" x1,y1

file4.csv

x,y
 "oops" x1,y1

Running fread('file2.csv') and fread('file4.csv') run smoothly. But fread('file1.csv') throws the following error:

Field 2 on line 2 starts with quote (") but then has a problem. It can contain balanced unescaped quoted subregions but if it does it can't contain embedded \n as well. Check for unbalanced unescaped quotes: "oops" y1

And fread('file3.csv') throws a similar error:
Error in fread("testfile.csv") :

Field 1 on line 2 starts with quote (") but then has a problem. It can contain balanced unescaped quoted subregions but if it does it can't contain embedded \n as well. Check for unbalanced unescaped quotes: "oops" x1,y1

The only difference between the two pairs of files is that file1 and file3 have fields that begin with a " character, whereas there's a space before the " in the fields for file2 and file4. Recent changes to fread seem to have caused this error in v1.9.5, seeing as how v1.9.4 is able to parse all 4 files smoothly.

@RuinedSubmartingale RuinedSubmartingale changed the title fread fails to parse quote immediately after separator fread fails to parse quotation mark as first character of field Aug 10, 2015
@arunsrinivasan arunsrinivasan self-assigned this Sep 7, 2015
@arunsrinivasan arunsrinivasan added this to the v1.9.6 milestone Sep 7, 2015
@arunsrinivasan
Copy link
Member

Thanks for the report. quote = "" is implemented. Either use that, or create your file with "" around character columns so that quote = "\"" can then read it properly. See ?fread quote argument.

@arunsrinivasan
Copy link
Member

Removed quote argument with a better fix (slightly more robust) for quotes. Please upgrade and test.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants