Join GitHub today
GitHub is home to over 20 million developers working together to host and review code, manage projects, and build software together.
skipfooter doesn't really "skip" in read_csv #13879
Comments
jreback
added API Design IO CSV
labels
Aug 2, 2016
jreback
added this to the
Next Major Release
milestone
Aug 2, 2016
gfyoung
added a commit
to gfyoung/pandas
that referenced
this issue
Nov 26, 2016
|
|
gfyoung |
8ef5aa5
|
gfyoung
referenced
this issue
Nov 26, 2016
Merged
BUG: Improve error message for skipfooter malformed rows in Python engine #14749
|
If this feature would be implemented in the C engine, I would expect it to work in this case, so that the skipped lines need not to parse correctly. But I am not sure if this is actually possible? Questions on how to treat quotations marks (are they parsed or not to determine the number of lines to skip .. ?) similar as those recent issues about skiprows will also come up. So for this to be consistent, they maybe need to get parsed to some extent? |
|
@jorisvandenbossche : You are correct. This code should not break, though whether it's possible is another story, as some parsing might be needed. But in any case, not sure yet how to implement for the C engine, though that can be dealt with separately from this issue. |
gfyoung
added a commit
to gfyoung/pandas
that referenced
this issue
Nov 28, 2016
|
|
gfyoung |
8bcfb77
|
gfyoung
added a commit
to gfyoung/pandas
that referenced
this issue
Nov 28, 2016
|
|
gfyoung |
9b1d065
|
gfyoung
added a commit
to gfyoung/pandas
that referenced
this issue
Nov 29, 2016
|
|
gfyoung |
8aae4fe
|
jorisvandenbossche
closed this
in #14749
Nov 29, 2016
jorisvandenbossche
added a commit
that referenced
this issue
Nov 29, 2016
|
|
gfyoung + jorisvandenbossche |
dfeae39
|
jorisvandenbossche
added Error Reporting and removed API Design
labels
Nov 29, 2016
jorisvandenbossche
modified the milestone: 0.19.2, Next Major Release
Nov 29, 2016
jorisvandenbossche
added a commit
that referenced
this issue
Dec 15, 2016
|
|
gfyoung + jorisvandenbossche |
8fda0c9
|
gfyoung commentedAug 2, 2016
•
edited
On
master:If we were truly "skipping" the last row, no error should have been raised. However, this occurs because the data is all parsed in memory first with Python's
csvlibrary.Whether this is intended behaviour or not has implications for the C engine in terms of implementing analogous
skipfooterbehaviour. Or perhaps it has something to do with the fact thaterror_bad_linesanderror_warn_linesparameters not with the Python engine?xref #5232