fread's nrow argument could accept -ve values to skip last 'n' rows #1643

arunsrinivasan · 2016-04-11T21:04:17Z

could be useful for this post: http://stackoverflow.com/q/36558437/559784

fread(file, nrow=-3) # could skip last 3 lines, for example.

MichaelChirico · 2016-04-11T21:15:38Z

Note that this would break the current default of -1L.

Other than that, it seems like it shouldn't be too hard to implement, maybe one or two extra lines in this branch of fread.c:

https://github.com/Rdatatable/data.table/blob/master/src/fread.c#L956-L1018

franknarf1 · 2016-06-30T19:41:28Z

Yeah, I could use this. I'm currently reading in csvs that often have an incomplete last line (not enough fields as inferred from commas), which reliably causes fread to crash R.

It would be nice to set skip.last=1L to avoid this. Because nrow already allows a negative value as Michael mentioned, I think it would be cleaner as a separate arg or allowing the skip arg to have a length of two (with the second component of the vector taking on this role when present).

jangorecki · 2016-06-30T20:07:55Z

I would prefer the way mentioned by Arun, as it would be consistent to linux head and tail way of handling negative values. If negative skip is currently being used, and cannot be easily changed, then it make sense to allow skip of length two, so skip=c(0, 1) would skip just the last line.
Just for completeness current workaround: fread("head -n -1 filename.csv")

lanceculnane · 2018-08-06T18:27:46Z

On a related note, it would be nice if we could pass a list of indicies (also as part of the 'skip' parameter) to explicitly read in the rows you want, like we can do in python's Pandas. If the list of indicies is random, it is a nice way to create a random sample of a data frame which is too large to be read onto a local machine, for instance.

MichaelChirico · 2018-08-06T18:41:29Z

@lanceculnane see also #583

arunsrinivasan added enhancement fread labels Apr 11, 2016

st-pasha mentioned this issue Jul 6, 2017

Master task for fread bugs / proposals #2247

Closed

MichaelChirico mentioned this issue Oct 26, 2018

cross-platform version of fread("tail -1 file")? #3128

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fread's nrow argument could accept -ve values to skip last 'n' rows #1643

fread's nrow argument could accept -ve values to skip last 'n' rows #1643

arunsrinivasan commented Apr 11, 2016

MichaelChirico commented Apr 11, 2016

franknarf1 commented Jun 30, 2016 •

edited

jangorecki commented Jun 30, 2016 •

edited

lanceculnane commented Aug 6, 2018

MichaelChirico commented Aug 6, 2018

fread's nrow argument could accept -ve values to skip last 'n' rows #1643

fread's nrow argument could accept -ve values to skip last 'n' rows #1643

Comments

arunsrinivasan commented Apr 11, 2016

MichaelChirico commented Apr 11, 2016

franknarf1 commented Jun 30, 2016 • edited

jangorecki commented Jun 30, 2016 • edited

lanceculnane commented Aug 6, 2018

MichaelChirico commented Aug 6, 2018

franknarf1 commented Jun 30, 2016 •

edited

jangorecki commented Jun 30, 2016 •

edited