Fixed width files #57

aetiologicCanada · 2019-02-01T20:51:21Z

This is an enhancement request, but I can't see how to designate it as such.

disk.frame looks to be wonderfully valuable. Many thanks in advance.

It would be helpful if the csv reading capacity could be extended to fixed-width files, as these files (often in the form of logs, etc) are typically massive.

The readr::read_fwf() is a nice implementation of fwf input, and might be a model for work on something comparable for this package.

Many thanks

xiaodaigh · 2019-02-01T23:35:27Z

Sounds useful. The problem with all of these is that the functions don't naturally allow for chunk-by-chunk reading. I have made a feature request to the chunked package which is the only package I know that does chunk by chunk reading.

xiaodaigh · 2019-02-01T23:41:44Z

@aetiologicCanada can you share a self contained example of a fwf file and how to use readr?

I tried

data(cars)
library(gdata)
write.fwf(cars, "test.fwf")
f = file("test.fwf")
readr::read_fwf("test.fwf", n_max=1)

it doesn't seem to work

aetiologicCanada · 2019-02-05T18:12:14Z

data(cars)
library(gdata) 
library(tidyverse)
library(fs)
f = here::here("test.fwf") 

gdata::write.fwf(cars, f) 
junk <- readr::read_fwf(f, skip = 1, readr::fwf_positions(
  start = c(1,4),
  end   = c(2,6),
  col_names = c("A", "B")
))

xiaodaigh · 2019-08-05T06:47:55Z

Maybe log an issue with readr so they can provide a read_fwf_chunked function like the readr::read_csv_chunked. Once they have that, we can use disk.frame::add_chunk to easily create a disk.frame

xiaodaigh mentioned this issue Feb 1, 2019

Feature request: support for fixed width file? edwindj/chunked#14

Open

xiaodaigh added the enhancement label Feb 1, 2019

aetiologicCanada closed this as completed Feb 5, 2019

aetiologicCanada reopened this Feb 5, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fixed width files #57

Fixed width files #57

aetiologicCanada commented Feb 1, 2019

xiaodaigh commented Feb 1, 2019

xiaodaigh commented Feb 1, 2019

aetiologicCanada commented Feb 5, 2019 •

edited by xiaodaigh

xiaodaigh commented Aug 5, 2019

Fixed width files #57

Fixed width files #57

Comments

aetiologicCanada commented Feb 1, 2019

xiaodaigh commented Feb 1, 2019

xiaodaigh commented Feb 1, 2019

aetiologicCanada commented Feb 5, 2019 • edited by xiaodaigh

xiaodaigh commented Aug 5, 2019

aetiologicCanada commented Feb 5, 2019 •

edited by xiaodaigh