-
Notifications
You must be signed in to change notification settings - Fork 150
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Should use read.csv
instead of read.table
for "csv" format?
#50
Comments
If I can remember I went for read.table because it is more flexible, but I
|
Yeah, csv isn't really a standard and there are wide variations on how people parse "csv" files. One option would be to simply default to |
How is the rmr csv format not consistent with read.table? A new input On Tue, Jun 11, 2013 at 8:14 AM, Jamie F Olson notifications@github.comwrote:
|
I meant that you're currently completely consistent with I'm currently using make.input.format("csv","text",sep=sep,comment.char = comment.char,
colClasses=colClasses,
fill=fill,flush=flush,quote=quote,...) |
And what do you need to do, if anything, in Hive and Pig? On Mon, Jun 17, 2013 at 7:25 AM, Jamie F Olson notifications@github.comwrote:
|
Those parameters should be consistent with the default default format for Jamie Olson On Mon, Jun 17, 2013 at 11:34 AM, Antonio Piccolboni <
|
I am implementing this for 2.3.0 and I was wondering why you added the ... to the make input call. Of course that's not correct R but I was wondering if you meant that I should accept additional arguments. Or more in general, should I make the pig/hive format fixed or are some variations useful? |
I just accepted additional arguments assuming that I'd find additional things I'd want to configure. I think a couple options that might depend on circumstances are |
Since the rmr2 format is referred to as
"csv"
, shouldn't it actually callread.csv
so that it has the expected default parameters? Of particular importance iscomment.char = ""
, which I spent a surprising amount of time debugging before I finally noticed that rmr actually callsread.table
. I think it specifies somewhere in the documentation thatread.table
is being called, but at least I still found it surprising that it's not callingread.csv
.The text was updated successfully, but these errors were encountered: