-
Notifications
You must be signed in to change notification settings - Fork 28
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
write back into text format after reading a file stored into HDFS and running rhwatch in rhipe #42
Comments
I'll respond to this tomorrow. On Mon, May 9, 2016 at 9:02 PM, Indranil Gayen notifications@github.com
|
Hello, It is true that RHIPE doesn't have too many output / inputformats, but it If you load RHIPE and type rhoptions$ioformats you'll see the options
An example of Text output y <- rhwatch(map=function(a,b){ This converts an input file where 'a' is character vector and 'b' is
Also present in the package is HBase input (not sure if output works) But for that to work you'll need this JAR file which translates Hbase to RHIPE io formats are fairly pluggable i.e. you can write your own. HTH On Mon, May 9, 2016 at 9:27 PM, Saptarshi Guha saptarshi.guha@gmail.com
|
I was trying out rhipe and RHadoop [rmr rhdfs rhbase etc.] series of packages.
Now in both of the packages [rhipe and rmr] I can ingest / read the data stored into csv or text file. Both of them kind of supports creation of new file formats but I find rmr has more support for it or at least more resources to get started. Well, this requirement will be useful when one plans to perform few data processing on raw data stored in HDFS and finally want to store it back to HDFS in a format recognizable by other components of Hadoop like Hive Impala etc. Both of the packages can write in their native format recognizable by the package only. The package rmr supports few other formats.
For reference related to rmr have a look into this page.
However for rhipe I did not get any such document and I tried various ways it failed.
So my question is how can I write back into text [as for example, other recognizable format will also work] after reading a file stored into HDFS and running rhwatch in rhipe ?
I have asked same question here.
The text was updated successfully, but these errors were encountered: