support general partitioners and comparators #4

piccolbo · 2011-09-13T00:00:59Z

These are advanced but important Hadoop features, and there is no way to work around their absence. One possible approach is to create Java classes that start an R server and pass it an R expression to evaluate. See JRI.

piccolbo · 2012-07-23T20:18:54Z

Follow this: https://issues.apache.org/jira/browse/MAPREDUCE-614

piccolbo · 2013-01-02T19:46:30Z

The above issue is abandoned. I think we could take a different route, made possible by https://issues.apache.org/jira/browse/HADOOP-5528; see also #129.

The approach would be as follows. Use BinaryPartitioner or a custom-written partitioner to read the key and value, both of type TypedBytesWritable, and convert the key into an integer. Make sure the integer is in the appropriate range, most likely by taking the remainder modulo the number of partitions, and return that remainder. Done. The simplifying assumption here is that the values v carry any data we are interested in, while the key is just the partition number. Given that a variety of complex data structures are allowed for the value, this is unlikely to imply any loss of generality.
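The key-to-partition step could be sketched roughly as below. This is plain Java with no Hadoop dependency, so it only mirrors the logic that a custom partitioner's getPartition method would contain. The class and method names are hypothetical, and the key encoding (a typed-bytes integer: type code 3 followed by a 4-byte big-endian value) is an assumption; rmr may serialize keys differently.

```java
import java.nio.ByteBuffer;

// Hypothetical helper, not part of Hadoop or rmr. It only illustrates the
// "decode the key as an integer, then take the remainder" idea.
final class KeyModPartitioner {
    // Typed-bytes type code for a 32-bit integer (assumed key encoding).
    static final int TYPED_BYTES_INT = 3;

    // Decode a key assumed to be laid out as: 1 type byte (3) + 4-byte big-endian int.
    static int decodeIntKey(byte[] keyBytes) {
        if (keyBytes.length < 5 || keyBytes[0] != TYPED_BYTES_INT) {
            throw new IllegalArgumentException("key is not a typed-bytes integer");
        }
        // ByteBuffer reads big-endian by default; start after the type byte.
        return ByteBuffer.wrap(keyBytes, 1, 4).getInt();
    }

    // Map any integer into [0, numPartitions). floorMod keeps negative keys in range,
    // which the plain % operator would not.
    static int partitionFor(int key, int numPartitions) {
        if (numPartitions <= 0) {
            throw new IllegalArgumentException("numPartitions must be positive");
        }
        return Math.floorMod(key, numPartitions);
    }

    // What a custom partitioner would return for a serialized key.
    static int getPartition(byte[] keyBytes, int numPartitions) {
        return partitionFor(decodeIntKey(keyBytes), numPartitions);
    }
}
```

With this layout, a key of 10 and 4 partitions goes to partition 2, and a negative key such as -1 goes to partition 3 rather than producing an out-of-range result.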

piccolbo · 2013-01-02T20:10:30Z

It should be possible to use BinaryPartitioner. One reason is that it needs the key to be BinaryComparable, and TypedBytesWritable extends BytesWritable, which is a BinaryComparable. The other reason is that the BinaryPartitioner patch was proposed by the author of Dumbo, so it should work for TypedBytesWritable keys. The advantage is that this class is already part of all major recent distros.
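For intuition about what partitioning on a BinaryComparable key involves, here is a simplified stand-in in plain Java: it hashes a sub-range of the key's raw bytes and reduces the hash modulo the number of partitions. This is a sketch in the spirit of BinaryPartitioner, not its source; the class name, the method names, and the convention that negative offsets count from the end of the key are assumptions made for illustration.

```java
// Hypothetical stand-in, not the Hadoop class. It only shows how a partition
// can be derived from a slice of a key's serialized bytes.
final class ByteRangePartitioner {
    // Simple polynomial hash over bytes[offset .. offset + length - 1].
    static int hashBytes(byte[] bytes, int offset, int length) {
        int hash = 1;
        for (int i = offset; i < offset + length; i++) {
            hash = 31 * hash + bytes[i];
        }
        return hash;
    }

    // leftOffset and rightOffset are inclusive; negative values count from the
    // end of the key (assumed convention), so (0, -1) covers the whole key.
    static int getPartition(byte[] key, int leftOffset, int rightOffset, int numPartitions) {
        int length = key.length;
        int left = (leftOffset + length) % length;
        int right = (rightOffset + length) % length;
        int hash = hashBytes(key, left, right - left + 1);
        // Clear the sign bit so the remainder is never negative.
        return (hash & Integer.MAX_VALUE) % numPartitions;
    }
}
```

Note that a hash-based scheme like this spreads keys across partitions but does not let the key choose its partition directly. Starting the range at offset 1 would skip a leading type byte, so two keys that differ only in that byte land in the same partition.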

piccolbo · 2013-03-11T23:37:35Z

This is now RevolutionAnalytics/rmr2#21

piccolbo mentioned this issue Jan 2, 2013

Add support for multiple outputs #130

Closed

piccolbo mentioned this issue Mar 11, 2013

support general partitioners and comparators RevolutionAnalytics/rmr2#21

Open

piccolbo closed this as completed Mar 11, 2013
