text output and combiner don't work together #113

piccolbo · 2012-06-29T19:55:33Z

It was reported by Saar on the RHadoop Google group https://groups.google.com/d/topic/rhadoop/5yYKZZLSX8U/discussion. It seems a consequence of mapper and reducer writing out in two different formats, and the reducer expecting the one from the mapper. When the combiner is activated, it outputs the reducer output format which the reducer then can't read. This warning is telling:

52 WARN streaming.PipeMapRed: java.io.IOException: wrong key class: class org.apache.hadoop.io.Text is not class org.apache.hadoop.typedbytes.TypedBytesWritable

Investigating.

piccolbo · 2012-06-29T20:00:01Z

Repro with

from.dfs(mapreduce(to.dfs(1:10), combine = T, reduce = function(k,vv) keyval(NULL,sum(unlist(vv))), output.format="csv"), format="csv")

piccolbo · 2012-06-29T20:54:29Z

Entered a comment here: https://issues.apache.org/jira/browse/HADOOP-1722?focusedCommentId=13404201#comment-13404201 as this is where binary formats for hadoop streaming were introduced and I suspect they did not foresee the use of streaming combiners, added way later with HADOOP-4842. I added a comment there too. I am trying to understand what the intent was when both binary formats and streaming combiners were added to hadoop.

piccolbo · 2012-12-06T20:55:38Z

Updated test case to

from.dfs(mapreduce(to.dfs(1:10), combine = T, map = function(k,v) keyval(1,v),  reduce = function(k,vv) keyval(1,sum(unlist(vv))), output.format="csv"), format="csv")

piccolbo · 2013-03-05T22:56:33Z

this is now RevolutionAnalytics/rmr2#16

piccolbo mentioned this issue Mar 5, 2013

text output and combiner don't work together RevolutionAnalytics/rmr2#16

Open

piccolbo closed this as completed Mar 5, 2013

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

text output and combiner don't work together #113

text output and combiner don't work together #113

piccolbo commented Jun 29, 2012

piccolbo commented Jun 29, 2012

piccolbo commented Jun 29, 2012

piccolbo commented Dec 6, 2012

piccolbo commented Mar 5, 2013

text output and combiner don't work together #113

text output and combiner don't work together #113

Comments

piccolbo commented Jun 29, 2012

piccolbo commented Jun 29, 2012

piccolbo commented Jun 29, 2012

piccolbo commented Dec 6, 2012

piccolbo commented Mar 5, 2013