This was reported by Saar on the RHadoop Google group: https://groups.google.com/d/topic/rhadoop/5yYKZZLSX8U/discussion. It seems to be a consequence of the mapper and reducer writing their output in two different formats, with the reducer expecting the mapper's format as its input. When the combiner is activated, it writes the reducer output format, which the reducer then can't read. This warning is telling:
52 WARN streaming.PipeMapRed: java.io.IOException: wrong key class: class org.apache.hadoop.io.Text is not class org.apache.hadoop.typedbytes.TypedBytesWritable
Investigating.
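To make the suspected mechanism concrete, here is a toy model in Python. It is only a sketch of the hypothesis above, not Hadoop or rmr code: the names `mapper`, `reduce_stage`, `run` and `WrongKeyClass` are hypothetical, and the format tags simply mirror the two classes named in the warning.

```python
from collections import defaultdict

# Format tags standing in for TypedBytesWritable and Text (illustrative only).
TYPEDBYTES, TEXT = "typedbytes", "text"


class WrongKeyClass(Exception):
    """Raised when a stage receives records in a format it does not expect."""


def mapper(words):
    # The mapper writes its output in the typedbytes format.
    return [(TYPEDBYTES, w, 1) for w in words]


def reduce_stage(records, reads, writes):
    # Sum values per key, checking the input format and tagging the output format.
    totals = defaultdict(int)
    for fmt, key, value in records:
        if fmt != reads:
            raise WrongKeyClass(f"wrong key class: {fmt} is not {reads}")
        totals[key] += value
    return [(writes, k, v) for k, v in sorted(totals.items())]


def run(words, use_combiner):
    records = mapper(words)
    if use_combiner:
        # Assumption under test: the combiner writes the reducer *output* format.
        records = reduce_stage(records, reads=TYPEDBYTES, writes=TEXT)
    # The reducer still expects the mapper's format as input.
    return reduce_stage(records, reads=TYPEDBYTES, writes=TEXT)
```

In this model, `run(words, use_combiner=False)` completes, while `run(words, use_combiner=True)` raises `WrongKeyClass`, because the combiner has already converted the records to the format the reducer writes rather than the one it reads.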
I entered a comment at https://issues.apache.org/jira/browse/HADOOP-1722?focusedCommentId=13404201#comment-13404201, since that is where binary formats for Hadoop streaming were introduced. I suspect they did not foresee the use of streaming combiners, which were added much later with HADOOP-4842; I added a comment there too. I am trying to understand what the intent was when both binary formats and streaming combiners were added to Hadoop.