org.itadaki.bzip2 instead of Apache's bzip2 implementation. Added BZip2BlockOffsetTool for creating/examining .blockOffsets files. Fixes.
Add option to prep tool to allow multiple reducers (for later merging). Added more logging. Hadoop 0.23.10. Small fixes.
pairs to the reducer. More counters in the reducer.
relevant results. Need to talk to Sebastiano about this, but for now disable the last page link to discourage these requests.