sample Smaller sample size
ConvertEmailsToSequence.java first commit
README reduced sample size due to folks reporting long run times
SearchEmail.java first commit
WholeFileInputFormat.java
WholeFileInputReader.java first commit
convertsearch.jar first commit

README

To run the sample, take the following steps:
1. Put the sample emails from the data folder into HDFS.
2. Run the Hadoop jobs:
   hadoop jar convertsearch.jar ConvertEmailsToSequence <sample email dir> <output dir>
   hadoop jar convertsearch.jar SearchEmail <sequence file dir> 
3. The sample data contains a small set of .msg files (all copies), and the results in your /tmp dir should be identical to this
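
The steps above could be scripted roughly as follows. This is only a sketch: the local folder name and both HDFS directory names are placeholders chosen for illustration, not paths taken from the project, and the `run` helper is hypothetical. By default the script only prints the commands it would run; set DRY_RUN=0 to execute them against a real cluster.

```shell
#!/bin/sh
# Sketch of the README steps. All paths below are assumptions, adjust them.
LOCAL_DIR="${LOCAL_DIR:-sample}"           # local folder holding the .msg files (assumed)
HDFS_IN="${HDFS_IN:-sample-emails}"        # hypothetical HDFS input dir (must not exist yet)
HDFS_SEQ="${HDFS_SEQ:-sample-sequence}"    # hypothetical HDFS dir for the sequence files

# Hypothetical helper: print the command in dry-run mode (default), else run it.
run() {
  if [ "${DRY_RUN:-1}" = "0" ]; then
    "$@"
  else
    echo "$*"
  fi
}

run_sample() {
  # Step 1: copy the local sample folder into HDFS.
  run hadoop fs -put "$LOCAL_DIR" "$HDFS_IN" || return 1
  # Step 2a: pack the emails into sequence files.
  run hadoop jar convertsearch.jar ConvertEmailsToSequence "$HDFS_IN" "$HDFS_SEQ" || return 1
  # Step 2b: run the search job over the sequence files.
  run hadoop jar convertsearch.jar SearchEmail "$HDFS_SEQ" || return 1
}

run_sample
```

Running the script as-is prints the three `hadoop` commands in order, which makes it easy to check the paths before anything touches the cluster.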